Research Engineer - Posttraining

4 Hours ago • All levels
Research Development

Job Description

Periodic Labs is an AI + physical sciences lab focused on building state-of-the-art models for scientific discovery. The company is well-funded and rapidly expanding, fostering a culture where team members are empowered to solve problems without bureaucracy. In this role, you will post-train frontier models to autonomously execute various stages of the scientific discovery pipeline, including generating hypotheses, designing experiments for actual labs, and operating scientific equipment. You will collaborate with leading physical science experts to develop high-quality evaluation and training tasks, scale RL environments, design reward functions, and conduct large-scale RL runs to automate scientific discovery.
Good To Have:
  • Creating and scaling RL environments for LLMs
  • Creating high-quality evals for frontier models
  • Working closely with domain experts to define evaluation criteria, tools, and environments for agents
  • Carefully crafting training datasets and reward functions, with LLMs and/or human trainers
  • Training frontier LLMs with RL

Add these skills to join the top 1% applicants for this job

game-texts

About Periodic Labs

We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who identify and solve problems without boundaries or bureaucracy. We eagerly learn new tools and new science to push forward our mission.

About the role

In this role, you will post-train frontier models to autonomously run various parts of the scientific discovery pipeline. Models you train will generate hypotheses, design experiments that run in an actual lab, operate sophisticated scientific equipment, and more. You will work with the world’s leading experts in the physical sciences in order to create high-quality evaluation and training tasks, scale up RL environments, design creative reward functions, and run large-scale RL runs, all in service of automating scientific discovery.

You might thrive in this role if you have experience:

  • Creating and scaling RL environments for LLMs
  • Creating high-quality evals for frontier models
  • Working closely with domain experts to define evaluation criteria, tools, and environments for agents
  • Carefully crafting training datasets and reward functions, with LLMs and/or human trainers
  • Training frontier LLMs with RL

Set alerts for more jobs like Research Engineer - Posttraining
Set alerts for new jobs by Periodic Labs
Set alerts for new Research Development jobs in United States
Set alerts for new jobs in United States
Set alerts for Research Development (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙