Machine Learning Research Scientist / Research Engineer, Post-Training

1 Day ago • All levels • $220,000 PA - $325,000 PA

Job Summary

Job Description

Scale collaborates with leading AI labs to provide high-quality data, accelerating advancements in GenAI research. The role focuses on optimizing data curation and evaluation to enhance LLM capabilities across text and multimodal modalities. The responsibilities include researching and developing novel post-training techniques like SFT, RLHF, and reward modeling, designing new approaches to preference optimization, and analyzing model behavior to identify weaknesses and propose solutions for bias mitigation and model robustness. This role involves collaborating with researchers and engineers to establish best practices in data-driven AI development and providing technical and strategic input to top foundation model labs developing next-generation generative AI models.
Must have:
  • Ph.D. or Master's in CS, Machine Learning, AI, or related fields.
  • Deep understanding of deep learning, reinforcement learning, and model fine-tuning.
  • Experience with post-training techniques like RLHF or instruction tuning.
  • Excellent written and verbal communication skills.
Good to have:
  • Published research at major AI conferences or journals.
  • Previous experience in a customer-facing role.

Job Details

Scale works with the industry’s leading AI labs to provide high-quality data and accelerate progress in GenAI research. We are looking for Research Scientists and Research Engineers with expertise in LLM post-training (SFT, RLHF, reward modeling). This role will focus on optimizing data curation and evaluation to enhance LLM capabilities in both text and multimodal modalities.

In this role, you will develop novel methods to improve the alignment and generalization of large-scale generative models. You will collaborate with researchers and engineers to define best practices in data-driven AI development. You will also partner with top foundation model labs to provide both technical and strategic input on the development of the next generation of generative AI models.

You will:

  • Research and develop novel post-training techniques, including SFT, RLHF, and reward modeling, to enhance LLM core capabilities in both text and multimodal modalities.
  • Design and experiment with new approaches to preference optimization.
  • Analyze model behavior, identify weaknesses, and propose solutions for bias mitigation and model robustness.
  • Publish research findings in top-tier AI conferences.

Ideally you’d have:

  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or a related field.
  • Deep understanding of deep learning, reinforcement learning, and large-scale model fine-tuning.
  • Experience with post-training techniques such as RLHF, preference modeling, or instruction tuning.
  • Excellent written and verbal communication skills.
  • Published research in areas of machine learning at major conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, etc.) and/or journals.
  • Previous experience in a customer-facing role.





