Machine Learning Engineer, GenAI Quality

Posted 1 month ago • 3+ years of experience • $172,000 - $300,000 per annum

Job Summary

Job Description

This role focuses on developing ML systems to automate data quality evaluation and generation using large language models. You will build scalable systems to assess quality across accuracy, instruction adherence, factuality, and reasoning — and design robust evaluation frameworks to ensure alignment with human standards. You will be deeply involved in the full lifecycle: from model design and fine-tuning, to prototyping, deployment, and monitoring. You will partner closely with engineering, research, and product teams to deliver cutting-edge solutions for both customers and internal GenAI data engines.
Must have:
  • 3+ years of experience designing, training, and deploying ML models
  • Strong background in NLP, LLMs, and deep learning frameworks
  • Experience building microservices and deploying ML pipelines
  • Practical knowledge of LLM fine-tuning and evaluation
  • Strong programming skills and a solid foundation in algorithms
Good to have:
  • Experience with post-training LLM techniques
  • Familiarity with data evaluation pipelines or dataset curation
  • Background in multimodal ML or model evaluation

Job Details

About Scale:

Scale’s Generative AI ML team develops models and services to power high-quality data generation and evaluation for the most advanced large language models on earth. We also conduct applied research on model supervision and algorithmic approaches that support frontier models for Scale’s applied-ML teams and the broader AI community. Scale is uniquely positioned at the center of the AI ecosystem as a leading provider of training and evaluation data, end-to-end ML lifecycle solutions, and frontier evaluations for public and private institutions.

About The Role:

This role focuses on developing ML systems to automate data quality evaluation and generation using large language models. You’ll build scalable systems to assess quality across accuracy, instruction adherence, factuality, and reasoning — and design robust evaluation frameworks to ensure alignment with human standards. This is one of the highest impact areas in the company and directly accelerates the development of aligned, performant foundation models.

You’ll be deeply involved in the full lifecycle: from model design and fine-tuning, to prototyping, deployment, and monitoring. You’ll partner closely with engineering, research, and product teams to deliver cutting-edge solutions for both customers and internal GenAI data engines — Scale’s fastest-growing business.

If you’re excited about combining human and machine evaluation, scaling high-quality training data, and shaping the next generation of foundation models, we’d love to hear from you.

You will:

  • Design, fine-tune, and evaluate large language models for structured quality evaluation and data generation tasks
  • Develop robust evaluation frameworks to assess performance across accuracy, instruction following, reasoning, and other critical dimensions
  • Build and maintain scalable ML services to automatically assess and generate high-quality training and evaluation data
  • Research and apply state-of-the-art techniques in LLM training, post-training alignment (e.g., instruction tuning, RLHF), and tool-augmented reasoning
  • Collaborate with research scientists, engineers, and product teams to integrate your work into production services used by top AI developers

Ideally you’d have:

  • 3+ years of experience designing, training, and deploying ML models in production environments
  • Strong background in NLP, LLMs, and deep learning frameworks like PyTorch, TensorFlow, or JAX
  • Experience building microservices and deploying ML pipelines in cloud environments (e.g., AWS or GCP)
  • Practical knowledge of LLM fine-tuning and evaluation for tasks like factuality, instruction adherence, and chain-of-thought reasoning
  • Strong programming skills (e.g., Python) and a solid foundation in algorithms and data structures
  • Strong communication skills and experience working cross-functionally

Nice to haves:

  • Experience with post-training LLM techniques (instruction tuning, RLHF, tool use, or agent-based reasoning)
  • Familiarity with data evaluation pipelines, dataset curation, or scalable annotation workflows
  • Background in multimodal ML or model evaluation across domains such as code or long-context generation
