Home >

Jobs >

Machine Learning Engineer, GenAI Quality

Scale AI

California, United States (On-site)

Machine Learning Engineer, GenAI Quality

4 Months ago • 3 Years + • Quality Assurance • $172,000 PA - $300,000 PA

Job Summary

Job Description

This role focuses on developing ML systems to automate data quality evaluation and generation using large language models. You will build scalable systems to assess quality across accuracy, instruction adherence, factuality, and reasoning — and design robust evaluation frameworks to ensure alignment with human standards. You will be deeply involved in the full lifecycle: from model design and fine-tuning, to prototyping, deployment, and monitoring. You will partner closely with engineering, research, and product teams to deliver cutting-edge solutions for both customers and internal GenAI data engines.

Must have:

3+ years of experience designing, training, and deploying ML models
Strong background in NLP, LLMs, and deep learning frameworks
Experience building microservices and deploying ML pipelines
Practical knowledge of LLM fine-tuning and evaluation
Strong programming skills and a solid foundation in algorithms

Good to have:

Experience with post-training LLM techniques
Familiarity with data evaluation pipelines, dataset curation
Background in multimodal ML or model evaluation

11 skills required

11 skills required for this role

Add these skills to join the top 1% applicants for this job

communication

data-structures

prototyping

aws

pytorch

deep-learning

microservices

python

algorithms

tensorflow

machine-learning

Job Details

About Scale:

Scale’s Generative AI ML team develops models and services to power high-quality data generation and evaluation for the most advanced large language models on earth. We also conduct applied research on model supervision and algorithmic approaches that support frontier models for Scale’s applied-ML teams and the broader AI community. Scale is uniquely positioned at the center of the AI ecosystem as a leading provider of training and evaluation data, end-to-end ML lifecycle solutions, and frontier evaluations for public and private institutions.

About The Role:

This role focuses on developing ML systems to automate data quality evaluation and generation using large language models. You’ll build scalable systems to assess quality across accuracy, instruction adherence, factuality, and reasoning — and design robust evaluation frameworks to ensure alignment with human standards. This is one of the highest impact areas in the company and directly accelerates the development of aligned, performant foundation models.

You’ll be deeply involved in the full lifecycle: from model design and fine-tuning, to prototyping, deployment, and monitoring. You’ll partner closely with engineering, research, and product teams to deliver cutting-edge solutions for both customers and internal GenAI data engines — Scale’s fastest-growing business.

If you’re excited about combining human-machine evaluation, scaling high-quality training data, and shaping the next generation of foundation models, we’d love to hear from you.

You will:

Design, fine-tune, and evaluate large language models for structured quality evaluation and data generation tasks
Develop robust evaluation frameworks to assess performance across accuracy, instruction following, reasoning, and other critical dimensions
Build and maintain scalable ML services to automatically assess and generate high-quality training and evaluation data
Research and apply state-of-the-art techniques in LLM training, post-training alignment (e.g., instruction tuning, RLHF), and tool-augmented reasoning
Collaborate with research scientists, engineers, and product teams to integrate your work into production services used by top AI developers

Ideally you’d have:

3+ years of experience designing, training, and deploying ML models in production environments
Strong background in NLP, LLMs, and deep learning frameworks like PyTorch, TensorFlow, or JAX
Experience building microservices and deploying ML pipelines in cloud environments (e.g., AWS or GCP)
Practical knowledge of LLM fine-tuning and evaluation for tasks like factuality, instruction adherence, and chain-of-thought reasoning
Strong programming skills (e.g., Python) and a solid foundation in algorithms and data structures
Strong communication skills and experience working cross-functionally

Nice to haves:

Experience with post-training LLM techniques (instruction tuning, RLHF, tool use, or agent-based reasoning)
Familiarity with data evaluation pipelines, dataset curation, or scalable annotation workflows
Background in multimodal ML or model evaluation across domains such as code or long-context generation

Similar Jobs

Account Manager

YouGov

Copenhagen, Denmark (On-Site)

• 1 Month ago

Analyst - Finance & Strategy

Opendoor

Chennai, Tamil Nadu, India (Hybrid)

• 2 Months ago

Manager Legal-PML

Paytm

Mumbai, Maharashtra, India (On-Site)

• 2 Months ago

Active Directory Engineer III

Rackspace Technology

Pune, Maharashtra, India (On-Site)

• 2 Months ago

Technical Project Manager

Any Desk

Stuttgart, Baden-Württemberg, Germany (Hybrid)

• 2 Months ago

Test Automation Lead

Capgemini

Bengaluru, Karnataka, India (On-Site)

• 3 Months ago

Intermediate QA Analyst - Affirmative Action for Women

Experian

Blumenau, State Of Santa Catarina, Brazil (On-Site)

• 2 Months ago

Senior Staff System Validation Engineer

Marvell

Santa Clara, California, United States (On-Site)

• 2 Months ago

Junior QA Engineer

Veeam Software

Lisbon, Lisbon, Portugal (On-Site)

• 2 Months ago

Lead QA Engineer

hogarth

Sunnyvale, California, United States (Hybrid)

• 3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Senior VFX Artist

Gunzilla

Kyiv, Kyiv City, Ukraine (On-Site)

• 4 Months ago

Senior Manager, Financial Planning & Analysis

Moloco

Redwood City, California, United States (On-Site)

• 1 Month ago

Senior Writer, Apple Ads

Apple

Culver City, California, United States (On-Site)

• 2 Months ago

Senior Contract Manager

gitlab

United States (Remote)

• 2 Months ago

Applications and Support Engineer - Process Burners

Zeeco, Inc.

Stamford, England, United Kingdom (On-Site)

• 11 Months ago

Solutions Architect

Razer

Singapore (On-Site)

• 11 Months ago

Media Bill Pay Technician

Dentsu

Montreal, Quebec, Canada (On-Site)

• 3 Months ago

Senior/Staff Web Engineer

Flow

Palo Alto, California, United States (Hybrid)

• 10 Months ago

GTM Leader (F&A & BFSI)

Zamp

Bengaluru, Karnataka, India (Hybrid)

• 5 Months ago

Senior Technical Operations Engineer

zoox

Foster City, California, United States (On-Site)

• 2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, CA, United States

Cloud Infrastructure Software Developer

Apple

Seattle, Washington, United States (On-Site)

• 3 Months ago

Product Designer, Assistant

Glean

Palo Alto, California, United States (On-Site)

• 10 Months ago

Software Engineering Manager, Test Software

Apple

San Diego, California, United States (On-Site)

• 3 Months ago

Business Development Representative

Dave Ramsey

Franklin, Tennessee, United States (On-Site)

• 1 Month ago

Senior Software Engineer - Desktop Platform

Discord

San Francisco, California, United States (Remote)

• 4 Months ago

Service Inside Account Manager

extreme network

Salem, New Hampshire, United States (Remote)

• 2 Months ago

8 Month Computer Vision/Machine Learning Intern

Safari AI

New York, United States (On-Site)

• 2 Months ago

Software Engineer - Cloud Foundation

Adobe

Mountain View, California, United States (On-Site)

• 2 Months ago

Account Director - Partnerships

160over90

New York, New York, United States (On-Site)

• 4 Months ago

Privacy

Microsoft

United States (On-Site)

• 2 Months ago

Get notifed when new similar jobs are uploaded

Quality Assurance Jobs

UAS Test Pilot

FlockSafety

Lafayette, Indiana, United States (On-Site)

• 2 Months ago

Wireless RF OTA MIMO Validation Engineer

Apple

Cupertino, California, United States (On-Site)

• 3 Months ago

Quality Engineer II

Nordson Corporation

Allen, Texas, United States (On-Site)

• 3 Months ago

QA Specialist (Disabled)

Spyke Games

İstanbul, Türkiye (On-Site)

• 10 Months ago

Test Automation Lead

Accenture

Bengaluru, Karnataka, India (On-Site)

• 4 Months ago

QA Specialist

Playgendary

Limassol, Limassol, Cyprus (Remote)

• 6 Months ago

AIML - Staff Machine Learning Engineer, Siri Search Quality

Apple

Cupertino, California, United States (On-Site)

• 3 Months ago

Principal Quality Engineer – Engineering Excellence

Trellix

Bengaluru, Karnataka, India (On-Site)

• 3 Months ago

Senior QA Technician

creative assembly

Horsham, England, United Kingdom (Hybrid)

• 1 Month ago

Test & Validation Trainee

Valeo

Bobigny, Île-de-France, France (On-Site)

• 3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Scale AI

58 Active Jobs

Get notified when new jobs are added by Scale AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

A global community of game builders. Helping people upskill and land jobs in the best gaming studios.

Company

Key Links

hello@outscal.com

Made in INDIA 💛💙

Machine Learning Engineer, GenAI Quality

Job Summary

Job Description

11 skills required

11 skills required for this role

Job Details

Similar Jobs

Account Manager

Analyst - Finance & Strategy

Manager Legal-PML

Active Directory Engineer III

Technical Project Manager

Test Automation Lead

Intermediate QA Analyst - Affirmative Action for Women

Senior Staff System Validation Engineer

Junior QA Engineer

Lead QA Engineer

Similar Skill Jobs

Senior VFX Artist

Senior Manager, Financial Planning & Analysis

Senior Writer, Apple Ads

Senior Contract Manager

Applications and Support Engineer - Process Burners

Solutions Architect

Media Bill Pay Technician

Senior/Staff Web Engineer

GTM Leader (F&A & BFSI)

Senior Technical Operations Engineer

Jobs in San Francisco, CA, United States

Cloud Infrastructure Software Developer

Product Designer, Assistant

Software Engineering Manager, Test Software

Business Development Representative

Senior Software Engineer - Desktop Platform

Service Inside Account Manager

8 Month Computer Vision/Machine Learning Intern

Software Engineer - Cloud Foundation

Account Director - Partnerships

Privacy

Quality Assurance Jobs

UAS Test Pilot

Wireless RF OTA MIMO Validation Engineer

Quality Engineer II

QA Specialist (Disabled)

Test Automation Lead

QA Specialist

AIML - Staff Machine Learning Engineer, Siri Search Quality

Principal Quality Engineer – Engineering Excellence

Senior QA Technician

Test & Validation Trainee

About The Company

Head of Evaluation and Oversight Research

Tech Lead Manager, Machine Learning Research Scientist- LLM Evals

Machine Learning Research Scientist / Engineer, Reasoning

AI Strategic Projects Lead, International Public Sector

AGC, Commercial

Machine Learning Research Scientist / Research Engineer, Post-Training

Machine Learning Engineer, Enterprise

Finance Manager, Enterprise BU

Solutions Engineer - Robotics

Machine Learning Research Engineer - Robotics

Level Up Your Career in Game Development!