Machine Learning Research Scientist / Engineer

3 Months ago • 3 Years + • Research Development • $220,000 PA - $325,000 PA

Job Summary

Job Description

The Machine Learning Research Scientist / Engineer will be at the forefront of AI research and real-world implementation, with a strong focus on reasoning within large language models (LLMs). The role involves studying critical data types for advancing LLM-based agents, including browser and software engineering (SWE) agents. The candidate will shape Scale’s data strategy by identifying effective data sources and methodologies for improving LLM reasoning and contribute to impactful research on language model reasoning.
Must have:
  • Practical experience working with LLMs and proficiency in relevant frameworks.
  • A track record of published research in top ML and NLP venues.
  • At least three years of experience solving complex ML challenges.
Good to have:
  • Hands-on experience fine-tuning open-source LLMs or leading bespoke LLM fine-tuning projects.
  • Research and practical experience in building applications and evaluations related to LLM-based agents.
  • Experience with agent frameworks such as OpenHands, Swarm, LangGraph, or similar.
  • Familiarity with advanced agentic reasoning techniques such as STaR and PLANSEARCH.
  • Proficiency in cloud-based ML development, with experience in AWS or GCP environments.

Job Details

About Scale

At Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, fueling the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent Series F round, we’re amplifying access to high-quality data to drive progress toward Artificial General Intelligence (AGI). Building on our history of model evaluation with enterprise and government customers, we are expanding our capabilities to set new standards for both public and private evaluations.

About This Role

This role operates at the forefront of AI research and real-world implementation, with a strong focus on reasoning within large language models (LLMs). The ideal candidate will study the data types critical for advancing LLM-based agents, including browser and software engineering (SWE) agents. You will play a key role in shaping Scale’s data strategy by identifying the most effective data sources and methodologies for improving LLM reasoning. Success in this role requires a deep understanding of LLMs, planning algorithms, and novel approaches to agentic reasoning, as well as creativity in tackling challenges related to data generation, model interaction, and evaluation. You will contribute to impactful research on language model reasoning, collaborate with external researchers, and work closely with engineering teams to bring state-of-the-art advancements into scalable, real-world solutions.

Ideally, you’d have:

  • Practical experience working with LLMs, with proficiency in frameworks like PyTorch, JAX, or TensorFlow. You should also be skilled at rapidly interpreting research literature and turning new ideas into working prototypes.

  • A track record of published research in top ML and NLP venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, CoLLM, etc.).

  • At least three years of experience solving complex ML challenges, either in a research setting or product development, particularly in areas related to LLM capabilities and reasoning.

  • Strong written and verbal communication skills, along with the ability to work effectively across teams.

Nice to have:

  • Hands-on experience fine-tuning open-source LLMs or leading bespoke LLM fine-tuning projects using PyTorch/JAX.

  • Research and practical experience in building applications and evaluations related to LLM-based agents, including tool-use, text-to-SQL, browser agents, coding agents, and GUI agents.

  • Experience with agent frameworks such as OpenHands, Swarm, LangGraph, or similar.

  • Familiarity with advanced agentic reasoning techniques such as STaR and PLANSEARCH.

  • Proficiency in cloud-based ML development, with experience in AWS or GCP environments.

Our research interviews are designed to assess candidates' ability to prototype and debug ML models, their depth of understanding in research concepts, and their alignment with our organizational culture. We do not conduct LeetCode-style problem-solving assessments.

Similar Jobs

Optiv - Client Manager - Cybersecurity Sales

Optiv

Fort Worth, Texas, United States (On-Site)
1 Month ago
Valve corporation - Game Development Software Engineer

Valve corporation

Bellevue, Washington, United States (On-Site)
9 Months ago
Tesla - Inside Sales Advisor

Tesla

Manchester, England, United Kingdom (On-Site)
5 Months ago
PhonePe - Assistant Manager Legal/Manager Legal

PhonePe

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Trellix - Manager, Customer Success

Trellix

Cork, County Cork, Ireland (On-Site)
1 Month ago
Sword Health - Senior ML Engineer (Portugal Based Remote/Hybrid)

Sword Health

Porto, Porto District, Portugal (Remote)
6 Days ago
Brillio - Enterprise Architect - AI, Healthcare

Brillio

Jersey City, New Jersey, United States (Hybrid)
1 Week ago
NXP - Internship - AI/ML Engineer

NXP

Nijmegen, Gelderland, Netherlands (On-Site)
1 Year ago
Match Group - Staff AI Engineer, Trust & Safety Operations

Match Group

New York, United States (Hybrid)
2 Months ago
CD PROJEKT RED - Lead AI Engineer

CD PROJEKT RED

Boston, Massachusetts, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Triple dot studios - Lead Product Designer

Triple dot studios

Toronto, Ontario, Canada (Hybrid)
1 Month ago
EveryMatrix - Trainee QA & Configuration Manager

EveryMatrix

L'viv, Dnipropetrovsk Oblast, Ukraine (Hybrid)
2 Months ago
Fieldguide - Principal Program Manager

Fieldguide

San Francisco, California, United States (Remote)
2 Weeks ago
Zinnia - Business Analyst

Zinnia

Pune, Maharashtra, India (On-Site)
2 Weeks ago
IGG - HR & Admin Intern

IGG

Singapore (On-Site)
9 Months ago
ISG - Principal Consultant, Digital Sourcing Solution

ISG

Toronto, Ontario, Canada (Remote)
2 Months ago
Wargaming - Lead Level Artist (World of Tanks)

Wargaming

Nicosia, Nicosia, Cyprus (Hybrid)
9 Months ago
Euromonitor - Senior Marketing Operations Manager

Euromonitor

London, England, United Kingdom (Hybrid)
1 Week ago
fluence - Sr. Maintenance Engineer

fluence

London, England, United Kingdom (Hybrid)
2 Months ago
WebFX - Jr. Social Media Ads and Analytics Specialist

WebFX

Harrisburg, Pennsylvania, United States (On-Site)
9 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Nice - Analyst Relations and Customer Content Specialist

Nice

Atlanta, Georgia, United States (On-Site)
1 Month ago
bytedance - Student Researcher (Doubao (Seed) - Music Foundation Model) - 2025 Start (PhD)

bytedance

San Jose, California, United States (On-Site)
9 Months ago
Gearbox - Senior Site Reliability Engineer

Gearbox

Frisco, Texas, United States (On-Site)
2 Months ago
Carbon Health - Clinic Manager

Carbon Health

Folsom, California, United States (On-Site)
1 Week ago
LightForce Orthodontics - Associate Area Sales Manager

LightForce Orthodontics

Philadelphia, Pennsylvania, United States (On-Site)
3 Weeks ago
QuinStreet - Entry Level Sales Representative

QuinStreet

Orlando, Florida, United States (Hybrid)
3 Months ago
ChainGuard - Corporate Account Executive - West

ChainGuard

United States (Remote)
2 Weeks ago
Google - Staff Software Engineer, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
Toast - Retail Account Executive

Toast

Roanoke, Virginia, United States (On-Site)
1 Month ago
Nightfall AI - Senior ML Platform Backend Engineer

Nightfall AI

San Francisco, California, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

Apple - AIML - Machine Learning Engineer, Siri and Information Intelligence

Apple

Santa Clara, California, United States (On-Site)
1 Month ago
cirrus logic - Mixed-Signal CAD/Design Engineer – AI-Driven EDA CAD Development

cirrus logic

Austin, Texas, United States (Hybrid)
1 Month ago
Apple - Software Engineer, IS&T AiDP Applied Machine Learning

Apple

Sunnyvale, California, United States (On-Site)
3 Months ago
Playtika - Youda - R&D Group Manager

Playtika

Netherlands (Hybrid)
3 Months ago
NXP - Internship - AI/ML Engineer

NXP

Nijmegen, Gelderland, Netherlands (On-Site)
1 Year ago
Sailpoint - Staff AI Engineer

Sailpoint

Austin, Texas, United States (On-Site)
3 Weeks ago
Reddit - Senior Machine Learning Engineer, LS Embeddings

Reddit

United States (Remote)
3 Weeks ago
NXP - Internship - AI/ML Verification Framework for Mixed-Signal Systems

NXP

Eindhoven, North Brabant, Netherlands (On-Site)
1 Month ago
Instrumental - AI Grantwriting Associate

Instrumental

(Remote)
3 Months ago
Nousresearch - Research Scientist

Nousresearch

(On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

London, England, United Kingdom (On-Site)

London, England, United Kingdom (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Scale AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug