Machine Learning Engineer

3 Months ago • 4 Years + • Research Development • $210,000 PA - $250,000 PA

Job Summary

Job Description

LMArena is seeking a Senior Machine Learning Engineer to scale and strengthen core infrastructure for real-world AI evaluation. You will play a foundational role in building, deploying, and improving model benchmarking systems, working across data pipelines, inference APIs, and new evaluation methodologies. This role involves partnering with researchers, engineers, and product leadership to translate new ideas into reliable systems, improve reproducibility, scale to new modalities, and deepen the understanding and comparison of frontier models. The company is trusted by leading AI organizations and has a significant user base, aiming to guide the next generation of safe, aligned AI systems.
Must have:
  • Strong programming skills across typical recommendation/LLM stacks
  • Experience in deep learning or reward model training
  • Experience with LLMs for fine-tuning, prompt engineering, function calling
  • Self-motivated and takes ownership
  • Passion for shipping quality products
  • 4+ years of industry experience
  • Solid understanding of statistics and evaluation methodologies
Perks:
  • 210k - 250k + equity
  • Competitive salary and meaningful equity
  • Comprehensive healthcare coverage
  • Opportunity to work on cutting-edge AI
  • Culture valuing transparency, trust, and community impact

Job Details

Machine Learning Engineer at LMArena

Location: SF Bay Area/Remote

Type: Full-Time

About the Role:

LMArena is seeking a Senior Machine Learning Engineer to help scale and strengthen the core infrastructure that powers real-world AI evaluation. You’ll play a foundational role in shaping how we build, deploy, and improve our model benchmarking systems, working across data pipelines, inference APIs, and new evaluation methodologies. This is an opportunity to apply your technical expertise to a platform trusted by millions, and to help define how cutting-edge AI is assessed in the wild.

As one of the first ML engineers on the team, you’ll partner closely with researchers, engineers, and product leadership to turn new ideas into reliable systems. You’ll help us move fast while staying rigorous, improving reproducibility, scaling up to new modalities, and deepening our ability to understand and compare frontier models.

Responsibilities:

  • Architect and build what will become our core modeling for data and evaluation products

  • Own the full stack data, model training, and eval pipelines

  • Help grow a culture of feedback and rapid product iteration as we build new features as a tight-nit team

  • Conduct research into state-of-the-art evaluation methods and contribute to the long-term vision for a centralized, scalable evaluation platform.

Who is LMArena?

Created by researchers from UC Berkeley’s SkyLab, LMArena is an open platform where everyone can easily access, explore and interact with the world’s leading AI models. By comparing them side by side and casting votes for the better response, the community helps shape a public leaderboard, making AI progress more transparent, and grounded in real-world usage.

Why Join Us?

Trusted by organizations like Google, OpenAI, Meta, xAI, and more, LMArena is rapidly becoming essential infrastructure for transparent, human-centered AI evaluation at scale. With over one million monthly users and growing developer adoption, our impact is helping guide the next generation of safe, aligned AI systems—grounded in open access and collective feedback.

Our work is regularly referenced by industry leaders pushing the frontier of safe and reliable AI. Sundar Pichai, Jeff Dean, Elon Musk, and Sam Altman.

  • High Impact: Your work will be used daily by the world’s most advanced AI labs.

  • Global Reach: Develop data infrastructure powering millions of real-world evaluations, influencing AI reliability across industries at the top-tier

  • Exceptional Team: We are a small team of top talent from Google, DeepMind, Discord, Vercel, UC Berkeley, and Stanford.

Requirements:

  • Strong programming skills with the ability to work across the stack in a typical recommendation system or LLM stack

  • Experience in deep learning, language models or reward model training

  • Experience in working with LLM for fine tuning, prompt engineering, function calling etc

  • Self-motivated with a willingness to take ownership of tasks

  • A passion for shipping quality products

  • 4+ years of industry experience or relevant projects

  • Solid understanding of statistics, and various tools and methodologies for evaluating uncertainty in a way that is specific to the given product being shipped

What we offer:

  • 210k - 250k + equity. Actual compensation will depend on job-related knowledge, skills, experience, and candidate location.

  • Competitive salary and meaningful equity

  • Comprehensive healthcare coverage (medical, dental, vision)

  • The opportunity to work on cutting-edge AI with a small, mission-driven team

  • A culture that values transparency, trust, and community impact

Come help build the space where anyone can explore and help shape the future of AI.

Similar Jobs

Datahub - Full-Stack Engineer

Datahub

Palo Alto, California, United States (Hybrid)
1 Month ago
bytedance - Android/iOS Engineer, Flow - 2025 Start

bytedance

Singapore (On-Site)
9 Months ago
Match Group - Senior Software Engineer, Backend

Match Group

Palo Alto, California, United States (Hybrid)
1 Month ago
Apple - Camera ISP Algorithm Engineer - Auto Focus

Apple

Cupertino, California, United States (On-Site)
1 Month ago
Immutable - Customer Growth Manager (EMEA)

Immutable

Dubai, Dubai, United Arab Emirates (Remote)
1 Month ago
Anavation - Senior Vulnerability Researcher

Anavation

Lorton, Virginia, United States (Hybrid)
4 Months ago
broadcom - R&D Software Engineer

broadcom

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Match Group - Senior Machine Learning Engineer

Match Group

Vancouver, British Columbia, Canada (Hybrid)
1 Month ago
Paper Stacking games - Client Development - AI Direction

Paper Stacking games

Shanghai, China (On-Site)
3 Weeks ago
bytedance - Research Scientist Graduate (High-Performance Computing (Algorithm Acceleration)- Vision AI Platform)

bytedance

San Jose, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Grammarly - Data Scientist

Grammarly

Berlin, Berlin, Germany (Hybrid)
1 Month ago
whoop - Senior Software Engineer (Full Stack)

whoop

Boston, Massachusetts, United States (On-Site)
1 Month ago
Immutable - Customer Growth Manager (EMEA)

Immutable

Dubai, Dubai, United Arab Emirates (Remote)
1 Month ago
Match Group - Senior Software Engineer, Backend

Match Group

West Hollywood, California, United States (Hybrid)
1 Month ago
NXP - Senior Product Marketing Manager

NXP

Shanghai, China (On-Site)
1 Month ago
bytedance - iOS Software Engineer, Flow

bytedance

Singapore (On-Site)
9 Months ago
bytedance - Android Software Engineer, Flow

bytedance

Singapore (On-Site)
9 Months ago
HoYoverse - Product Manager, AI-Powered Services

HoYoverse

Singapore, Singapore (On-Site)
3 Months ago
Plaid  - Experienced Software Engineer – Network Enablement

Plaid

San Francisco, California, United States (On-Site)
3 Months ago
Match Group - Senior Software Engineer, Backend

Match Group

Palo Alto, California, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

Lilt - Polish US-based Medical Translators needed

Lilt

United States (Remote)
2 Weeks ago
Philo - Lead Data Scientist

Philo

San Francisco, California, United States (Remote)
3 Months ago
Open Systems Technologies - Veterinary Assistant

Open Systems Technologies

Bondurant, Iowa, United States (On-Site)
4 Weeks ago
Super.com - Engineering Manager, Payment Processing

Super.com

Canada, Kentucky, United States (Remote)
2 Months ago
Crowd Strick - Engineering Manager, Linux Sensor (Remote)

Crowd Strick

United States (Remote)
2 Weeks ago
bytedance - Cloud Network Engineer

bytedance

Seattle, Washington, United States (On-Site)
4 Months ago
The Walt Disney Company - Character Designer - Unannounced CG Series (Disney Televison Animation)

The Walt Disney Company

Glendale, California, United States (On-Site)
3 Weeks ago
Cognite - Senior Site Reliability Engineer

Cognite

Austin, Texas, United States (Hybrid)
1 Year ago
Figma - Director, People Analytics

Figma

San Francisco, California, United States (Remote)
1 Month ago
Gigamon - Regional Sales Director – US Air Force

Gigamon

Vienna, Virginia, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

bytedance - Machine Learning Engineer Intern (E-commerce-Supply Chain & Logistics)

bytedance

Seattle, Washington, United States (On-Site)
4 Months ago
bytedance - AI Model Optimization Engineer

bytedance

San Jose, California, United States (On-Site)
4 Months ago
Apple - Engineering Program Manager, AIML Annotation & Visualization

Apple

Cupertino, California, United States (On-Site)
2 Months ago
NielsenIQ - Machine Learning Engineer

NielsenIQ

Barcelona, Catalonia, Spain (On-Site)
2 Months ago
Valeo - Lead - Digital & AI

Valeo

Chennai, Tamil Nadu, India (On-Site)
3 Months ago
Playtika - Games R&D-Flutter Client Developer

Playtika

Poland (On-Site)
8 Months ago
Apple - AIML - Machine Learning Engineer, Siri and Information Intelligence

Apple

Santa Clara, California, United States (On-Site)
2 Months ago
Qualcomm - AI SW Engineer/Senior Engineer, AI PC SDK

Qualcomm

Taipei City, Taiwan (On-Site)
1 Month ago
Microsoft - Senior Applied Researcher

Microsoft

Redmond, Washington, United States (On-Site)
3 Months ago
WebTech Corporation - Senior Director, AI & Data Architecture

WebTech Corporation

Bengaluru, Karnataka, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

California, United States (Hybrid)

United States (Remote)

California, United States (Hybrid)

United States (Remote)

California, United States (Hybrid)

California, United States (Hybrid)

California, United States (Hybrid)

California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by LMArena

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug