Senior Software Engineer, Data Infrastructure

2 Months ago • All levels • Data Analysis • $210,000 PA - $250,000 PA

Job Summary

Job Description

LMArena is seeking a Software Engineer to join their team and build the data infrastructure that powers real-world AI evaluation. The role involves designing and building data pipelines to process and analyze user vote data, directly impacting AI model performance evaluation. This is an opportunity to contribute to ensuring accurate and fair evaluation of human preferences across different models, shaping the future of AI development. The engineer will collaborate with researchers, engineers, and product leadership to retrieve data insights, improve data quality, scale infrastructure, and deepen the ability to compare frontier models and predict human preferences. Responsibilities include designing and building robust data pipelines for data ingestion, processing, and transformation, collaborating with stakeholders to understand data needs, creating result dashboards and reports, ensuring data integrity and quality, and scaling data infrastructure.
Must have:
  • Strong software engineering background with data engineering focus.
  • Proficiency in SQL and Python, Scala, or R.
  • Experience with data processing frameworks (Spark, Ray Data).
  • Experience with big data analytics platforms (Databricks, Snowflake).
  • Experience designing, implementing, and optimizing production data pipelines.
Good to have:
  • Prior work in data analytics or datalake platforms.
  • Experience with advanced data analysis tools (Delta lake, streaming tables).
  • Exposure to machine learning.
Perks:
  • 210k - 250k + equity
  • Competitive salary and meaningful equity
  • Comprehensive healthcare coverage (medical, dental, vision)
  • Opportunity to work on cutting-edge AI with a small, mission-driven team
  • Culture that values transparency, trust, and community impact

Job Details

Software Engineer, Data Infrastructure at LMArena

Location: SF Bay Area/Remote

Type: Full-Time

About the Role:

LMArena is seeking a Software Engineer to join our team and build the data infrastructure that powers real-world AI evaluation. You'll play a crucial role in designing and building the data pipelines that process and analyze over 3 millions user vote data, directly impacting how we understand and evaluate AI model performance. This role is ideal for someone who thrives in fast-moving environments and interested in building products to ensure accurate and fair evaluation of human preferences across different models, which will shape the direction of future AI development.

As an early member of our data engineering team, you'll partner closely with researchers, engineers, and product leadership to retrieve valuable data and insights from human votes and feedback. You'll help us move fast while staying rigorous, improving data quality, scaling our infrastructure to new levels, and deepening our ability to compare frontier models and predict human preferences.

Responsibilities:

  • Design and build robust data pipelines to ingest, process, and transform user vote data to features essential for model performance evaluation.

  • Collaborate with researchers and product leadership to understand product goals and necessary data.

  • Design and implement solutions to generate result dashboards and reports, providing useful information for the public, model providers, and researchers.

  • Ensure the integrity, data quality, and reliability of the pipelines.

  • Scale our data infrastructure to accommodate increasing data volumes and evolving analytical needs.

Who is LMArena?

Created by researchers from UC Berkeley’s SkyLab, LMArena is an open platform where everyone can easily access, explore and interact with the world’s leading AI models. By comparing them side by side and casting votes for the better response, the community helps shape a public leaderboard, making AI progress more transparent, and grounded in real-world usage.

Why Join Us?

Trusted by organizations like Google, OpenAI, Meta, xAI, and more, LMArena is rapidly becoming essential infrastructure for transparent, human-centered AI evaluation at scale. With over one million monthly users and growing developer adoption, our impact is helping guide the next generation of safe, aligned AI systems—grounded in open access and collective feedback.

Our work is regularly referenced by industry leaders pushing the frontier of safe and reliable AI. Sundar Pichai, Jeff Dean, Elon Musk, and Sam Altman.

  • High Impact: Your work will be used daily by the world’s most advanced AI labs.

  • Global Reach: Develop data infrastructure powering millions of real-world evaluations, influencing AI reliability across industries at the top-tier

  • Exceptional Team: We are a small team of top talent from Google, DeepMind, Discord, Vercel, UC Berkeley, and Stanford.

Requirements:

  • Strong software engineering background with a dedicated focus on data engineering and big data technologies.

  • Proficiency in SQL and at least one programming language commonly used for data analysis (Python (preferred), Scala, R).

  • Hands-on experience with data processing and pipeline frameworks (Apache Spark, Ray Data, etc.) and at least one popular big data analytics platform (Databricks, Snowflake).

  • Demonstrated experience in designing, implementing, optimizing, and debugging production data pipelines.

Preferred Qualifications:

  • Prior work in data analytics or datalake platforms.

  • Experience in advanced data analysis tools, such as Delta lake, streaming tables.

  • Exposure to machine learning is a plus.

What we offer:

  • 210k - 250k + equity. Actual compensation will depend on job-related knowledge, skills, experience, and candidate location.

  • Competitive salary and meaningful equity

  • Comprehensive healthcare coverage (medical, dental, vision)

  • The opportunity to work on cutting-edge AI with a small, mission-driven team

  • A culture that values transparency, trust, and community impact

Come help build the space where anyone can explore and help shape the future of AI.

Similar Jobs

Qube Cinema - Engineer – Technical Support

Qube Cinema

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
CyberArk - Senior Integration Engineer

CyberArk

United States (Hybrid)
1 Month ago
pentair - Pool Inside Service/Support Representative

pentair

Apopka, Florida, United States (Remote)
3 Weeks ago
cyara - Senior Software Development Engineer in Test (SDET)

cyara

Hyderabad, Telangana, India (Hybrid)
11 Months ago
Autodesk - Senior Software Engineer (Power Platform)

Autodesk

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Nagarro - Associate Staff Consultant, Business Analyst

Nagarro

Canada (Remote)
9 Months ago
Capgemini - Data Strategy & Consulting

Capgemini

Pune, Maharashtra, India (On-Site)
2 Months ago
onwards Search - Senior Data Analyst, Marketing

onwards Search

Memphis, Tennessee, United States (Remote)
4 Weeks ago
Honor - Director of Data Platform & Analytics

Honor

United States (Remote)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Avalanche Studios Group - Senior Audio Software Engineer

Avalanche Studios Group

Salt Lake City, Utah, United States (Hybrid)
2 Months ago
Open Systems Technologies - SAP LE Consultant

Open Systems Technologies

Ridgefield Park, New Jersey, United States (On-Site)
3 Weeks ago
Apple - Program Manager, Trust & Safety (Data)

Apple

Cupertino, California, United States (On-Site)
1 Month ago
broadcom - Client Services Consultant

broadcom

Mumbai, Maharashtra, India (On-Site)
2 Months ago
OKX - Manager, Legal Response Team

OKX

Budapest, Hungary (On-Site)
3 Weeks ago
Moloco - Senior Software Engineer (Tech Lead)

Moloco

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Motorola solutions - Senior Java Engineer(Core Java, Springboot, Hibernate, Postgre SQL/My SQL)

Motorola solutions

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
TransUnion - JAVA Developer

TransUnion

Bengaluru, Karnataka, India (Hybrid)
2 Weeks ago
truecaller - Release Test Engineer

truecaller

Stockholm, Stockholm County, Sweden (On-Site)
3 Months ago
Ubisoft - Lead Audio Designer

Ubisoft

Shanghai, Shanghai, China (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

Salesforce - Performance Engineering, LMTS

Salesforce

Burlington, Massachusetts, United States (Hybrid)
2 Months ago
Loft Orbital - Test Infrastructure Technical Lead

Loft Orbital

Golden, Colorado, United States (Hybrid)
1 Month ago
HCL Tech - Senior Test Lead - Embedded Software

HCL Tech

California, United States (On-Site)
1 Month ago
Glean - Senior Commercial Counsel

Glean

Nashville, Tennessee, United States (Hybrid)
1 Month ago
Morning Star - Vice President, Business Development

Morning Star

New York, United States (Hybrid)
1 Month ago
Matte projects - Senior Art Director

Matte projects

New York, United States (On-Site)
1 Month ago
Rackspace Technology - Enterprise Healthcare IT Services - Sales Executive V - Southeast

Rackspace Technology

United States (Remote)
2 Weeks ago
AECOM - Senior CAD Specialist

AECOM

Buffalo, New York, United States (On-Site)
1 Month ago
Safe security - Regional Vice President, Sales (New England)

Safe security

Boston, Massachusetts, United States (Remote)
5 Months ago
ISS Stoxx - Voting Operations - Custodian Operations Jr. Analyst

ISS Stoxx

Norman, Oklahoma, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

N-ix - Senior Data Engineer with Snowflake

N-ix

(On-Site)
1 Month ago
Siemens  - Data Engineer - IoT Projects

Siemens

Amadora, Lisbon, Portugal (Hybrid)
2 Months ago
TransUnion - Senior Consultant, Data Science and Analytics

TransUnion

Hong Kong (On-Site)
2 Months ago
Lambda - Data Center Operations Engineer

Lambda

Salt Lake City, Utah, United States (On-Site)
3 Weeks ago
Ziff Davis - Data Scientist

Ziff Davis

Malaga, Western Australia, Australia (Remote)
2 Months ago
Nagarro - Associate Principal Consultant, Business Analyst

Nagarro

(On-Site)
9 Months ago
Wildlife Studios - Senior Data Scientist

Wildlife Studios

São Paulo, Brazil (On-Site)
3 Months ago
Nagarro - Staff Consultant, Business Analyst

Nagarro

India (Remote)
9 Months ago
Rackspace Technology - Data Architect

Rackspace Technology

Vietnam (Remote)
6 Months ago
Motorola solutions - Senior Data Scientist

Motorola solutions

Bengaluru, Karnataka, India (On-Site)
2 Years ago

Get notifed when new similar jobs are uploaded

About The Company

San Francisco, California, United States (Hybrid)

California, United States (Hybrid)

United States (Remote)

California, United States (Hybrid)

United States (Remote)

California, United States (Hybrid)

California, United States (Hybrid)

California, United States (Hybrid)

California, United States (Hybrid)

California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by LMArena

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug