Senior Software Engineer, Data Infrastructure

1 Month ago • All levels • Data Analysis • $210,000 PA - $250,000 PA

Job Summary

Job Description

LMArena is seeking a Software Engineer to join their team and build the data infrastructure that powers real-world AI evaluation. The role involves designing and building data pipelines to process and analyze user vote data, directly impacting AI model performance evaluation. This is an opportunity to contribute to ensuring accurate and fair evaluation of human preferences across different models, shaping the future of AI development. The engineer will collaborate with researchers, engineers, and product leadership to retrieve data insights, improve data quality, scale infrastructure, and deepen the ability to compare frontier models and predict human preferences. Responsibilities include designing and building robust data pipelines for data ingestion, processing, and transformation, collaborating with stakeholders to understand data needs, creating result dashboards and reports, ensuring data integrity and quality, and scaling data infrastructure.
Must have:
  • Strong software engineering background with data engineering focus.
  • Proficiency in SQL and Python, Scala, or R.
  • Experience with data processing frameworks (Spark, Ray Data).
  • Experience with big data analytics platforms (Databricks, Snowflake).
  • Experience designing, implementing, and optimizing production data pipelines.
Good to have:
  • Prior work in data analytics or datalake platforms.
  • Experience with advanced data analysis tools (Delta lake, streaming tables).
  • Exposure to machine learning.
Perks:
  • 210k - 250k + equity
  • Competitive salary and meaningful equity
  • Comprehensive healthcare coverage (medical, dental, vision)
  • Opportunity to work on cutting-edge AI with a small, mission-driven team
  • Culture that values transparency, trust, and community impact

Job Details

Software Engineer, Data Infrastructure at LMArena

Location: SF Bay Area/Remote

Type: Full-Time

About the Role:

LMArena is seeking a Software Engineer to join our team and build the data infrastructure that powers real-world AI evaluation. You'll play a crucial role in designing and building the data pipelines that process and analyze over 3 millions user vote data, directly impacting how we understand and evaluate AI model performance. This role is ideal for someone who thrives in fast-moving environments and interested in building products to ensure accurate and fair evaluation of human preferences across different models, which will shape the direction of future AI development.

As an early member of our data engineering team, you'll partner closely with researchers, engineers, and product leadership to retrieve valuable data and insights from human votes and feedback. You'll help us move fast while staying rigorous, improving data quality, scaling our infrastructure to new levels, and deepening our ability to compare frontier models and predict human preferences.

Responsibilities:

  • Design and build robust data pipelines to ingest, process, and transform user vote data to features essential for model performance evaluation.

  • Collaborate with researchers and product leadership to understand product goals and necessary data.

  • Design and implement solutions to generate result dashboards and reports, providing useful information for the public, model providers, and researchers.

  • Ensure the integrity, data quality, and reliability of the pipelines.

  • Scale our data infrastructure to accommodate increasing data volumes and evolving analytical needs.

Who is LMArena?

Created by researchers from UC Berkeley’s SkyLab, LMArena is an open platform where everyone can easily access, explore and interact with the world’s leading AI models. By comparing them side by side and casting votes for the better response, the community helps shape a public leaderboard, making AI progress more transparent, and grounded in real-world usage.

Why Join Us?

Trusted by organizations like Google, OpenAI, Meta, xAI, and more, LMArena is rapidly becoming essential infrastructure for transparent, human-centered AI evaluation at scale. With over one million monthly users and growing developer adoption, our impact is helping guide the next generation of safe, aligned AI systems—grounded in open access and collective feedback.

Our work is regularly referenced by industry leaders pushing the frontier of safe and reliable AI. Sundar Pichai, Jeff Dean, Elon Musk, and Sam Altman.

  • High Impact: Your work will be used daily by the world’s most advanced AI labs.

  • Global Reach: Develop data infrastructure powering millions of real-world evaluations, influencing AI reliability across industries at the top-tier

  • Exceptional Team: We are a small team of top talent from Google, DeepMind, Discord, Vercel, UC Berkeley, and Stanford.

Requirements:

  • Strong software engineering background with a dedicated focus on data engineering and big data technologies.

  • Proficiency in SQL and at least one programming language commonly used for data analysis (Python (preferred), Scala, R).

  • Hands-on experience with data processing and pipeline frameworks (Apache Spark, Ray Data, etc.) and at least one popular big data analytics platform (Databricks, Snowflake).

  • Demonstrated experience in designing, implementing, optimizing, and debugging production data pipelines.

Preferred Qualifications:

  • Prior work in data analytics or datalake platforms.

  • Experience in advanced data analysis tools, such as Delta lake, streaming tables.

  • Exposure to machine learning is a plus.

What we offer:

  • 210k - 250k + equity. Actual compensation will depend on job-related knowledge, skills, experience, and candidate location.

  • Competitive salary and meaningful equity

  • Comprehensive healthcare coverage (medical, dental, vision)

  • The opportunity to work on cutting-edge AI with a small, mission-driven team

  • A culture that values transparency, trust, and community impact

Come help build the space where anyone can explore and help shape the future of AI.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in California, United States

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Data Analysis Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

California, United States (Hybrid)

United States (Remote)

California, United States (Hybrid)

California, United States (Remote)

California, United States (Hybrid)

California, United States (Hybrid)

California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

California, United States (Hybrid)

California, United States (Remote)

View All Jobs

Get notified when new jobs are added by LMArena

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug