Location: SF Bay Area
Type: Full-Time
About the Role:
LMArena is seeking a hands-on, vision-led Product Lead to define the future of how people interact with, evaluate, and understand AI systems. You’ll shape the core user experience of a consumer AI product, how people explore models, cast votes, and contribute real-world evaluations, while also delivering powerful tools and insights for model providers and developers using LMArena to benchmark and improve their AI systems. This role sits at the intersection of public transparency and frontier AI innovation. It’s ideal for someone who builds trust through product design, thrives in ambiguity, and wants to help build a category-defining company from the ground up.
Responsibilities:
Define, scope, and prioritize product initiatives that improve the model evaluation experience for both consumers and model providers
Design personalized experiences for consumers, such as custom leaderboards or voting history, to deepen engagement and user stickiness
Partner with engineering, design, and research to build intuitive, high-trust features across web, backend, and dashboards
Develop features that drive growth, retention and meaningful value across the core user experience as they exploring models, casting votes, and contribute to real-world benchmark data
Develop tools and interfaces that help model providers test, compare, and gain insight into the performance of their systems
Translate community feedback and usage patterns into product opportunities and iterative improvements
Maintain a clear, evolving roadmap aligned with company priorities, user needs, and AI ecosystem developments
Define success metrics, run lightweight experiments, and guide decisions with data, observation, and feedback
Stay informed on AI trends, new model releases, and research benchmarks to anticipate future product needs and opportunities
Who is LMArena?
Created by researchers from UC Berkeley’s SkyLab, LMArena is an open platform where everyone can easily access, explore and interact with the world’s leading AI models. By comparing them side by side and casting votes for the better response, the community helps shape a public leaderboard, making AI progress more transparent, and grounded in real-world usage.
Why Join Us?
Trusted by organizations like Google, OpenAI, Meta, xAI, and more, LMArena is rapidly becoming essential infrastructure for transparent, human-centered AI evaluation at scale. With over one million monthly users and growing developer adoption, our impact is helping guide the next generation of safe, aligned AI systems—grounded in open access and collective feedback.
Trusted by organizations like Google, OpenAI, Meta, xAI, and more, LMArena is rapidly becoming essential infrastructure for transparent, human-centered AI evaluation at scale. With over one million monthly users and growing developer adoption, our impact is helping guide the next generation of safe, aligned AI systems—grounded in open access and collective feedback.
Our work is regularly referenced by industry leaders pushing the frontier of safe and reliable AI. Sundar Pichai, Jeff Dean, Elon Musk, and Sam Altman.
High Impact: Your work will be used daily by the world’s most advanced AI labs.
Global Reach: Develop data infrastructure powering millions of real-world evaluations, influencing AI reliability across industries at the top-tier
Exceptional Team: We are a small team of top talent from Google, DeepMind, Discord, Vercel, UC Berkeley, and Stanford.
Requirements:
5–7 years of experience in product management, preferably in a technical, data-heavy, or API-driven environment
Proven ability to build and scale 0→1 products, especially in ambiguous or frontier problem spaces with a small and nimble team
Strong product intuition with a track record of turning complex systems into intuitive user experiences
Excellent communication skills, with the ability to collaborate across engineering, design, and marketing
Comfortable working with technical concepts such as LLMs, model evaluation, benchmarking, or developer tooling
Experience prioritizing and shipping features in a fast-paced, startup environment
Familiarity with defining product metrics, running experiments, and making data-informed decisions
Deep sense of ownership, urgency, and curiosity, motivated to build something enduring and impactful
Preferred Qualifications:
Familiarity with Arena’s tech stack (e.g. Next.js, HonoJS, Postgres) and modern web product development
Experience working on developer platforms, research tools, benchmarking systems, or API-driven products. Tools that serve and help grow a developer community.
Knowledge of LLMs, open model ecosystems, or AI evaluation methodologies (e.g. leaderboard logic, voting, bias analysis)
Exposure to real-time systems, model deployment workflows, or analytics infrastructure
Comfort working closely with design systems (e.g. Tailwind, ShadCN) and front-end development teams
Strong interest in transparency, open source, and building tools that empower the AI community
Background in product-led growth, engagement loops, or user acquisition
What we offer:
The cash compensation for this position is 204k-231k. Actual compensation will depend on job-related knowledge, skills, experience, and candidate location.
Competitive salary and meaningful equity
Comprehensive healthcare coverage (medical, dental, vision)
The opportunity to work on cutting-edge AI with a small, mission-driven team
A culture that values transparency, trust, and community impact
Come help build the space where anyone can explore and help shape the future of AI.