Software Engineer, GenAI Model Evaluation

2 Months ago • 5 Years + • Artificial Intelligence

About the job

Job Description

Software Engineer to build and evaluate GenAI models at Scale. Must have 5+ years of experience, proficiency in Python, Node, React, Next.js, and MongoDB, experience scaling products at startups, and strong problem-solving skills.
Must have:
  • 5+ years experience
  • Python, Node, React
  • Scaling products
  • Problem-solving skills
Good to have:
  • AI platforms
  • Generative models
  • ML infrastructure
  • AI-powered solutions
Perks:
  • High impact work
  • Entrepreneurial environment
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

About Job

Software is eating the world, but AI is eating software. We live in unprecedented times – AI has the potential to exponentially augment human intelligence. As the world adjusts to this new reality, leading tech companies are racing to build LLMs at billion dollar scale, while large enterprises figure out how to add it to their products. To ensure that these models are safe, aligned, and highly useful, they require extremely high quality human-generated data and evaluation. Since before the launch of ChatGPT, through to the latest generation of frontier models coming out today, Scale has been at the forefront of providing the post-training, fine-tuning, and human preference alignment (RLHF) data needed to ensure these models are capable, aligned, and useful via our Generative AI Data Engine. The data we are producing is some of the most important work for how humanity will interact with AI.

As customers train their models on this data, and constantly aim to improve them, a critical need is having trustworthy evaluations of model performance, and an ability to identify weaknesses and potential vulnerabilities. Conducting these evaluations with our human experts constitutes a significant and growing portion of Scale’s work—thus assisting model developers in iteratively understanding where to focus their technical investments.

The GenAI Safety & Evaluation product team at Scale is at the heart of this work, building a world-class customer-facing model evaluation platform. This platform enables customers to easily launch new evaluation workflows, deep dive into evaluation results down to the test case level to understand weaknesses and benchmark performance, and use these insights to drive model development roadmaps. In building this product, you will have a chance to shape the way that models across the industry are evaluated, impacting billions of people around the world. And as a newer product at Scale, you will have the opportunity to build something impactful from the ground up.

As part of the Safety & Evaluation product team, you will partner closely with researchers from Scale’s Safety, Evaluations, and Alignment Lab (SEAL) on productization of novel research, as well as Scale’s expert red team, which supports AI safety via rigorous model testing trusted by the White House, major enterprises, and leading model developers.

We’re looking for entrepreneurial Software Engineers to join our team. In this role, you'll be given the opportunity to build these products and drive millions of dollars in revenue. You’ll also get widespread exposure to the forefront of the AI race as Scale sees it in enterprises, startups, governments, and large tech companies.

The ideal person is a natural entrepreneurial engineer who can take an ambiguous scope and lead the execution of outcomes, doing what it takes to hit them including both backend and frontend coding, defining requirements, coordinating with other eng and operations teams at Scale, etc. We strongly believe the best engineers own outcomes and deeply understand customer problems.

You will:

  • Own large new areas within our product
  • Work across backend, frontend, and interacting with LLMs and/or other ML models
  • Deliver experiments at a high velocity and level of quality to engage our customers
  • Work across the entire product lifecycle from conceptualization through production
  • Be able, and willing, to multi-task and learn new technologies quickly
  • Collaborating with cross-functional teams to define, design, and ship new product features and experiences.
  • Must be able to commute to the San Francisco Office 1-2x weekly. 

Ideally you’d have:

  • 5+ years of full-time engineering experience, post-graduation
  • Proficiencies in one or more of Python, Node, React, Next.js and MongoDB
  • Solid background in algorithms, data structures, and object-oriented programming.
  • Experience scaling products at hyper-growth startups
  • Excitement to work with AI technologies
  • Strong written and verbal communication skills
  • Strong problem-solving skills, and be able to work independently or as part of a team.

Nice to haves:

  • Strong knowledge of software engineering best practices.
  • Have experience with AI platforms and technologies, including generative models and LLMs.
  • Experience building ML infrastructure and AI-powered solutions.
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

San Francisco, California, United States (Hybrid)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

San Francisco, California, United States (On-Site)

Washington, District Of Columbia, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Scale AI

Similar Jobs

Wargaming - Frontend Developer (Data Warehouse Team)

Wargaming, Czechia (Remote)

Gamezop - Software Engineer - Frontend

Gamezop, India (Remote)

Ziff Davis - Senior Full Stack Software Engineer

Ziff Davis, United States (Hybrid)

Flow - Senior/Staff Web Engineer

Flow, United States (Hybrid)

Meetelise - Junior Research Scientist

Meetelise, (On-Site)

Eleven Labs - AI Safety Engineer

Eleven Labs, United States (Remote)

Barbaricum - Senior Technical Project Manager

Barbaricum, United States (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

undefined - Technical Consultant, West

United States (Remote)

Token Metrics - Crypto QA Engineer (Remote)

Token Metrics, Türkiye (Remote)

Wargaming - Frontend Developer (Data Warehouse Team)

Wargaming, Czechia (Remote)

Eli Lilly and Company - Full Stack Engineer

Eli Lilly and Company, India (On-Site)

Ironhide Game - Fullstack Developer - Unity 3D

Ironhide Game, Uruguay (On-Site)

Flow - Senior/Staff Web Engineer

Flow, United States (Hybrid)

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Hudl - Product Design Manager

Hudl, United States (Remote)

Paypal - Product Marketing Director, Credit

Paypal, United States (Hybrid)

Argus Labs - Senior  Software Engineer (Game Server)

Argus Labs, United States (On-Site)

PlayStation Global - Senior Site Reliability Engineer

PlayStation Global, United States (On-Site)

Intel Corporation - Foundry Development Reliability Intern

Intel Corporation, United States (Hybrid)

Xsolla - Human Resources Assistant

Xsolla, United States (Hybrid)

Rackspace Technology - Lead Enterprise Engagement Manager

Rackspace Technology, United States (Remote)

Inworld AI - Sales / GTM Lead - USA

Inworld AI, United States (On-Site)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Get notifed when new similar jobs are uploaded