AI Infrastructure Engineer, ML Data Platform

2 Months ago • 2 Years + • Data Analysis • $188,000 PA - $225,600 PA

Job Summary

Job Description

As a Data Infrastructure Engineer on the AI Infrastructure team, you will design, build, and scale the data platform that powers all R&D and applied ML initiatives at Scale. You will collaborate closely with product engineering, platform engineering, and ML researchers to build robust and easy-to-use APIs and data pipelines. Your work will play a critical role in advancing frontier ML research, accelerating the data sales cycle, and improving data quality - all while optimizing infrastructure costs. You will design, implement, and maintain scalable data platforms to support diverse R&D and applied ML workloads and participate in the team’s on-call process.
Must have:
  • 2+ years of experience in building large-scale data systems.
  • Expertise in modern data platform technologies.
  • Experience with containerization and deployment technologies.
  • Strong problem solving skills in a dynamic environment.
Good to have:
  • Familiarity with ML development tools.
  • Experience with various storage systems.
  • Exposure to orchestration platforms.
  • Experience supporting post-training workflows.
  • Experience in a fast-moving startup environment.

Job Details

Scale’s AI Infrastructure team supports both R&D and applied Generative AI initiatives, driving breakthroughs in areas of post-training research such as AI safety, agents, and evaluating state-of-the-art model performance.

As a Data Infrastructure Engineer on the AI Infrastructure team, you will design, build, and scale the data platform that powers all R&D and applied ML initiatives at Scale. Collaborating closely with product engineering, platform engineering, and ML researchers, you will build robust and easy-to-use APIs and data pipelines. Your work will play a critical role in advancing frontier ML research, accelerating the data sales cycle, and improving data quality - all while optimizing infrastructure costs.

You will:

  • Design, implement, and maintain scalable data platforms to support diverse R&D and applied ML workloads.
  • Partner with ML researchers, product engineers, and operations teams to align data infrastructure with organizational goals.
  • Collaborate with ML researchers to build data access tools that help advance the state of frontier post-training research.
  • Participate in our team’s on call process to ensure the availability of our services.
  • Own projects end-to-end, from requirements, scoping, design, to implementation, in a highly collaborative and cross-functional environment.

Ideally you'd have:

  • 2+ years of experience in building and operating large-scale distributed data systems that support ML workloads.
  • Expertise in modern data platform technologies.
  • Experience working with standard containerization & deployment technologies like Kubernetes, Helm, Terraform, Docker, etc.
  • Strong problem solving skills and the ability to work effectively in a fast paced, dynamic environment.

Nice to haves:

  • Familiarity with ML development tools such as PyTorch, HuggingFace, or Weights & Biases.
  • Experience with a variety of storage systems: object (S3), document (MongoDB), relational (Postgres), and distributed (Redis, Elasticsearch).
  • Exposure to orchestration platforms like Temporal, Airflow, or AWS Step Functions.
  • Experience supporting post-training workflows such as evaluation, fine-tuning, and RLHF in LLM systems.
  • Experience working in a fast-moving startup or high-scale ML infra environment.

Similar Jobs

Apple - AIML - Staff Machine Learning Engineer - Reinforcement Learning

Apple

Santa Clara, California, United States (On-Site)
2 Weeks ago
Reddit - Senior Data Science Manager, Ads Marketplace

Reddit

United States (Remote)
1 Month ago
Scopely - VP, Product Management - Star Trek Fleet Command

Scopely

Dublin, County Dublin, Ireland (On-Site)
8 Months ago
Morning Star - Talent Development Specialist

Morning Star

Mumbai, Maharashtra, India (Hybrid)
3 Weeks ago
Tekion Corp - Manager, Product Design

Tekion Corp

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Zoe - Senior Product Data Analyst

Zoe

United Kingdom (Remote)
1 Week ago
blend - Lead Data Scientist

blend

Hyderabad, Telangana, India (On-Site)
1 Week ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
5 Months ago
Krafton - Data Engineer

Krafton

Seoul, South Korea (On-Site)
2 Weeks ago
Gram Games - Data Analyst

Gram Games

Istanbul, İstanbul, Türkiye (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Apple - Quality Systems Engineer - Infrastructure

Apple

Cupertino, California, United States (On-Site)
3 Weeks ago
PlayStation Global - Creator Platform Planning Manager

PlayStation Global

Aliso Viejo, California, United States (On-Site)
3 Months ago
N-ix - Middle Data Science/AI Engineer

N-ix

Langenfeld, North Rhine-Westphalia, Germany (Hybrid)
1 Month ago
Optiv - Client Manager - Cybersecurity Sales

Optiv

Fort Worth, Texas, United States (On-Site)
3 Weeks ago
Rockstar Games - Senior DevOps Engineer

Rockstar Games

Edinburgh, Scotland, United Kingdom (On-Site)
9 Months ago
Minted - Principal UX Designer

Minted

(Remote)
2 Months ago
Thales - Head of Compensation and Benefits

Thales

Singapore (On-Site)
1 Month ago
Vertx Inc. - Tax Analyst II

Vertx Inc.

United States (Remote)
3 Weeks ago
bounteous - Associate Manager, Marketing Automation

bounteous

Chennai, Tamil Nadu, India (Hybrid)
1 Month ago
The Walt Disney Company - Brand & Content Marketing Specialist

The Walt Disney Company

Petaling Jaya, Selangor, Malaysia (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Ansys - Senior Product Sales Manager - Channel

Ansys

Canonsburg, Pennsylvania, United States (Remote)
1 Week ago
Rivian - Retail Operations Lead

Rivian

Irvine, California, United States (On-Site)
3 Weeks ago
Anavation - Program Manager

Anavation

Clarksburg, West Virginia, United States (On-Site)
3 Weeks ago
Apple - Data Engineer

Apple

New York, New York, United States (On-Site)
3 Weeks ago
Apple - Business Systems Analyst

Apple

Austin, Texas, United States (On-Site)
1 Month ago
Motorola solutions - Sr. Systems Engineer

Motorola solutions

Plantation, Florida, United States (Remote)
1 Month ago
Coherent corp. - Sales Specialist

Coherent corp.

Santa Clara, California, United States (Hybrid)
2 Weeks ago
Riot Games - VFX Artist II - VALORANT, Premium Content

Riot Games

United States (On-Site)
4 Months ago
Coherent corp. - Principal Silicon Photonics Packaging Engineer

Coherent corp.

Fremont, California, United States (On-Site)
2 Months ago
Apple - Wireless RF PHY FW Engineer

Apple

Sunnyvale, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Electronic Arts - Advanced Data Analyst, UGX

Electronic Arts

Vancouver, British Columbia, Canada (Hybrid)
3 Months ago
ness digital  - Big Data Engineer

ness digital

Timișoara, Timiș, Romania (Remote)
5 Months ago
Simcorp - Senior Financial Data Analyst

Simcorp

Noida, Uttar Pradesh, India (Hybrid)
1 Month ago
Capgemini - Data Engineer

Capgemini

Kolkata, West Bengal, India (On-Site)
1 Month ago
Ion - Data Engineer

Ion

Budapest, Hungary (On-Site)
8 Months ago
OKX - Data Analyst & Business Strategy Director

OKX

Singapore (On-Site)
1 Month ago
dun bradstreet - Senior Data Engineer

dun bradstreet

Hyderabad, Telangana, India (Hybrid)
3 Weeks ago
Illumina - Staff, SAP Quality Business Process Analyst (SAP QM)

Illumina

San Diego, California, United States (On-Site)
1 Month ago
Normalyze - Customer Success Engineer - Data Security - Implementation - DSPM - Bangalore

Normalyze

Bengaluru, Karnataka, India (Remote)
8 Months ago
version 1 - Business Systems Analyst

version 1

Dublin, County Dublin, Ireland (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

New York, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Scale AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug