Senior Software Engineer, HPC Platform Modernization

5 Months ago • 7 Years + • Devops • $185,000 PA - $252,000 PA

Job Summary

Job Description

Zoox is seeking an experienced Software Engineer to modernize their High-Performance Computing (HPC) infrastructure and its supporting ecosystem. This role involves developing key frameworks and services for Autonomous Vehicle development, utilizing technologies like Ray.io and SLURM. The engineer will be responsible for distributed system design, algorithmic job scheduling, and cloud scaling. The position offers a high degree of independence and the opportunity to shape the company's compute scaling strategy, working with autonomy and software teams to enhance developer experiences.
Must have:
  • 7+ years of experience
  • Experience with Ray.io
  • Experience with Kubernetes
  • Experience with Ray.io/Kubernetes on AWS/Azure/GCP
  • Proficiency in Python
Good to have:
  • Exposure to ML workloads
  • Experience with Kubernetes/SLURM at scale (>10k nodes)
  • Experience with SLURM
Perks:
  • Paid time off
  • Zoox Stock Appreciation Rights
  • Amazon RSUs
  • Health insurance
  • Long-term care insurance
  • Disability insurance
  • Life insurance

Job Details

Zoox is looking for an experienced Software Engineer to work on key new frameworks and infrastructure modernization for our custom High-Performance Computing infrastructure and its supporting ecosystem of tools and services. Zoox HPC services combine industry-best scheduling and workload orchestration technologies, such as Ray.io and SLURM, with value-add workflows specifically for Autonomous Vehicle development. These HPC services form the backbone of development workflows across all Zoox software teams, from data engineering to training our AI models in Perception, Planner, Prediction, to simulation, and more. You will take on a breadth of end-to-end responsibilities including distributed system design, algorithmic job scheduling, and adaptive cloud scaling in support of all of Zoox’s computational needs.

The position comes with a high degree of independence and the opportunity to help define Zoox’s compute scaling strategy, both technically and organizationally. You will work closely with stakeholders in Autonomy and Software teams to iterate on world-class developer experiences, incorporating the latest industry tools and best practices.

In this role, you will:

  • Evaluate new distributed system paradigms and technologies to meet Zoox’s ever-growing computational and storage needs
  • Strike a balance between incremental improvements to Zoox’s existing in-house HPC infrastructure and greenfield services and abstractions.
  • Create production-grade web service APIs, SDKs, and other tools to provide a world-class developer experience for all of Zoox’s software teams.

Qualifications

  • 7+ years of experience
  • Experience with Ray.io, particularly Ray Core and Ray Data
  • Experience with Kubernetes, particularly for heterogeneous workloads and clusters
  • Experience with Ray.io and Kubernetes deployed on Amazon Web Services (AWS) or other similar cloud providers such as Azure or GCP
  • Proficiency with Python

Bonus Qualifications

  • Exposure to machine learning workloads (training, inference, data generation, etc) from a compute infra service provider perspective
  • Experience with Kubernetes or SLURM at scale (>10k+ nodes)
  • Experience with SLURM workload manager

$185,000 - $252,000 a year
Base Salary Range

There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.

Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Follow us on LinkedIn

Accommodations
If you need an accommodation to participate in the application or interview process please reach out to accommodations@zoox.com or your assigned recruiter.

A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

Similar Jobs

endava - Senior Java Automation Tester

endava

Pitești, Argeș, Romania (On-Site)
1 Month ago
Epic Games - Senior Web Engineer

Epic Games

Cary, North Carolina, United States (On-Site)
4 Months ago
Miratech - Senior Automation Testing Engineer (Python)

Miratech

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Hedra - Senior Backend Engineer

Hedra

New York, New York, United States (On-Site)
3 Months ago
Eneba Games - Data Engineer

Eneba Games

Lithuania (Remote)
5 Months ago
Ion - Software Architect - Java Multi-Tenant SAAS Cloud Native

Ion

Pune, Maharashtra, India (On-Site)
8 Months ago
Cadence - Lead Solutions Engineer

Cadence

Hsinchu, Hsinchu City, Taiwan (On-Site)
1 Week ago
FICO - Cloud Lead Engineer

FICO

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Sonar Source - Solutions Engineer LATAM - English & Portuguese Speaking

Sonar Source

Austin, Texas, United States (On-Site)
1 Week ago
Zones - Client Solutions Architect

Zones

Baltimore, Maryland, United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

YouGov - Data Analytics and Insights Consultant

YouGov

Milan, Lombardy, Italy (Hybrid)
1 Month ago
Roof Stacks - Senior Platform Engineer

Roof Stacks

Istanbul, İstanbul, Türkiye (On-Site)
4 Months ago
Assystems - Senior Software Engineer

Assystems

Gurugram, Haryana, India (On-Site)
7 Months ago
playrix  - Principal Golang Engineer (Cross-Game Server)

playrix

Ireland (Remote)
2 Months ago
smarsh - Cloud Engineer III-Observability

smarsh

India (Hybrid)
5 Months ago
Devoteam - Cloud Data Engineer SQL GCP

Devoteam

Beni-Mellal, Béni Mellal-Khenifra, Morocco (On-Site)
5 Months ago
version 1 - Databricks Engineer

version 1

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Barracuda - Manager, Technical Support

Barracuda

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Lulalend - Senior Credit Risk Analyst

Lulalend

Cape Town, Western Cape, South Africa (On-Site)
2 Weeks ago
Hashlist - Senior Data Engineer

Hashlist

Pune, Maharashtra, India (Hybrid)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Foster City, California, United States

Epic Games - Senior Data Analyst, Unreal Engine & Creator Products

Epic Games

Cary, North Carolina, United States (On-Site)
5 Months ago
Toast - Retail Account Executive

Toast

Green Bay, Wisconsin, United States (On-Site)
1 Week ago
Greenworks Sunrise Global Marketing - Material Handler

Greenworks Sunrise Global Marketing

Morganton, North Carolina, United States (On-Site)
3 Weeks ago
OKX - Director, Human Resources Business Partner - P&E

OKX

San Jose, California, United States (On-Site)
1 Month ago
Nordson Corporation - Principal Field Service Technician

Nordson Corporation

Chandler, Arizona, United States (On-Site)
3 Weeks ago
Evolution  - Online Game Presenter (Receptionist Alternative) No Experience Necessary, $20-$25 hr

Evolution

Atlantic City, New Jersey, United States (On-Site)
8 Months ago
Monstera Games - Cloud Engineer

Monstera Games

Boston, Massachusetts, United States (On-Site)
3 Days ago
Gearbox - Visual Effects Artist

Gearbox

Frisco, Texas, United States (On-Site)
1 Year ago
studio Frog  - Data Scientist

studio Frog

Seattle, Washington, United States (On-Site)
2 Days ago
hogarth - Senior Financial Analyst - Rev Ops

hogarth

New York, United States (Hybrid)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Palo Alto Networks - Marketplace Operations Manager (Cloud Service Providers)

Palo Alto Networks

Munich, Bavaria, Germany (On-Site)
1 Month ago
Zscaler - Senior DevOps Engineer

Zscaler

Ramat Gan, Tel Aviv District, Israel (Hybrid)
2 Weeks ago
London stock Exchange - Developer Platform Engineer

London stock Exchange

London, England, United Kingdom (On-Site)
1 Month ago
Google - Software Engineer III, Google Cloud Platforms

Google

San Francisco, California, United States (On-Site)
1 Month ago
Synechron - DevOps Engineer (Cloud & Automation Expert)

Synechron

Pune, Maharashtra, India (On-Site)
2 Weeks ago
NVIDIA - Senior Solutions Architect, Omniverse Platform

NVIDIA

Beijing, Beijing, China (On-Site)
3 Months ago
Reddit - Senior Site Reliability Engineer

Reddit

Amsterdam, North Holland, Netherlands (On-Site)
1 Month ago
Epic Games - Automation Engineer

Epic Games

Cary, North Carolina, United States (On-Site)
3 Months ago
bytedance - Backend Software Engineer - Global E-Commerce Supply Chain Merchant Platform

bytedance

Seattle, Washington, United States (On-Site)
7 Months ago
Nice - Solution Engineer

Nice

United States (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

About The Company

Zoox is transforming mobility-as-a-service by developing a fully autonomous, purpose-built fleet designed for AI to drive and humans to enjoy.

Foster City, California, United States (Hybrid)

Foster City, California, United States (On-Site)

Foster City, California, United States (On-Site)

Foster City, California, United States (Hybrid)

Foster City, California, United States (Hybrid)

Foster City, California, United States (Hybrid)

Foster City, California, United States (Hybrid)

Foster City, California, United States (Hybrid)

Foster City, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by zoox