Site Reliability Engineer

2 Months ago • 6 Years + • Devops • $165,000 PA - $222,000 PA

Job Summary

Job Description

Zoox is seeking a platform/site reliability engineer to ensure the uptime of services crucial to autonomous vehicle development. This role involves designing, deploying, operating, and continuously improving fault-tolerant systems. The engineer will work with large data volumes and compute-intensive tasks on CPUs and GPUs. Key responsibilities include designing scalable and reliable systems, optimizing performance, developing monitoring and alerting, collaborating with software engineers for deployment automation, conducting root cause analysis, and implementing disaster recovery plans. Experience with cloud platforms (AWS, GCP, Azure), Kubernetes, networking, storage, databases, and programming languages like Python, Go, C/C++, or Java is required.
Must have:
  • 6+ years of site reliability engineering experience
  • Experience with large-scale distributed systems
  • Proven experience with cloud platforms (AWS, GCP, Azure)
  • Expertise in Kubernetes
  • Deep understanding of networking, storage, databases
  • Strong programming skills (Python, Go, C/C++, Java)
  • Experience with infrastructure as code tools (Ansible, Salt, Terraform, CloudFormation)
Good to have:
  • Experience in automotive/autonomous vehicle industry
  • Knowledge of security best practices
  • Previous leadership/mentorship experience
Perks:
  • Paid time off
  • Zoox Stock Appreciation Rights
  • Amazon Restricted Stock Units (RSUs)
  • Health insurance
  • Long-term care insurance
  • Disability insurance
  • Life insurance
  • Sign-on bonus

Job Details

Zoox is looking for a platform/site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant through deployment, operation, and continual improvement. Zoox is a robotics company and our ethos of automation extends throughout the infrastructure components we build. Be prepared to work with systems handling large volumes of data and data-processing pipelines performing compute-intensive tasks on CPUs and GPUs.


In this role, you will:
  • Design and implement highly scalable and reliable systems to support Zoox's autonomous vehicle platform.
  • Optimize system performance, reliability, and scalability.
  • Develop and maintain monitoring, alerting, and reporting systems to ensure proactive identification and resolution of issues.
  • Collaborate with software engineering teams to improve deployment processes and automation.
  • Conduct root cause analysis of production issues and implement corrective actions.
  • Implement disaster recovery and business continuity plans.


Qualifications
  • 6+ years of experience in site reliability engineering or a similar role, with a strong background in working with large-scale distributed systems.
  • Proven experience with cloud platforms such as AWS, GCP, or Azure.
  • Expertise in container orchestration technologies like Kubernetes.
  • Deep understanding of networking, storage, and database technologies.
  • Strong programming skills in languages such as Python, Go, C/C++ or Java.
  • Experience with infrastructure as code tools such as Ansible, Salt, Terraform or CloudFormation.


Preferred Qualifications
  • Experience in the automotive or autonomous vehicle industry.
  • Knowledge of security best practices and compliance requirements.
  • Previous experience in a leadership or mentorship role.


Compensation

There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. The salary range for this position is $165,000 to $222,000. A sign-on bonus may be offered as part of the compensation package. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.

 

Zoox also offers a comprehensive package of benefits including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.


About Zoox

Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.


Follow us on LinkedIn


Accommodations

If you need an accommodation to participate in the application or interview process please reach out to accommodations@zoox.com or your assigned recruiter.


A Final Note:

You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

Similar Jobs

SciPlay - Director, Software Engineering

SciPlay

Cedar Falls, Iowa, United States (Hybrid)
2 Months ago
Luxoft - Regular Android HMI Architect

Luxoft

Cairo, Cairo Governorate, Egypt (On-Site)
8 Months ago
bytedance - Machine Learning Engineer - Pico Perception

bytedance

San Jose, California, United States (On-Site)
4 Months ago
Qualcomm - Senior AI Camera Systems Engineer

Qualcomm

Hyderabad, Telangana, India (On-Site)
3 Months ago
Mapbox - Software Development Engineer II, Guidance (C++)

Mapbox

United States (Remote)
1 Month ago
Ajmera Infotech - Backend Engineer – Build fail-proof systems at global scale

Ajmera Infotech

Austin, Texas, United States (On-Site)
1 Month ago
Attio - Site Reliability Engineer

Attio

London, England, United Kingdom (Hybrid)
1 Month ago
easygo - Staff DevOps Engineer - Core Infrastructure

easygo

Melbourne, Victoria, Australia (On-Site)
1 Month ago
Workato - Senior Automation Engineer

Workato

Hyderabad, Telangana, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Capgemini - Android Middleware/Framework

Capgemini

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Rippling - Senior Forward Deployed Engineer

Rippling

San Francisco, California, United States (On-Site)
7 Months ago
Airbyte - Solutions Engineer

Airbyte

San Francisco, California, United States (On-Site)
3 Months ago
CGS Carrers - Technical Support Architect

CGS Carrers

Braga, Braga, Portugal (Hybrid)
3 Months ago
bytedance - Backend Software Engineer - CapCut - San Jose

bytedance

San Jose, California, United States (On-Site)
9 Months ago
playthree - Unity Game Developer

playthree

London, England, United Kingdom (Hybrid)
3 Months ago
Apple - RF System Integration Engineer - Cellular

Apple

Cupertino, California, United States (On-Site)
3 Months ago
bytedance - Software Engineer Graduate (Multi Cloud CDN)

bytedance

San Jose, California, United States (On-Site)
4 Months ago
Thatgamecompany - Senior DevOps Engineer (LiveOps)

Thatgamecompany

Shanghai, Shanghai, China (On-Site)
4 Months ago
bytedance - Research Scientist Graduate, (Distributed NoSQL Database Systems) - 2026 Start (PhD)

bytedance

Seattle, Washington, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Foster City, California, United States

Gupta Media - Account Director, Paid Media

Gupta Media

Boston, Massachusetts, United States (On-Site)
4 Months ago
world relief - Asylum Seeker Case Worker

world relief

Chicago, Illinois, United States (On-Site)
1 Month ago
Apple - Apple Watch System Validation - Power Engineer

Apple

Cupertino, California, United States (On-Site)
3 Months ago
Yodlee - Senior Copywriter

Yodlee

Franklin, Tennessee, United States (On-Site)
2 Months ago
Roblox - Director of Prepaid Global Operations

Roblox

San Mateo, California, United States (On-Site)
2 Months ago
HCL Tech - Senior Technical Lead - Spring Boot

HCL Tech

Colorado, United States (On-Site)
2 Months ago
Open Systems Technologies - Entry Level Manager

Open Systems Technologies

Nashville, Tennessee, United States (On-Site)
1 Month ago
eBay - Software Engineer 3

eBay

San Jose, California, United States (Hybrid)
4 Weeks ago
Whatnot - Technical Program Manager, Infrastructure

Whatnot

San Francisco, California, United States (On-Site)
3 Months ago
nissan - Regional Sales Operations Analyst 1

nissan

Atlanta, Georgia, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Apple - Senior ML Infrastructure Engineer

Apple

Cupertino, California, United States (On-Site)
2 Months ago
VGW - Senior DevOps Engineer

VGW

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
1 Month ago
Brillio - Full Stack/Architect (Python, React, Strapi, AWS, Terraform)

Brillio

New York, United States (Remote)
1 Month ago
Palo Alto Networks - Sr Principal FinOps/DevOps Engineer (Cortex)

Palo Alto Networks

Santa Clara, California, United States (On-Site)
2 Months ago
Apple - Site Reliability Engineer

Apple

Cupertino, California, United States (On-Site)
1 Month ago
Ajmera Infotech - Kubernetes Engineer

Ajmera Infotech

Hyderabad, Telangana, India (On-Site)
1 Month ago
Prophecy - Delivery Solution Architect

Prophecy

(Remote)
3 Months ago
Abridge - Senior Software Engineer, SRE

Abridge

San Francisco, California, United States (Hybrid)
3 Months ago
NVIDIA - Solutions Architect

NVIDIA

Taipei City, Taiwan (On-Site)
7 Months ago
Ramp - Software Engineer | Infrastructure

Ramp

New York, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Zoox is transforming mobility-as-a-service by developing a fully autonomous, purpose-built fleet designed for AI to drive and humans to enjoy.

Foster City, California, United States (On-Site)

Foster City, California, United States (On-Site)

Foster City, California, United States (On-Site)

Foster City, California, United States (Hybrid)

Foster City, California, United States (On-Site)

Foster City, California, United States (Hybrid)

Foster City, California, United States (Hybrid)

Foster City, California, United States (Hybrid)

Foster City, California, United States (Hybrid)

Foster City, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by zoox

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug