Site Reliability Engineer

1 week ago • 6+ years • DevOps • $165,000 - $222,000 per annum

Job Summary

Job Description

Zoox is seeking a platform/site reliability engineer to ensure the uptime of services crucial to autonomous vehicle development. This role involves designing, deploying, operating, and continuously improving fault-tolerant systems. The engineer will work with large data volumes and compute-intensive tasks on CPUs and GPUs. Key responsibilities include designing scalable and reliable systems, optimizing performance, developing monitoring and alerting, collaborating with software engineers for deployment automation, conducting root cause analysis, and implementing disaster recovery plans. Experience with cloud platforms (AWS, GCP, Azure), Kubernetes, networking, storage, databases, and programming languages like Python, Go, C/C++, or Java is required.
Must have:
  • 6+ years of site reliability engineering experience
  • Experience with large-scale distributed systems
  • Proven experience with cloud platforms (AWS, GCP, Azure)
  • Expertise in Kubernetes
  • Deep understanding of networking, storage, databases
  • Strong programming skills (Python, Go, C/C++, Java)
  • Experience with infrastructure as code tools (Ansible, Salt, Terraform, CloudFormation)
Good to have:
  • Experience in automotive/autonomous vehicle industry
  • Knowledge of security best practices
  • Previous leadership/mentorship experience
Perks:
  • Paid time off
  • Zoox Stock Appreciation Rights
  • Amazon Restricted Stock Units (RSUs)
  • Health insurance
  • Long-term care insurance
  • Disability insurance
  • Life insurance
  • Sign-on bonus

Job Details

Zoox is looking for a platform/site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service, from designing fault-tolerant systems that are easy to maintain through deployment, operation, and continual improvement. Zoox is a robotics company and our ethos of automation extends throughout the infrastructure components we build. Be prepared to work with systems handling large volumes of data and data-processing pipelines performing compute-intensive tasks on CPUs and GPUs.


In this role, you will:
  • Design and implement highly scalable and reliable systems to support Zoox's autonomous vehicle platform.
  • Optimize system performance, reliability, and scalability.
  • Develop and maintain monitoring, alerting, and reporting systems to ensure proactive identification and resolution of issues.
  • Collaborate with software engineering teams to improve deployment processes and automation.
  • Conduct root cause analysis of production issues and implement corrective actions.
  • Implement disaster recovery and business continuity plans.


Qualifications
  • 6+ years of experience in site reliability engineering or a similar role, with a strong background in working with large-scale distributed systems.
  • Proven experience with cloud platforms such as AWS, GCP, or Azure.
  • Expertise in container orchestration technologies like Kubernetes.
  • Deep understanding of networking, storage, and database technologies.
  • Strong programming skills in languages such as Python, Go, C/C++ or Java.
  • Experience with infrastructure as code tools such as Ansible, Salt, Terraform or CloudFormation.


Preferred Qualifications
  • Experience in the automotive or autonomous vehicle industry.
  • Knowledge of security best practices and compliance requirements.
  • Previous experience in a leadership or mentorship role.


Compensation

There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. The salary range for this position is $165,000 to $222,000. A sign-on bonus may be offered as part of the compensation package. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.

 

Zoox also offers a comprehensive package of benefits including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.


About Zoox

Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.


Accommodations

If you need an accommodation to participate in the application or interview process, please reach out to accommodations@zoox.com or your assigned recruiter.


A Final Note:

You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

About The Company

Zoox is transforming mobility-as-a-service by developing a fully autonomous, purpose-built fleet designed for AI to drive and humans to enjoy.
