Staff Software Engineer, ML Performance Optimization

1 Minute ago • All levels • Research Development • $210,000 PA - $289,000 PA

Job Summary

Job Description

Zoox is building autonomous robotaxis for safe, reliable, clean, and enjoyable transportation. The ML Platform team is crucial for enabling ML and CV innovations. This role leads ML Performance Optimization, aiming to make the Training and Inference platform fast and efficient. You will work across all ML teams, including Perception, Prediction, Planner, Simulation, Collision Avoidance, and Advanced Hardware Engineering, significantly impacting ML practices at Zoox. The team builds and operates core ML tools, deep learning frameworks, and inference systems, offering significant growth opportunities.
Must have:
  • Develop and execute a strategic vision for ML Performance Optimization.
  • Lead the design, implementation, and operation of cutting-edge ML Training and inference performance optimization techniques.
  • Collaborate closely with cross-functional teams to define requirements and align on architectural decisions.
  • Enable and mentor engineers in the team.
  • Strong experience with training frameworks like PyTorch, leveraging GPUs efficiently for distributed model training.
  • Experience with GPU-accelerated inference using TensorRT, Ray Serve, or similar frameworks.
  • Experience using profiling tools like NVIDIA's Nsight or PyTorch's Profiler.
  • Proficient in Python and C++.
  • Experience with model compression techniques.
Good to have:
  • 10+ years of total experience, including 4+ years on large-scale model training or inference platforms.
  • Excellent leadership skills with a demonstrated ability to lead high-performing engineering teams.
Perks:
  • Amazon Restricted Stock Units (RSUs)
  • Zoox Stock Appreciation Rights
  • Sign-on bonus (may be offered)
  • Paid time off (sick leave, vacation, bereavement)
  • Unpaid time off
  • Health insurance
  • Long-term care insurance
  • Long-term and short-term disability insurance
  • Life insurance

Job Details

Zoox is on a mission to reimagine transportation and build, from the ground up, autonomous robotaxis that are safe, reliable, clean, and enjoyable for everyone. We are still in the early stages of deploying our robotaxis on public roads, and it is a great time to join Zoox and have a significant impact in executing this mission. The ML Platform team at Zoox plays a crucial role in enabling innovations in ML and CV to make autonomous driving as seamless as possible.

The Opportunity

Are you excited to lead our ML Performance Optimization initiatives and make the Training and Inference platform that enables autonomous driving as fast and efficient as possible? You will work across all ML teams within Zoox (Perception, Prediction, Planner, Simulation, Collision Avoidance, and the Advanced Hardware Engineering group) and have the opportunity to significantly push the boundaries of how ML is practiced within Zoox.

We build and operate the base layer of ML tools, deep learning frameworks, and inference systems used by our applied research teams for in- and off-vehicle ML use cases. You will lead a team of strong software engineers and act as a force multiplier for our internal customers. This team has a lot of growth opportunities as we expand our robotaxi deployments and venture into new ML domains. If you want to learn more about our stack behind autonomous driving, please look here.

In this role, you will:

  • Develop and execute a strategic vision for the ML Performance Optimization team to unlock ML innovation in autonomous driving and rider experience.
  • Lead the design, implementation, and operation of cutting-edge ML Training and inference performance optimization techniques.
  • Collaborate closely with cross-functional teams, including ML researchers, software engineers, data engineers, and hardware engineers, to define requirements and align on architectural decisions.
  • Enable the engineers in the team to grow their careers by providing technical guidance and mentorship.

Qualifications

  • Strong experience with training frameworks like PyTorch, leveraging GPUs efficiently for distributed model training.
  • Experience with GPU-accelerated inference using TensorRT, Ray Serve, or similar frameworks.
  • Experience using profiling tools like NVIDIA's Nsight or PyTorch's Profiler for identifying model training and serving bottlenecks.
  • Proficient in Python and C++.
  • Experience with model compression techniques to reduce model size and improve performance.

Bonus Qualifications

  • 10+ years of total experience, including 4+ years of working on large-scale model training or inference platforms.
  • Excellent leadership skills with a demonstrated ability to lead high-performing engineering teams.

Base Salary Range

There are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign-on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance. The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.

Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long-term care insurance, long-term and short-term disability insurance, and life insurance.

Accommodations

If you need an accommodation to participate in the application or interview process, please reach out to accommodations@zoox.com or your assigned recruiter.

A Final Note:

You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

About The Company

Zoox is transforming mobility-as-a-service by developing a fully autonomous, purpose-built fleet designed for AI to drive and humans to enjoy.
