Senior Staff Software Engineer, On-Device Machine Learning

1 Hour ago • 8-12 Years • Artificial Intelligence • $248,000 PA - $349,000 PA

Job Summary

Job Description

This Senior Staff Software Engineer role at Google focuses on on-device machine learning infrastructure. Responsibilities include developing and guiding critical projects in areas like LiteRT, MediaPipe, and GenAI Inference, enabling on-device deployment of models like Gemini Nano and Gemma across various accelerators. The role requires optimizing on-device model inference and improving performance through model representation, runtime, and kernel implementation. The ideal candidate will have extensive experience in software development, ML design, and leading technical projects, along with expertise in ML frameworks (PyTorch, TensorFlow, etc.) and on-device ML SDKs/tooling. Experience with Android, iOS, and web browser deployments is also crucial.
Must have:
  • 8+ years software development experience (Python, C, C++)
  • 7+ years leading ML projects, design, and infrastructure experience
  • 5+ years testing and launching software products
  • Experience with ML frameworks (PyTorch, TensorFlow)
  • Performance analysis and optimization expertise
Good to have:
  • Master's/PhD in related field
  • 5+ years technical leadership experience
  • Experience with on-device ML SDKs (TensorFlow Lite, Core ML)
  • Knowledge of ML converters/compilers and runtimes
  • Understanding of Generative AI model architectures

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 8 years of experience with software development in one or more programming languages (e.g., Python, C, C++).
  • 7 years of experience leading technical project strategy, ML design, and working with industry-scale ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning).
  • 5 years of experience testing, and launching software products.
  • Experience in performance analysis and optimization including GPU programming, mobile GPU, system architecture, performance modeling, benchmarking, machine learning infrastructure, or other similar experience.
  • Experience with ML frameworks (e.g., PyTorch, JAX, TensorFlow).

Preferred qualifications:

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
  • 5 years of experience in a technical leadership role leading project teams, setting technical direction, excellent communication, collaboration, working in a complex, matrixed organization.
  • Experience with on-device ML SDKs/tooling (e.g., TensorFlow Lite, ExecuTorch, Core ML, SNPE/QNN).
  • In-depth knowledge of ML converters/compilers and runtimes, and hardware-accelerated ML inference techniques.
  • Understanding of Generative AI model architectures and their optimization for on-device execution.
  • Proven track record of leading and delivering successful ML projects focused on on-device deployment (Android, iOS, web browsers, or embedded devices).

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

Google AI Edge enables developers and Google products to deploy AI across mobile, web, and embedded with our AI edge stack - from low-code APIs to hardware specific acceleration libraries to achieve leading performance and device optionality at scale. Our team focuses on cross-platform infrastructure and solutions aligned with Google's biz needs: serving our Google Products across Android, iOS and Web, third-party developers, and building specialized solutions for Pixel.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

The US base salary range for this full-time position is $248,000-$349,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Develop and guide critical projects in Google's on-device ML infrastructure (e.g., LiteRT, MediaPipe, GenAI Inference).
  • Enable on-device deployment of key models, such as Gemini Nano and Gemma, across various accelerators (GPU/Pixel TPU/NPUs/CPU) on Android, Chrome, and more.
  • Improve performance of on-device model inference via optimizations in the model representation, on-device runtime, and kernel implementation.

Similar Jobs

NVIDIA - Senior Solution Engineer, Mission Control

NVIDIA

Santa Clara, California, United States (On-Site)
3 Weeks ago
Resemble AI - Deep Learning Speech Researcher

Resemble AI

Mountain View, California, United States (On-Site)
8 Months ago
Meta - Software Engineer, Machine Learning

Meta

Burlingame, California, United States (On-Site)
5 Months ago
Google - Customer Engineer, AI Infrastructure

Google

Seattle, Washington, United States (On-Site)
1 Hour ago
El Ranchito - Machine Learning Artist Intern

El Ranchito

Madrid, Community Of Madrid, Spain (On-Site)
1 Week ago
NVIDIA - Machine Learning Software Platform Architect

NVIDIA

Canada (On-Site)
1 Month ago
Tencent - Machine Learning Development Intern

Tencent

Auckland, Auckland, New Zealand (On-Site)
1 Month ago
Hudl - Staff Engineer - Machine Learning

Hudl

London, England, United Kingdom (Hybrid)
4 Months ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Bengaluru, Karnataka, India (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Match Group - Senior ML Software Engineering Team Leader

Match Group

Seoul, South Korea (Hybrid)
6 Months ago
Evolution - Data Scientist

Evolution

Warsaw, Masovian Voivodeship, Poland (Hybrid)
8 Months ago
Ubisoft - Lead R&D Scientist

Ubisoft

Shanghai, Shanghai, China (On-Site)
3 Weeks ago
NVIDIA - Senior Math Libraries Engineers - Python APIs

NVIDIA

Remote, Oregon, United States (Remote)
1 Month ago
Meta - Research Scientist Intern, Machine Perception for Input and Interaction (PhD)

Meta

Burlingame, California, United States (On-Site)
5 Months ago
Meta - Software Engineer, Machine Learning

Meta

Menlo Park, California, United States (On-Site)
5 Months ago
NVIDIA - Senior System Software Engineer - Triton Inference Server

NVIDIA

Washington, United States (Remote)
1 Month ago
ByteDance - Senior Machine Learning Engineer

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
ByteDance - Video Analysis and Quality Algorithm Engineer

ByteDance

Seattle, Washington, United States (On-Site)
2 Days ago
ByteDance - Video Analysis and Quality Algorithm Intern 2023 Summer/Fall (PHD)

ByteDance

San Diego, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Sunnyvale, California, United States

NVIDIA - Senior Systems Software Engineer, Data Center - CUDA

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Pika - Research Scientist

Pika

Palo Alto, California, United States (On-Site)
4 Months ago
Nintendo - Engineer (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
8 Months ago
Meta - Research Scientist, Machine Learning (PhD)

Meta

Pittsburgh, Pennsylvania, United States (On-Site)
5 Months ago
Onward Search - Editorial Coordinator

Onward Search

Santa Monica, California, United States (Hybrid)
2 Days ago
Next Level Business Services - oracle adf developer

Next Level Business Services

Miami, Florida, United States (On-Site)
6 Months ago
Capcom - Operations & Facilities Manager

Capcom

California, United States (On-Site)
1 Month ago
The Walt Disney Company - Content Programming Manager (Live News & Sports)

The Walt Disney Company

Glendale, California, United States (Hybrid)
2 Months ago
Microsoft - Support Escalation Manager - Service Manager

Microsoft

Texas, United States (Hybrid)
2 Hours ago
Google - Senior Software Engineer, Machine Learning, Google Ads

Google

New York, New York, United States (On-Site)
1 Hour ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

ByteDance - Research Scientist, AI Foundation

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Video Generative Model)

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
NVIDIA - Senior Deep Learning Engineer

NVIDIA

Redmond, Washington, United States (On-Site)
2 Weeks ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Vision Generative AI)

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
Saama Technologies,  Inc  - NLP Engineer

Saama Technologies, Inc

(Remote)
1 Month ago
Outlier - Software Engineer (Python)

Outlier

Faridabad, Haryana, India (Hybrid)
6 Months ago
Microsoft - Platform Engineering Manager

Microsoft

Redmond, Washington, United States (Hybrid)
3 Weeks ago
Google - Software Engineer III, AI/ML, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
1 Hour ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Hyderabad, Telangana, India (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Portland, Oregon, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Taipei City, Taiwan (On-Site)

Atlanta, Georgia, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug