Senior Staff Software Engineer, On-Device Machine Learning

1 Month ago • 8-12 Years • Artificial Intelligence • $248,000 PA - $349,000 PA

Job Summary

Job Description

This Senior Staff Software Engineer role at Google focuses on on-device machine learning infrastructure. Responsibilities include developing and guiding critical projects in areas like LiteRT, MediaPipe, and GenAI Inference, enabling on-device deployment of models like Gemini Nano and Gemma across various accelerators. The role requires optimizing on-device model inference and improving performance through model representation, runtime, and kernel implementation. The ideal candidate will have extensive experience in software development, ML design, and leading technical projects, along with expertise in ML frameworks (PyTorch, TensorFlow, etc.) and on-device ML SDKs/tooling. Experience with Android, iOS, and web browser deployments is also crucial.
Must have:
  • 8+ years software development experience (Python, C, C++)
  • 7+ years leading ML projects, design, and infrastructure experience
  • 5+ years testing and launching software products
  • Experience with ML frameworks (PyTorch, TensorFlow)
  • Performance analysis and optimization expertise
Good to have:
  • Master's/PhD in related field
  • 5+ years technical leadership experience
  • Experience with on-device ML SDKs (TensorFlow Lite, Core ML)
  • Knowledge of ML converters/compilers and runtimes
  • Understanding of Generative AI model architectures

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 8 years of experience with software development in one or more programming languages (e.g., Python, C, C++).
  • 7 years of experience leading technical project strategy, ML design, and working with industry-scale ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning).
  • 5 years of experience testing, and launching software products.
  • Experience in performance analysis and optimization including GPU programming, mobile GPU, system architecture, performance modeling, benchmarking, machine learning infrastructure, or other similar experience.
  • Experience with ML frameworks (e.g., PyTorch, JAX, TensorFlow).

Preferred qualifications:

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
  • 5 years of experience in a technical leadership role leading project teams, setting technical direction, excellent communication, collaboration, working in a complex, matrixed organization.
  • Experience with on-device ML SDKs/tooling (e.g., TensorFlow Lite, ExecuTorch, Core ML, SNPE/QNN).
  • In-depth knowledge of ML converters/compilers and runtimes, and hardware-accelerated ML inference techniques.
  • Understanding of Generative AI model architectures and their optimization for on-device execution.
  • Proven track record of leading and delivering successful ML projects focused on on-device deployment (Android, iOS, web browsers, or embedded devices).

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

Google AI Edge enables developers and Google products to deploy AI across mobile, web, and embedded with our AI edge stack - from low-code APIs to hardware specific acceleration libraries to achieve leading performance and device optionality at scale. Our team focuses on cross-platform infrastructure and solutions aligned with Google's biz needs: serving our Google Products across Android, iOS and Web, third-party developers, and building specialized solutions for Pixel.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

The US base salary range for this full-time position is $248,000-$349,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Develop and guide critical projects in Google's on-device ML infrastructure (e.g., LiteRT, MediaPipe, GenAI Inference).
  • Enable on-device deployment of key models, such as Gemini Nano and Gemma, across various accelerators (GPU/Pixel TPU/NPUs/CPU) on Android, Chrome, and more.
  • Improve performance of on-device model inference via optimizations in the model representation, on-device runtime, and kernel implementation.

Similar Jobs

Hitachi - Senior AI Data Scientist

Hitachi

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
The Walt Disney Company - Principal Machine Learning Engineer

The Walt Disney Company

San Francisco, California, United States (On-Site)
1 Month ago
ByteDance - Engineering Manager - Applied Machine Learning Algorithm

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Vigaet - Internship -AI Agents

Vigaet

(Remote)
6 Months ago
Krafton  - Lead of Physical AI Agent, Research Scientist

Krafton

Seoul, South Korea (On-Site)
1 Month ago
Microsoft - Research Engineering Intern

Microsoft

North Holland, Netherlands (On-Site)
1 Month ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
4 Months ago
Meta - Software Engineer, Machine Learning

Meta

Menlo Park, California, United States (On-Site)
6 Months ago
NVIDIA - Applied Research Intern - 2025

NVIDIA

Yerevan, Yerevan, Armenia (On-Site)
3 Months ago
Meta - Research Scientist Intern, Language and Multimodal Research for MetaAI (PhD)

Meta

New York, New York, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Senior Software Engineer, Machine Learning (Recommendations, Rankings, and Predictions)

Google

Mountain View, California, United States (On-Site)
1 Month ago
Twitch - Sr. Applied Scientist

Twitch

San Francisco, California, United States (On-Site)
1 Month ago
ByteDance - Software Engineer in ML Engineering Platform

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
NVIDIA - Deep Learning Software Engineering Intern, Test Development - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
Ludeo - Senior Data Scientist

Ludeo

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
Google - Field Solutions Architect, Generative AI, Google Cloud

Google

London, England, United Kingdom (On-Site)
1 Month ago
ByteDance - Video Analysis and Quality Algorithm Intern 2023 Summer/Fall (PHD)

ByteDance

San Diego, California, United States (On-Site)
6 Months ago
NVIDIA - Software Engineering Intern, Deep Learning Accelerator - Fall 2025

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Outlier - Software Engineer (Python)

Outlier

Faridabad, Haryana, India (Hybrid)
7 Months ago
Ciklum - Senior Data Scientist

Ciklum

Chennai, Tamil Nadu, India (Hybrid)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Sunnyvale, California, United States

ByteDance - Research Scientist, Data Management and Security - Infrastructure System Lab

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Google - Open Career Opportunities, Autonomous (Self-Driving) Vehicle Jobs, Waymo

Google

Novi, Michigan, United States (On-Site)
6 Months ago
Aristocrat Gaming - Technical Product Owner

Aristocrat Gaming

Las Vegas, Nevada, United States (Hybrid)
3 Months ago
Samsung Semiconductor - Intern, Enterprise DRAM Product Marketing

Samsung Semiconductor

San Jose, California, United States (Hybrid)
3 Months ago
Tencent - Product Lead, Game Marketing Technology - SaaS Platform

Tencent

Los Angeles, California, United States (On-Site)
2 Months ago
The Walt Disney Company - VP, Finance Transformation

The Walt Disney Company

Burbank, California, United States (On-Site)
1 Month ago
Google - Data Center Facilities Technician, Mechanical

Google

Pryor, Oklahoma, United States (On-Site)
1 Month ago
ByteDance - Cloud Network Engineer

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
The Walt Disney Company - Director, Inventory Analytics

The Walt Disney Company

New York, New York, United States (On-Site)
1 Month ago
Modulate - Solutions Engineer

Modulate

Somerville, Massachusetts, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Microsoft - Member of Technical Staff, AI

Microsoft

Mountain View, California, United States (On-Site)
1 Month ago
Razer - Solutions Architect

Razer

Singapore (On-Site)
7 Months ago
Ubisoft - Senior C++ Programmer - Machine Learning

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago
Google - Senior Technical Solutions Consultant, Cloud Risk AI

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
AI Fund - AI Fund-Principal

AI Fund

Palo Alto, California, United States (Hybrid)
7 Months ago
Keywords Studios - Research Associate - AI

Keywords Studios

(Remote)
2 Months ago
Google - Field Solutions Architect, Generative AI, Google Cloud

Google

London, England, United Kingdom (On-Site)
1 Month ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Google - Creative Technologist, Demos and Experiments

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Interface AI - Staff Software Engineer, Backend

Interface AI

United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

London, England, United Kingdom (On-Site)

Fremont, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Reston, Virginia, United States (On-Site)

Sunnyvale, California, United States (On-Site)

New Taipei, New Taipei City, Taiwan (On-Site)

Dublin, County Dublin, Ireland (On-Site)

San Jose, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug