Machine Learning System Tooling Tech Lead, Silicon

7 Hours ago • 5 Years + • Artificial Intelligence • Research & Development

Job Summary

Job Description

This Machine Learning System Tooling Tech Lead role at Google involves designing, developing, and maintaining tools and infrastructure for analyzing ML workloads and hardware performance. Responsibilities include developing power and performance models, visualizations, and dashboards to communicate performance insights. The role requires building models and benchmarks for workload analysis, driving architectural decisions, and collaborating with cross-functional teams to improve workload analysis flows. Candidates need expertise in computer architecture, ML accelerators, and tooling development for power, performance, and architecture analysis. Experience with ML algorithms and compiler optimization is also crucial.
Must have:
  • 5+ years experience with computer architecture
  • Experience with ML accelerators
  • Tooling development for power/performance analysis
  • Develop and maintain performance models
  • Collaborate with cross-functional teams
Good to have:
  • Master's or PhD in performance evaluation for ML systems
  • Experience writing ML algorithms (NLP, image processing)
  • Compiler architecting and optimization experience
  • Understanding of compiler flows and TensorFlow

Job Details


Minimum qualifications:

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience.
  • 5 years of experience with computer architecture concepts, including microarchitecture, cache hierarchy, pipelining, and memory subsystems.

Preferred qualifications:

  • Master’s degree or PhD with an emphasis on performance evaluation for Machine Learning (ML) systems.
  • Experience with ML accelerators (e.g., having worked on ML software models or accelerator architectures).
  • Experience writing ML algorithms for e.g., recommendation systems, Natural Language Processing (NLP), image and vision.
  • Experience in tooling development for power, performance and architecture analysis.
  • Experience in architecting and optimizing compilers.
  • Understanding of compiler flows, software involved in translating a high-level language (e.g., TensorFlow) to hardware instructions.

About the job

Be part of a team that pushes boundaries, developing custom silicon solutions that power the future of Google's direct-to-consumer products. You'll contribute to the innovation behind products loved by millions worldwide. Your expertise will shape the next generation of hardware experiences, delivering unparalleled performance, efficiency, and integration.

Google's mission is to organize the world's information and make it universally accessible and useful. Our team combines the best of Google AI, Software, and Hardware to create radically helpful experiences. We research, design, and develop new technologies and hardware to make computing faster, seamless, and more powerful. We aim to make people's lives better through technology.

Responsibilities

  • Design, develop, and maintain tools and infrastructure for analyzing Machine Learning (ML) workloads and hardware performance.
  • Develop and maintain power and performance models.
  • Develop visualizations and dashboards to communicate performance insights to engineers.
  • Build models, benchmarks for workload analysis and help to drive architectural decisions.
  • Collaborate with cross-functional teams to improve the workload analysis flows, including debuggability and tracing.

Similar Jobs

ByteDance - Senior Software Engineer - Generative AI

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
ByteDance - AI Security Researcher - Security - San Jose

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Weeks ago
Krafton  - Deep Learning Engineer - RL

Krafton

Seoul, South Korea (On-Site)
3 Months ago
ByteDance - Applied Scientist Intern (Computational Modeling & Optimization)

ByteDance

San Jose, California, United States (On-Site)
2 Days ago
Pika - Research Scientist

Pika

Palo Alto, California, United States (On-Site)
4 Months ago
Lionbridge Games - Games Language AI Specialist (Linguist)

Lionbridge Games

Masovian Voivodeship, Poland (On-Site)
2 Months ago
Meta - Software Engineer, Machine Learning

Meta

Redmond, Washington, United States (On-Site)
5 Months ago
Ubisoft - Senior C++ Programmer - Machine Learning Content Creation Technology Group

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Weeks ago
Microsoft - Member of Technical Staff, AI Pre-Training

Microsoft

Zürich, Zurich, Switzerland (On-Site)
1 Day ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (MS)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
NVIDIA - Senior Software Engineer, Deep Learning Inference, TensorRT

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
NVIDIA - DGX Cloud Infrastructure Engineering Intern - Fall 2025

NVIDIA

Santa Clara, California, United States (On-Site)
1 Week ago
Canva - Machine Learning Engineer - Ecosystem Experiences

Canva

Surry Hills, New South Wales, Australia (Remote)
2 Weeks ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Ubisoft - Scientifique principal en données ML _ Groupe Technologique Content Creation

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Months ago
Arkose Labs - Senior Machine Learning Researcher

Arkose Labs

Pune, Maharashtra, India (Hybrid)
6 Months ago
NVIDIA - Software Engineering Intern, AI Engineering - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
ByteDance - Research Scientist, Foundation Model, Music Intelligence

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
NVIDIA - Principal Engineer

NVIDIA

(Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in New Taipei, New Taipei City, Taiwan

Trend Micro - Sr. Engineer

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago
Google - Software Engineering Manager, System Acceleration, Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
7 Hours ago
NVIDIA - Senior Tool and Methodology Development Software Engineer

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
3 Weeks ago
NVIDIA - Test Floor Engineer

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
2 Months ago
Trend Micro - Sr. Software Engineer (XDR for Networks)

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago
Garena - Garena - Strategy & Operations

Garena

Taipei City, Taiwan (On-Site)
4 Months ago
Corsair - Document Control Assistant

Corsair

Taiwan (On-Site)
3 Weeks ago
Appier - Campaign Executive

Appier

Taipei City, Taiwan (On-Site)
3 Months ago
WildBrain - Licensing Manager

WildBrain

Taipei City, Taiwan (Hybrid)
1 Day ago
Trend Micro - (Sr.) Cloud Developer (Security Playbooks)

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Inworld AI - Staff / Principal Machine Learning Engineer - USA

Inworld AI

Mountain View, California, United States (Remote)
5 Months ago
Lionbridge Games - Games Language AI Specialist (Linguist)

Lionbridge Games

Masovian Voivodeship, Poland (On-Site)
1 Day ago
ByteDance - Research Engineer- Foundation Model AI Platform- Seattle

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Meta - Software Engineer, Machine Learning

Meta

Fremont, California, United States (Remote)
5 Months ago
ByteDance - Research Scientist, Vision Foundation Model

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Google - Customer Engineer II, Cloud AI

Google

San Francisco, California, United States (On-Site)
8 Hours ago
Google - Software Engineer III, AI/ML, Geo

Google

Mountain View, California, United States (On-Site)
8 Hours ago
ByteDance - Machine Learning Engineer, Tech Lead - Engineering Efficiency and AI Code Assistant

ByteDance

San Jose, California, United States (On-Site)
4 Months ago
Microsoft - Applied Researcher II

Microsoft

Redmond, Washington, United States (On-Site)
16 Hours ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Dublin, County Dublin, Ireland (On-Site)

Sunnyvale, California, United States (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

Hyderabad, Telangana, India (On-Site)

Sunnyvale, California, United States (On-Site)

Sydney, New South Wales, Australia (On-Site)

Waterloo, Ontario, Canada (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug