Machine Learning System Tooling Tech Lead, Silicon

1 Month ago • 5 Years + • Artificial Intelligence • Research & Development

Job Summary

Job Description

This Machine Learning System Tooling Tech Lead role at Google involves designing, developing, and maintaining tools and infrastructure for analyzing ML workloads and hardware performance. Responsibilities include developing power and performance models, visualizations, and dashboards to communicate performance insights. The role requires building models and benchmarks for workload analysis, driving architectural decisions, and collaborating with cross-functional teams to improve workload analysis flows. Candidates need expertise in computer architecture, ML accelerators, and tooling development for power, performance, and architecture analysis. Experience with ML algorithms and compiler optimization is also crucial.
Must have:
  • 5+ years experience with computer architecture
  • Experience with ML accelerators
  • Tooling development for power/performance analysis
  • Develop and maintain performance models
  • Collaborate with cross-functional teams
Good to have:
  • Master's or PhD in performance evaluation for ML systems
  • Experience writing ML algorithms (NLP, image processing)
  • Compiler architecting and optimization experience
  • Understanding of compiler flows and TensorFlow

Job Details


Minimum qualifications:

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience.
  • 5 years of experience with computer architecture concepts, including microarchitecture, cache hierarchy, pipelining, and memory subsystems.

Preferred qualifications:

  • Master’s degree or PhD with an emphasis on performance evaluation for Machine Learning (ML) systems.
  • Experience with ML accelerators (e.g., having worked on ML software models or accelerator architectures).
  • Experience writing ML algorithms for e.g., recommendation systems, Natural Language Processing (NLP), image and vision.
  • Experience in tooling development for power, performance and architecture analysis.
  • Experience in architecting and optimizing compilers.
  • Understanding of compiler flows, software involved in translating a high-level language (e.g., TensorFlow) to hardware instructions.

About the job

Be part of a team that pushes boundaries, developing custom silicon solutions that power the future of Google's direct-to-consumer products. You'll contribute to the innovation behind products loved by millions worldwide. Your expertise will shape the next generation of hardware experiences, delivering unparalleled performance, efficiency, and integration.

Google's mission is to organize the world's information and make it universally accessible and useful. Our team combines the best of Google AI, Software, and Hardware to create radically helpful experiences. We research, design, and develop new technologies and hardware to make computing faster, seamless, and more powerful. We aim to make people's lives better through technology.

Responsibilities

  • Design, develop, and maintain tools and infrastructure for analyzing Machine Learning (ML) workloads and hardware performance.
  • Develop and maintain power and performance models.
  • Develop visualizations and dashboards to communicate performance insights to engineers.
  • Build models, benchmarks for workload analysis and help to drive architectural decisions.
  • Collaborate with cross-functional teams to improve the workload analysis flows, including debuggability and tracing.

Similar Jobs

Google - Software Engineer III, Machine Learning, Search

Google

Seattle, Washington, United States (On-Site)
6 Months ago
The Walt Disney Company - Sr Machine Learning Engineer

The Walt Disney Company

Santa Monica, California, United States (On-Site)
6 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Stonewall Collision & Auto Painting - Senior Data Scientist

Stonewall Collision & Auto Painting

Vijayawada, Andhra Pradesh, India (On-Site)
8 Months ago
ByteDance - Research Scientist, Multimodality

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
CharacterAI - Research Engineer, ML Systems

CharacterAI

New York, New York, United States (On-Site)
1 Month ago
ByteDance - AI Security Researcher - Security - San Jose

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Google - Staff Software Engineer, Network Interface Card Firmware, SmartNIC

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Google - Field Solutions Architect, Generative AI, Google Cloud

Google

Hamburg, Hamburg, Germany (On-Site)
1 Month ago
Equivalent Jobs - DL RESEARCHER

Equivalent Jobs

(Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Software Engineer - Applied Machine Learning, Engine

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
NVIDIA - Principal Engineer

NVIDIA

United States (Remote)
3 Months ago
Netflix - Software Engineer L4, Machine Learning Platform (Metaflow)

Netflix

Los Gatos, California, United States (On-Site)
2 Months ago
Canva - Staff Machine Learning Engineer - User Voice

Canva

Melbourne, Victoria, Australia (Remote)
1 Month ago
Tencent - Senior Staff Researcher

Tencent

California, United States (On-Site)
2 Months ago
Google - Senior Software Engineer, Machine Learning (Recommendations, Rankings, and Predictions)

Google

Mountain View, California, United States (On-Site)
1 Month ago
Canva - Senior Machine Learning Engineer - Specialist Platform and Experience

Canva

Melbourne, Victoria, Australia (Remote)
1 Month ago
Meta - Research Intern, Computer Vision for Egocentric Representation Learning (PhD)

Meta

Redmond, Washington, United States (On-Site)
6 Months ago
Attentive - Staff Machine Learning Engineer

Attentive

San Francisco, California, United States (Hybrid)
7 Months ago
ByteDance - Software Engineer, ML System Scheduling

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in New Taipei, New Taipei City, Taiwan

NVIDIA - Senior Signal and Power Integrity Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago
Google - Software Engineer II

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
Google - Product Manager I, Chrome OS Platform Enablement

Google

Taipei City, Taiwan (On-Site)
1 Month ago
NVIDIA - Senior Mixed Signal and Analog Circuit Designer

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
2 Months ago
Trend Micro - (Sr.) Software Engineer in Linux

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago
NVIDIA - Senior ASIC Verification Engineer, Coherent High Speed Interconnect

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
2 Months ago
Google - Silicon Physical Design CAD Engineer

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
Google - Software Engineer III, Embedded, Pixel Memory Management

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
Google - Senior Software Engineer, Media Routing, Android

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
Keywords Studios - Subtitling Project Coordinator - Asia

Keywords Studios

Taipei City, Taiwan (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Google - Technical Program Manager III, Machine Learning Infrastructure, Cloud AI Systems

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
ByteDance - Student Researcher Intern (Edge Research Project for General Intelligence)

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Meta - Research Scientist Intern, Machine Perception for Input and Interaction (PhD)

Meta

Sausalito, California, United States (On-Site)
6 Months ago
Krafton  - Deep Learning Engineer - Model Optimization

Krafton

Seoul, South Korea (On-Site)
1 Month ago
ByteDance - AI Security Researcher - Security - San Jose

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
NVIDIA - Machine Learning Engineer Intern - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
4 Months ago
Inworld AI - AI Trainer (Contractor) - Writing & Gaming

Inworld AI

Mountain View, California, United States (Remote)
1 Month ago
Google - Senior Developer Relations Engineer

Google

London, England, United Kingdom (On-Site)
1 Month ago
Henkel - Data Scientist-Intern

Henkel

Pune, Maharashtra, India (On-Site)
8 Months ago
Google - Staff Software Engineer, Embedded Systems

Google

Sunnyvale, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

London, England, United Kingdom (On-Site)

Fremont, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Reston, Virginia, United States (On-Site)

Sunnyvale, California, United States (On-Site)

New Taipei, New Taipei City, Taiwan (On-Site)

Dublin, County Dublin, Ireland (On-Site)

San Jose, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug