Machine Learning System Tooling Tech Lead, Silicon

2 Days ago β€’ 5 Years + β€’ Artificial Intelligence β€’ Research & Development

About the job

Job Description

As a Machine Learning System Tooling Tech Lead at Google, you will design, develop, and maintain tools and infrastructure for analyzing ML workloads and hardware performance. Responsibilities include developing power and performance models, creating visualizations and dashboards, building models and benchmarks for workload analysis to inform architectural decisions. You'll collaborate with cross-functional teams to improve workload analysis flows, focusing on debuggability and tracing. This role requires expertise in computer architecture, ML accelerators, and tooling development for power, performance, and architecture analysis. A strong understanding of compiler flows and translating high-level languages (like TensorFlow) to hardware instructions is crucial. You will be part of a team developing custom silicon solutions for Google's direct-to-consumer products.
Must have:
  • 5+ years experience with computer architecture
  • Experience with ML accelerators
  • Tooling development for power/performance analysis
  • Develop and maintain performance models
  • Collaborate with cross-functional teams
Good to have:
  • Master's or PhD in performance evaluation for ML systems
  • Experience writing ML algorithms
  • Experience in architecting and optimizing compilers
  • Understanding of compiler flows

Minimum qualifications:

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience.
  • 5 years of experience with computer architecture concepts, including microarchitecture, cache hierarchy, pipelining, and memory subsystems.

Preferred qualifications:

  • Master's Degree or Ph.D. with an emphasis on performance evaluation for Machine Learning (ML) systems.
  • Experience with ML accelerators (e.g. having worked on ML software models or accelerator architectures).
  • Experience writing ML algorithms for e.g. recommendation systems, Natural Language Processing (NLP), image and vision.
  • Experience in tooling development for power, performance and architecture analysis.
  • Experience in architecting and optimizing compilers.
  • Understanding of compiler flows, software involved in translating a high-level language (e.g. TensorFlow) to hardware instructions.

About the job

Be part of a diverse team that pushes boundaries, developing custom silicon solutions that power the future of Google's direct-to-consumer products. You'll contribute to the innovation behind products loved by millions worldwide. Your expertise will shape the next generation of hardware experiences, delivering unparalleled performance, efficiency, and integration.

Google's mission is to organize the world's information and make it universally accessible and useful. Our team combines the best of Google AI, Software, and Hardware to create radically helpful experiences. We research, design, and develop new technologies and hardware to make computing faster, seamless, and more powerful. We aim to make people's lives better through technology.

Responsibilities

  • Design, develop, and maintain tools and infrastructure for analyzing Machine Learning (ML) workloads and hardware performance.
  • Develop and maintain power and performance models.
  • Develop visualizations and dashboards to effectively communicate performance insights to engineers.
  • Build models, benchmarks for workload analysis and help to drive architectural decisions.
  • Collaborate with cross-functional teams to improve the workload analysis flows, including debuggability and tracing.
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

San Francisco, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

San Bruno, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Get notifed when new similar jobs are uploaded

Jobs in New Taipei, New Taipei City, Taiwan

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Get notifed when new similar jobs are uploaded