Accelerator Architect and Performance Engineer, Generative AI

4 Hours ago • 8 Years + • Artificial Intelligence • $183,000 PA - $271,000 PA

Job Summary

Job Description

This role involves driving forward-looking Generative AI (GenAI) Machine Learning architecture exploration for Tensor mobile SoCs. Collaboration with research, system architecture, and compiler teams is crucial to optimize future workloads across the entire tech stack (hardware, software, use cases, network, and external components). Responsibilities include defining system architecture requirements for future GenAI use cases, applying advanced research to achieve power and performance improvements on GenAI workloads, and optimizing GenAI use case performance through model scheduling on TPU compute engines. The ideal candidate possesses extensive experience in computer architecture, performance, and compilers, coupled with expertise in Generative AI model architectures (LLMs, Vision Transformers, etc.). Proficiency in programming languages (C/C++, Python) and deep learning frameworks (TensorFlow/Jax/PyTorch) is essential.
Must have:
  • Bachelor's degree in relevant field
  • 8+ years experience in computer architecture/performance/compiler
  • GenAI model architecture experience
  • Programming in C/C++ or Python and deep learning frameworks
  • Collaboration with research and engineering teams
  • Optimize GenAI performance on TPUs
Good to have:
  • Master's or PhD in relevant field
  • Experience with domain-specific accelerators
  • Distributed/parallel programming experience
  • Hardware/software co-design experience for ML
  • Simulator development and micro-architecture experience
  • Excellent communication skills

Job Details


Minimum qualifications:

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience.
  • 8 years of work or academic research experience in computer or chip architecture, performance, or compiler.
  • Experience with Generative AI model architectures (e.g., Large Language Models, Vision Transformers, Image Diffusion Models, etc.).
  • Experience with one or more general purpose programming languages including (but not limited to) C/C++ or Python and deep learning frameworks like TensorFlow/Jax/Pytorch.

Preferred qualifications:

  • Master's degree or PhD in Electrical Engineering, Computer Engineering or Computer Science, with an emphasis on computer architecture.
  • Experience with domain-specific accelerators.
  • Experience with distributed/parallel programming.
  • Experience with hardware/software co-design for machine learning.
  • Experience with simulator development and micro-architecture.
  • Excellent communication skills.

About the job

Be part of a team that pushes boundaries, developing custom silicon solutions that power the future of Google's direct-to-consumer products. You'll contribute to the innovation behind products loved by millions worldwide. Your expertise will shape the next generation of hardware experiences, delivering unparalleled performance, efficiency, and integration. Google's mission is to organize the world's information and make it universally accessible and useful. Our team combines the best of Google AI, Software, and Hardware to create radically helpful experiences. We research, design, and develop new technologies and hardware to make computing faster, seamless, and more powerful. We aim to make people's lives better through technology.

The US base salary range for this full-time position is $183,000-$271,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Drive forward-looking GenAI Machine Learning architecture exploration for Tensor mobile SoCs while collaborating with research teams, system architecture teams, and compiler engineers to optimize future workloads from both all perspectives across the tech stack including hardware, software, use case, network, and external components.
  • Work with researchers and Program Management teams to define system architecture requirements for future Generative AI use cases.
  • Apply advanced research in architecture and process technology to get breakthrough power and performance improvements on Generative AI workloads.
  • Optimize performance of GenAI use cases by defining an optimal model scheduling on the TPU compute engines.

Similar Jobs

NVIDIA - Full Stack Developer, AI and LLM

NVIDIA

California, United States (Hybrid)
2 Weeks ago
ByteDance - Research Scientist in Large Model System

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Stonewall Collision & Auto Painting - Senior Data Scientist

Stonewall Collision & Auto Painting

Vijayawada, Andhra Pradesh, India (On-Site)
7 Months ago
ByteDance - Senior Machine Learning Engineer

ByteDance

San Jose, California, United States (On-Site)
3 Days ago
ByteDance - Software Engineer Large Model System Graduate (Machine Learning Sys-US) - 2024 Start (BS/MS)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Google - Senior Software Engineer, Machine Learning, Google Play Books

Google

Bengaluru, Karnataka, India (On-Site)
1 Day ago
NVIDIA - Principal DGX Cloud Machine Learning Architect

NVIDIA

Canada (On-Site)
1 Month ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Hyderabad, Telangana, India (Hybrid)
1 Month ago
Microsoft - Senior Researcher – Artificial Specialized Intelligence

Microsoft

Redmond, Washington, United States (On-Site)
1 Day ago
Google - Customer Engineer, Machine Learning, Google Cloud

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Day ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech Understanding) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Nintendo - Intern – Machine Learning Software Engineer (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
4 Months ago
Epic Games - Senior Machine Learning Rendering Engineer

Epic Games

(On-Site)
3 Weeks ago
Attentive - Staff Machine Learning Engineer

Attentive

San Francisco, California, United States (Hybrid)
6 Months ago
Netflix - Product Manager, ML Platform: Training

Netflix

Los Gatos, California, United States (Hybrid)
5 Months ago
Netflix - Machine Learning Intern - Spring or Summer 2025

Netflix

Los Gatos, California, United States (On-Site)
5 Months ago
Genies - Machine Learning Engineer: 3D Generative AI

Genies

San Mateo, California, United States (Remote)
5 Months ago
ByteDance - Engineering Manager Machine Learning Infrastructure

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Canva - Staff Machine Learning Engineer - User Voice

Canva

Melbourne, Victoria, Australia (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in Mountain View, California, United States

Apollo - Senior Software Engineer, Backend

Apollo

United States (Remote)
6 Months ago
Trek - Part Time Sales Associate

Trek

Leesburg, Virginia, United States (On-Site)
1 Month ago
Scientific Games  - Procurement Manager - Process, Tools, and Continuous Improvement

Scientific Games

Alpharetta, Georgia, United States (On-Site)
1 Month ago
ByteDance - Applied Scientist Intern (Computational Modeling & Optimization-System Technologies and Engineering)

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
Onward Search - UX Content Designer

Onward Search

Chicago, Illinois, United States (Remote)
3 Days ago
The Walt Disney Company - Disney Store: Sales Associate (PT)

The Walt Disney Company

New York, New York, United States (On-Site)
5 Months ago
ByteDance - Research Scientist Graduate (Computational Biology (AI-for-Science))

ByteDance

Seattle, Washington, United States (On-Site)
3 Weeks ago
Bonfire Studios - Senior Producer (Audio / Narrative / Localization)

Bonfire Studios

California, United States (Hybrid)
1 Month ago
SMU Guildhall - Faculty - Video Game Development

SMU Guildhall

Dallas, Texas, United States (On-Site)
6 Months ago
Microsoft - Member of Technical Staff, Platform Engineer

Microsoft

Redmond, Washington, United States (Hybrid)
3 Days ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Light Speed Studios - Game AI Researcher

Light Speed Studios

Tokyo, Japan (On-Site)
2 Weeks ago
ByteDance - Research Scientist in Foundation Model (Music) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Google - Staff Software Engineer, GenAI and Computational Photography

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Day ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Days ago
Microsoft - Principal Researcher - Deep Learning, Reinforcement Learning

Microsoft

New York, New York, United States (On-Site)
1 Day ago
NVIDIA - Senior Deep Learning Performance Architect

NVIDIA

Canada (On-Site)
1 Month ago
Google - Software Engineer III, AI/ML GenAI, Search

Google

Mountain View, California, United States (On-Site)
1 Day ago
NVIDIA - Senior Solutions Architect, Global Partner Team

NVIDIA

Canada (On-Site)
3 Months ago
Google - Senior Technical Solutions Consultant, AI, Customer Experience Suite

Google

Waterloo, Ontario, Canada (On-Site)
1 Day ago
NVIDIA - Full Stack Developer, AI and LLM

NVIDIA

California, United States (Hybrid)
2 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Seoul, South Korea (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Hyderabad, Telangana, India (On-Site)

Atlanta, Georgia, United States (On-Site)

Fremont, California, United States (On-Site)

Milan, Lombardy, Italy (On-Site)

Eemshaven, Groningen, Netherlands (On-Site)

Bengaluru, Karnataka, India (On-Site)

Sunnyvale, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug