Accelerator Architect and Performance Engineer, Generative AI

2 Months ago • 8 Years + • $183,000 PA - $271,000 PA

Job Summary

Job Description

This role involves driving forward-looking Generative AI (GenAI) Machine Learning architecture exploration for Tensor mobile SoCs. Collaboration with research, system architecture, and compiler teams is crucial to optimize future workloads across the entire tech stack (hardware, software, use cases, network, and external components). Responsibilities include defining system architecture requirements for future GenAI use cases, applying advanced research to achieve power and performance improvements on GenAI workloads, and optimizing GenAI use case performance through model scheduling on TPU compute engines. The ideal candidate possesses extensive experience in computer architecture, performance, and compilers, coupled with expertise in Generative AI model architectures (LLMs, Vision Transformers, etc.). Proficiency in programming languages (C/C++, Python) and deep learning frameworks (TensorFlow/JAX/PyTorch) is essential.
Must have:
  • Bachelor's degree in relevant field
  • 8+ years of experience in computer architecture, performance, or compilers
  • GenAI model architecture experience
  • Programming in C/C++ or Python and deep learning frameworks
  • Collaboration with research and engineering teams
  • Optimize GenAI performance on TPUs
Good to have:
  • Master's or PhD in relevant field
  • Experience with domain-specific accelerators
  • Distributed/parallel programming experience
  • Hardware/software co-design experience for ML
  • Simulator development and micro-architecture experience
  • Excellent communication skills

Job Details


Minimum qualifications:

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, a related field, or equivalent practical experience.
  • 8 years of work or academic research experience in computer or chip architecture, performance, or compilers.
  • Experience with Generative AI model architectures (e.g., Large Language Models, Vision Transformers, Image Diffusion Models, etc.).
  • Experience with one or more general-purpose programming languages, including (but not limited to) C/C++ or Python, and with deep learning frameworks such as TensorFlow, JAX, or PyTorch.

Preferred qualifications:

  • Master's degree or PhD in Electrical Engineering, Computer Engineering or Computer Science, with an emphasis on computer architecture.
  • Experience with domain-specific accelerators.
  • Experience with distributed/parallel programming.
  • Experience with hardware/software co-design for machine learning.
  • Experience with simulator development and micro-architecture.
  • Excellent communication skills.

About the job

Be part of a team that pushes boundaries, developing custom silicon solutions that power the future of Google's direct-to-consumer products. You'll contribute to the innovation behind products loved by millions worldwide. Your expertise will shape the next generation of hardware experiences, delivering unparalleled performance, efficiency, and integration. Google's mission is to organize the world's information and make it universally accessible and useful. Our team combines the best of Google AI, Software, and Hardware to create radically helpful experiences. We research, design, and develop new technologies and hardware to make computing faster, seamless, and more powerful. We aim to make people's lives better through technology.

The US base salary range for this full-time position is $183,000-$271,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits.

Responsibilities

  • Drive forward-looking GenAI Machine Learning architecture exploration for Tensor mobile SoCs, collaborating with research teams, system architecture teams, and compiler engineers to optimize future workloads from all perspectives across the tech stack, including hardware, software, use case, network, and external components.
  • Work with researchers and Program Management teams to define system architecture requirements for future Generative AI use cases.
  • Apply advanced research in architecture and process technology to get breakthrough power and performance improvements on Generative AI workloads.
  • Optimize performance of GenAI use cases by defining optimal model scheduling on the TPU compute engines.
