This role involves driving forward-looking Generative AI (GenAI) machine learning architecture exploration for Tensor mobile SoCs. Close collaboration with research, system architecture, and compiler engineering teams is essential to optimize future workloads across the entire technology stack (hardware, software, use cases, network, and external components).

Responsibilities include working with researchers and program management to define system architecture requirements for future GenAI use cases, applying advanced research to achieve breakthrough power and performance improvements on GenAI workloads, and optimizing GenAI performance by defining optimal model scheduling on TPU compute engines.

The ideal candidate has expertise in computer architecture, performance, and compilers, along with experience in GenAI model architectures (LLMs, Vision Transformers, etc.), programming languages (C/C++, Python), and deep learning frameworks (TensorFlow, JAX, PyTorch).