Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

3 Weeks ago • 1-4 Years • Research & Development

About the job

Job Description

Google seeks a Machine Learning Compiler Software Engineer for its TPU Horizontal Scaling team. This role involves contributing to the Accelerated Linear Algebra (XLA) compiler, optimizing and scaling machine learning models across TPU/GPU accelerators. Responsibilities include writing and reviewing code, conducting performance analysis, implementing optimizations, and enhancing features to improve production team velocity. The ideal candidate will have experience in compiler design, parallel computing, and machine learning, with a strong background in C++ and a focus on distributed system optimization.
Must have:
  • Bachelor's degree or equivalent practical experience
  • 2 years of software development experience (or 1 year with an advanced degree)
  • 2 years of experience with data structures or algorithms
Good to have:
  • Master's degree or PhD in Computer Science or a related technical field
  • Machine Learning and High Performance Computing (HPC) experience
  • Experience optimizing programs at distributed scale
  • C++ experience
  • Compiler experience
  • Ability to debug and program concurrent/parallel computations

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • Candidates will typically have 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • Candidates will typically have 2 years of experience with data structures or algorithms.

Preferred qualifications:

  • Master's degree or PhD in Computer Science, or a related technical field.
  • Experience in Machine Learning and High Performance Computing (HPC).
  • Experience optimizing programs at distributed scale.
  • Experience in C++.
  • Experience in compilers.
  • Ability to debug and program concurrent/parallel computations.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design, and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs, with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities, and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.

With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Our team develops the Accelerated Linear Algebra (XLA) TPU/GPU parallelizing compiler used to partition, optimize, and run large-scale machine learning models across multiple TPU/GPU accelerators for internal and external customers. The XLA Horizontal Scaling team’s software stack includes the XLA Single Program Multiple Data (SPMD) partitioner, collective and scheduling optimizations, and code generation.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

  • Write product or system development code.
  • Participate in, or lead, design reviews with peers and stakeholders to decide among available technologies.
  • Contribute to a compiler that scales out machine learning models across accelerators such as Tensor Processing Units (TPUs) and Graphics Processing Units (GPUs) at Google and Cloud.
  • Conduct static and runtime performance analysis of important large-scale production models.
  • Design and implement performance optimizations and critical features that increase the velocity of important production teams.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

