Machine Learning Compiler Software Engineer, TPU Horizontal Scaling

3 Months ago • 2 Years • Research & Development

Job Summary

Job Description

Google is looking for a Machine Learning Compiler Software Engineer to join its TPU Horizontal Scaling team. This team develops the Accelerated Linear Algebra (XLA) TPU/GPU parallelizing compiler used to partition, optimize, and run large-scale machine learning models across multiple TPU/GPU accelerators. The XLA Horizontal Scaling team's software stack includes the XLA Single Program Multiple Data (SPMD) partitioner, collective and scheduling optimizations, and code generation. Responsibilities include writing product or system development code, participating in design reviews, contributing to the compiler, conducting static and runtime performance analysis, and designing and implementing performance optimizations that increase the velocity of important production teams.
Must have:
  • Bachelor's degree or equivalent practical experience
  • 2 years of experience with software development in one or more programming languages
  • 2 years of experience with data structures or algorithms
Good to have:
  • Master's degree or PhD in Computer Science
  • Experience in Machine Learning and High Performance Computing (HPC)
  • Experience optimizing programs at distributed scale
  • Experience in C++
  • Experience in compilers
  • Ability to debug and program concurrent/parallel computations

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • Candidates will typically have 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • Candidates will typically have 2 years of experience with data structures or algorithms.

Preferred qualifications:

  • Master's degree or PhD in Computer Science, or a related technical field.
  • Experience in Machine Learning and High Performance Computing (HPC).
  • Experience optimizing programs at distributed scale.
  • Experience in C++.
  • Experience in compilers.
  • Ability to debug and program concurrent/parallel computations.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.

With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Our team develops the Accelerated Linear Algebra (XLA) TPU/GPU parallelizing compiler used to partition, optimize, and run large-scale machine learning models across multiple TPU/GPU accelerators for internal and external customers. The XLA Horizontal Scaling team’s software stack includes the XLA Single Program Multiple Data (SPMD) partitioner, collective and scheduling optimizations, and code generation.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

  • Write product or system development code.
  • Participate in, or lead, design reviews with peers and stakeholders to decide among available technologies.
  • Contribute to a compiler that scales out machine learning models across accelerators such as Tensor Processing Units (TPUs) and Graphics Processing Units (GPUs) at Google and Cloud.
  • Conduct static and runtime performance analysis of important large-scale production models.
  • Design and implement performance optimizations and critical features that increase the velocity of important production teams.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.
