Software Engineer, JAX/TPU Tooling

1 Month ago • 2-4 Years • Full Stack Development

About the job

Job Description

This Software Engineer role focuses on accelerating the growth of the JAX/XLA/TPU ecosystem and improving JAX's usability for TPUs. You'll collaborate with the JAX third-party team and TPU SWEs to build tools for managing large High-level Operations (HLO) graphs, address compiler issues, make changes to XLA (APIs, hooks, transforms), troubleshoot numerics accuracy/stability on TPUs, and ensure user-facing features work end-to-end. Responsibilities include building and improving developer tools, working with large-scale systems, and collaborating across teams to solve machine learning developer challenges. The ideal candidate has experience with C++ or Python, building developer tools, and ideally, experience with ML developer infrastructure and SQL.
Must have:
  • 2+ years software development experience
  • 2+ years building developer tools
  • C++ or Python experience
  • Build tools for HLO graphs
  • Address compiler issues
  • Troubleshoot TPU numerics
Good to have:
  • ML developer infrastructure experience
  • Experience with SQL
  • Experience developing tools and infrastructure
  • XLA experience

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 2 years of experience building developer tools (e.g., compilers, automated releases, code design and testing, test automation frameworks).
  • Experience in C++ or Python.

Preferred qualifications:

  • Experience with Machine Learning (ML) developer infrastructure.
  • Experience developing tools and infrastructure.
  • Experience with SQL to develop impact tracking dashboards.
  • Ability to collaborate across teams and organizational boundaries.
  • Ability to solve the issues faced by machine learning developers.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

We are looking for a Software Engineer (SWE) to help us accelerate the growth of the JAX/XLA/TPU based external ecosystem and improving the usability of JAX for TPUs, enabling wider ecosystem of third-party libraries and tools.

JAX depends on XLA for its Machine Learning (ML) hardware acceleration needs. You will collaborate with the JAX third-party (3P) team whose focus is on the top of the stack (user-facing features in JAX itself) to ensure that user-facing features work end-to-end. You will work with TPU SWEs in the ML Compiler/Runtime organisation to change lower in the compiler stack.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

The US base salary range for this full-time position is $136,000-$200,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Build tools to help wrangle increasingly larger High-level Operations (HLO) graphs.
  • Address rough edges and make the compiler work to delight users.
  • Make necessary changes to XLA (e.g., new APIs, hooks, transforms etc.) to unlock a rich ecosystem of third-party libraries.
  • Troubleshoot numerics accuracy and stability on Tensor Processing Units (TPUs).
View Full Job Description
$136.0K - $200.0K/yr (Outscal est.)
$168.0K/yr avg.
Worldwide

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

San Francisco, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

San Bruno, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

Sunnyvale, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

SKYDANCE - Lead Software Engineer, RenderMan

SKYDANCE, United States (Hybrid)

Razer - Software QA Engineer II

Razer, Singapore (On-Site)

ARVORE Immersive Experiences - GAMEPLAY DEVELOPER

ARVORE Immersive Experiences, Brazil (Remote)

Intel Corporation - Quality and Reliability Engineer

Intel Corporation, Costa Rica (On-Site)

Google - Software Engineer III, AI Infrastructure

Google, United States (On-Site)

Playrix - Technical Director (Game Project)

Playrix, Montenegro (Remote)

Get notifed when new similar jobs are uploaded

Full Stack Development Jobs

Get notifed when new similar jobs are uploaded