Software Engineer III, XBorg, Google Cloud

1 Month ago • 2 Years + • Artificial Intelligence

Job Summary

Job Description

As a Software Engineer III at Google, you'll contribute to XBorg, a novel orchestration layer for scheduling throughput-oriented workloads, focusing on ML training and inference. Responsibilities include writing and reviewing code, participating in design reviews, contributing to documentation, and troubleshooting system issues. You'll work with technologies like weighted fair queuing and contribute to increased resource efficiency for ML workloads across major Alphabet products. The role requires experience in software development, data structures, and algorithms. The ideal candidate possesses a Master's or PhD in Computer Science and experience with accessible technologies and ML infrastructure.
Must have:
  • Software development experience
  • Data structures and algorithms expertise
  • Code review and debugging skills
  • Contribution to documentation
Good to have:
  • Master's/PhD in Computer Science
  • Experience with accessible technologies
  • Machine Learning infrastructure experience

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 2 years of experience with data structures or algorithms.

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical fields.
  • Experience developing accessible technologies.
  • Experience with Machine Learning infrastructure.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

XBorg is part of Borg Control Plane and is a novel orchestration layer responsible for scheduling of throughput oriented workloads onto clusters across the fleet, with a central focus on Machine Learning (ML) training and inference workloads. Xborg's features and introduction of new concepts, such as weighted fair queuing and seamless opportunistic access to unused resource, spatial and platform flexibility, etc. have led to increased resource occupancy and efficiency for ML workloads across major Alphabet products.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

Responsibilities

  • Write product or system development code.
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
  • Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.

Similar Jobs

Fictiv - Document Controller

Fictiv

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Tesla - Electrical Engineer - Motor Design and Powertrain Modeling

Tesla

Athens, Greece (On-Site)
3 Months ago
Inkittt - Senior Front-End Engineer

Inkittt

Krakow Am See, Mecklenburg-Vorpommern, Germany (Hybrid)
2 Months ago
Virtuos - Assistant Technical Director (Gameplay)

Virtuos

China (On-Site)
2 Months ago
Microsoft - Member of Technical Staff, AI Reinforcement Systems

Microsoft

Zürich, Zurich, Switzerland (On-Site)
1 Month ago
CharacterAI - Research Engineer, Post-Training

CharacterAI

New York, New York, United States (On-Site)
2 Months ago
NVIDIA - Principal Software Engineer - Enterprise AI Platform

NVIDIA

Santa Clara, California, United States (Hybrid)
4 Months ago
Google - Conversational AI Consultant

Google

Haryana, India (On-Site)
1 Month ago
Google - Software Engineer, PhD, Early Career, Campus, Machine Learning, Systems and Cloud AI, 2025 start

Google

Sunnyvale, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Senior Software Engineer, Engineering Productivity, Android

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
AI Dash - Software Development Engineer II - Backend

AI Dash

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
CGS Carrers - Software Development Engineer Senior

CGS Carrers

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
QuinStreet - Applied Machine Learning Engineer

QuinStreet

(Remote)
1 Month ago
Ethos Life - Senior Backend Engineer

Ethos Life

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Axon - Senior Machine Learning Scientist II

Axon

Seattle, Washington, United States (Hybrid)
1 Month ago
CharacterAI - Research Engineer, ML Systems

CharacterAI

New York, New York, United States (On-Site)
2 Months ago
Google - Staff Software Engineer, Engineering Productivity, Google Cloud Platforms

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Snowed In Studios - Senior Generalist Programmer

Snowed In Studios

Quebec, Canada (Remote)
2 Months ago
Google - Software Engineer III, Ads

Google

Mountain View, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Warsaw, Masovian Voivodeship, Poland

Tesla - Delivery Advisor

Tesla

Ząbki, Masovian Voivodeship, Poland (On-Site)
3 Months ago
PwC - Starszy Konsultant  / Starsza Konsultantka | Ryzyko Kredytowe

PwC

Warsaw, Masovian Voivodeship, Poland (Hybrid)
6 Months ago
Techland - Senior Cinematic Artist

Techland

Wrocław, Lower Silesian Voivodeship, Poland (On-Site)
3 Months ago
PwC - Starszy Konsultant / Starsza Konsultantka | Audyt

PwC

Poznań, Greater Poland Voivodeship, Poland (Hybrid)
7 Months ago
Google - Software Engineer II, Google Messages

Google

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
1 Month ago
11 bit studios - Creative Lead

11 bit studios

Warsaw, Masovian Voivodeship, Poland (Hybrid)
3 Weeks ago
Penumbra - Marketing Events & Congress Lead

Penumbra

Warsaw, Masovian Voivodeship, Poland (Remote)
1 Month ago
Netflix - Engineering Manager, Security Incident Response

Netflix

Warsaw, Masovian Voivodeship, Poland (On-Site)
4 Months ago
Tripledot Studios - Lead Motion Designer

Tripledot Studios

Warsaw, Masovian Voivodeship, Poland (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Google - Customer Engineer II, Cloud AI, Google Cloud

Google

San Francisco, California, United States (On-Site)
1 Month ago
Google - Staff Software Engineer, AI/ML

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
NVIDIA - Senior Software Engineer, AI Resiliency

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Meta - Software Engineer, Machine Learning

Meta

Pittsburgh, Pennsylvania, United States (On-Site)
6 Months ago
NVIDIA - Solutions Architect, AI and ML

NVIDIA

Redmond, Washington, United States (On-Site)
2 Months ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

Prague, Czechia (Remote)
4 Months ago
Google - EDA/CAD Custom Tool Development Engineer

Google

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Krafton  - Deep Learning Engineer - LLM Game Agent

Krafton

Seoul, South Korea (On-Site)
3 Months ago
Ubisoft - Senior ML Programmer

Ubisoft

Montreal, Quebec, Canada (On-Site)
4 Months ago
Meta - Research Scientist Intern, Language and Multimodal Research for MetaAI (PhD)

Meta

New York, New York, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

London, England, United Kingdom (On-Site)

Bengaluru, Karnataka, India (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Zürich, Zurich, Switzerland (On-Site)

Kirkland, Washington, United States (On-Site)

New Taipei, New Taipei City, Taiwan (On-Site)

Seattle, Washington, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug