Software Engineer III, XBorg, Google Cloud

2 Days ago • 2 Years + • Artificial Intelligence

Job Summary

Job Description

As a Software Engineer III at Google, you'll contribute to XBorg, a novel orchestration layer for scheduling throughput-oriented workloads, focusing on ML training and inference. Responsibilities include writing and reviewing code, participating in design reviews, contributing to documentation, and troubleshooting system issues. You'll work with technologies like weighted fair queuing and contribute to increased resource efficiency for ML workloads across major Alphabet products. The role requires experience in software development, data structures, and algorithms. The ideal candidate possesses a Master's or PhD in Computer Science and experience with accessible technologies and ML infrastructure.
Must have:
  • Software development experience
  • Data structures and algorithms expertise
  • Code review and debugging skills
  • Contribution to documentation
Good to have:
  • Master's/PhD in Computer Science
  • Experience with accessible technologies
  • Machine Learning infrastructure experience

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 2 years of experience with data structures or algorithms.

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical fields.
  • Experience developing accessible technologies.
  • Experience with Machine Learning infrastructure.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

XBorg is part of Borg Control Plane and is a novel orchestration layer responsible for scheduling of throughput oriented workloads onto clusters across the fleet, with a central focus on Machine Learning (ML) training and inference workloads. Xborg's features and introduction of new concepts, such as weighted fair queuing and seamless opportunistic access to unused resource, spatial and platform flexibility, etc. have led to increased resource occupancy and efficiency for ML workloads across major Alphabet products.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

Responsibilities

  • Write product or system development code.
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.
  • Review code developed by other developers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.

Similar Jobs

Meta - Software Engineer (Technical Leadership)

Meta

New York, New York, United States (On-Site)
5 Months ago
Virtuos - Junior Software Engineer

Virtuos

Malaysia (On-Site)
2 Weeks ago
Microsoft - ROP - Cloud Network Engineer

Microsoft

Hyderabad, Telangana, India (On-Site)
1 Week ago
Google - Cloud Engineer II, AI/ML, Professional Services

Google

Mexico City, Mexico City, Mexico (On-Site)
1 Week ago
Hitachi - Artificial Intelligence - JBU

Hitachi

Chennai, Tamil Nadu, India (On-Site)
6 Months ago
Google - Creative Technologist, Demos and Experiments

Google

Sunnyvale, California, United States (On-Site)
2 Days ago
Microsoft - Senior Applied Researcher

Microsoft

Redmond, Washington, United States (On-Site)
1 Week ago
CharacterAI - Research Engineer, ML Systems

CharacterAI

New York, New York, United States (On-Site)
1 Month ago
Google - Software Engineer III, AI/ML GenAI, Google Cloud Data Management

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Seattle, Washington, United States (Remote)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

CharacterAI - Research Engineer, ML Systems

CharacterAI

New York, New York, United States (On-Site)
1 Month ago
Microsoft - Research Intern - Applied Sciences Group (Computer Vision)

Microsoft

Redmond, Washington, United States (On-Site)
1 Week ago
Google - Staff Software Engineer, Infrastructure, Google Cloud

Google

San Francisco, California, United States (On-Site)
2 Days ago
ByteDance - Senior Research Engineer / Scientist - Storage for LLM

ByteDance

Seattle, Washington, United States (On-Site)
3 Days ago
The Walt Disney Company - Lead Machine Learning Engineer

The Walt Disney Company

San Francisco, California, United States (On-Site)
2 Weeks ago
Oportun - Senior ML Engineer

Oportun

(Remote)
1 Day ago
Google - Senior Software Engineer, Distributed Machine Learning

Google

Mountain View, California, United States (On-Site)
2 Days ago
Google - Senior Software Engineer, ML Compilers, Google Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
2 Weeks ago
Appier - Software Engineer, Data Backend(Data Platform)

Appier

Taipei City, Taiwan (On-Site)
5 Months ago
Activision - Software Development Intern

Activision

Shanghai, Shanghai, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Warsaw, Masovian Voivodeship, Poland

Google - Senior UX Designer, Chrome

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
Google - Software Engineer, Vertex GenAI, Infrastructure and Features

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
Techland - Creative Director

Techland

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
PwC - Starszy Konsultant / Starsza Konsultantka | Audyt

PwC

Poznań, Greater Poland Voivodeship, Poland (On-Site)
6 Months ago
Google - Product Manager II, Health Data Governance

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Days ago
Google - Software Engineer III, Site Reliability Engineering

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago
Veeam Software - Talent Acquisition Partner

Veeam Software

Warsaw, Masovian Voivodeship, Poland (On-Site)
7 Hours ago
ARHS - System Analyst

ARHS

Warsaw, Masovian Voivodeship, Poland (Remote)
6 Months ago
Google - Software Engineer III, RCS Business

Google

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
2 Days ago
ARHS - IAM Service Coordinator

ARHS

Warsaw, Masovian Voivodeship, Poland (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

NVIDIA - Machine Learning Intern - 2025

NVIDIA

(On-Site)
3 Months ago
Microsoft - Member of Technical Staff, AI Multimodal

Microsoft

London, England, United Kingdom (On-Site)
1 Month ago
Google - Hardware Engineer, PhD, Cloud Platforms

Google

Taipei City, Taiwan (On-Site)
2 Days ago
Google - Customer Engineer, AI Infrastructure, Google Cloud

Google

Singapore (On-Site)
2 Weeks ago
Google - Software Engineer, PhD

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago
ByteDance - Senior XR Strategy Expert

ByteDance

San Jose, California, United States (On-Site)
2 Weeks ago
Google - Hardware Engineer, PhD, Cloud Platforms

Google

Taipei City, Taiwan (On-Site)
1 Week ago
Lionbridge Games - AI Program Manager

Lionbridge Games

(Remote)
2 Days ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Vision Generative AI)

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
NVIDIA - Senior Solutions Architect, Retail

NVIDIA

Arkansas, United States (Remote)
2 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug