Principal Software Engineer, Google Compute Engine Control Plane

2 Months ago • 15 Years + • DevOps • Undisclosed

About the job

Job Description

Google Compute Engine (GCE) is at the heart of the Google Cloud Platform (GCP). As Principal Software Engineer, you'll lead the Compute organization in designing and developing cutting-edge projects. Responsibilities include developing new AI/ML offerings, designing capacity-aware scheduling, driving architectural decisions for reliability and scalability, ensuring API modularity and compatibility, and maximizing code reuse. The role requires leadership in global projects, setting technical direction, and delivering customer-focused products. This position involves working on the core infrastructure of GCP, impacting nearly every service offered.
Must have:
  • 15+ years experience with large scale distributed systems
  • Technical leadership & global project experience
  • Experience with cloud solutions architecture, development, maintenance
  • Customer-focused iterative product delivery
  • Networking and compute infrastructure expertise
Good to have:
  • Hyperscale cloud technology experience
  • Deep understanding of AI/ML infrastructure (GPUs, TPUs, LLMs)
  • AI/ML use case experience (training, inference, tuning)

Minimum qualifications:

  • 15 years of experience with large scale distributed systems and architectures.
  • Experience in technical leadership, leading global projects and setting technical direction for teams.
  • Experience with customer focused, iterative product and feature delivery.
  • Experience in networking, compute infrastructure, and architecting, developing, or maintaining cloud solutions.

Preferred qualifications:

  • Experience working on or with hyperscale cloud technologies.
  • Deep understanding of AI/ML-related infrastructure technologies (e.g., GPUs, TPUs, LLMs, foundational models) and use cases (e.g., training, inference, tuning etc.).

About the job

Google Compute Engine (GCE) is at the heart of the Google Cloud Platform (GCP). It underlies and powers almost every service (e.g., VMs, databases, data analytics, Kubernetes, AI/ML, batch, cloud functions, monitoring, alerting, etc.) that GCP offers.

As the Principal Software Engineer, you will lead the Compute organization in the ideation, design, and development of numerous simultaneously executed cutting-edge projects and initiatives.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

  • Develop new easy-to-use AI/ML related offerings leveraging Google’s software stack.
  • Design capacity-aware scheduling capabilities to automatically move workloads between zones and regions.
  • Drive key architectural decisions to ensure reliability, security, performance, and scalability.
  • Drive key implementation decisions to maximize code reuse, leveraging existing frameworks and minimizing accumulation of technical debt.
  • Ensure that APIs and semantics are modular, future proof, and compatible with other parts of GCE and GCP to ensure a consistent user experience.
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Hyderabad, Telangana, India (On-Site)

New Taipei, New Taipei City, Taiwan (On-Site)

New York, New York, United States (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)

Mountain View, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Similar Jobs

Patterned Learning Career - Senior Software Engineer, Data

Patterned Learning Career, (Remote)

Postman - Engineering Manager, Flows

Postman, United States (On-Site)

Fairmatic - Senior Software Engineer - Backend

Fairmatic, India (Hybrid)

Wargaming - Infrastructure DevOps Engineer

Wargaming, Lithuania (Hybrid)

ByteDance - Site Reliability Engineer - Game

ByteDance, Singapore (On-Site)

Luxoft - Senior DevOps (Lambda, Kubernetes)

Luxoft, United States (Remote)

Sinch - Site Reliability Engineer III

Sinch, France (Remote)

Equivalent Jobs - MLOPS ENGINEER

Equivalent Jobs, (Remote)

Meta - Silicon CAD Infrastructure

Meta, United States (On-Site)

Microsoft - Senior System Electrical Engineer

Microsoft, Taiwan (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

USE Insider - DevOps Engineer

USE Insider, Türkiye (Remote)

Telesign - Site Reliability Engineer (SRE) III

Telesign, India (On-Site)

Microsoft - Technical Support Engineer - Kubernetes

Microsoft, Australia (Remote)

Anko GCC - DevOps Engineer

Anko GCC, India (Hybrid)

NCR Voyix - SW Quality Engineer III

NCR Voyix, India (Hybrid)

DraftKings - Senior Software Engineer

DraftKings, Bulgaria (On-Site)

Ubisoft Blue Byte - Site Reliability Engineer [Game Security]

Ubisoft Blue Byte, Germany (On-Site)

Get notifed when new similar jobs are uploaded

DevOps Jobs

Microsoft - Senior Research Data and Service Engineer

Microsoft, United States (On-Site)

Playtech - ProdOps Engineer

Playtech, Ukraine (On-Site)

Rockstar Games - Build & Release Engineer

Rockstar Games, United States (On-Site)

Razer - Software Engineer (DevOps)

Razer, Malaysia (On-Site)

Warner Bros Games - Senior Software Developer

Warner Bros Games, Canada (Hybrid)

Take-Two Interactive - Senior Systems Engineer

Take-Two Interactive, India (On-Site)

Luxoft - .NET and Azure API Developer

Luxoft, India (On-Site)

Paytm - DevOps- Principal Engineer

Paytm, India (On-Site)

Litera - Site Reliability Engineer

Litera, India (On-Site)

Get notifed when new similar jobs are uploaded