Principal Software Engineer, Google Compute Engine Control Plane

1 Month ago • 15 Years + • DevOps

About the job

Job Description

Google Compute Engine (GCE) is at the heart of the Google Cloud Platform (GCP). As Principal Software Engineer, you'll lead the Compute organization in designing and developing cutting-edge projects. Responsibilities include developing new AI/ML offerings, designing capacity-aware scheduling, driving architectural decisions for reliability and scalability, ensuring API modularity and compatibility, and maximizing code reuse. The role requires leadership in global projects, setting technical direction, and delivering customer-focused products. This position involves working on the core infrastructure of GCP, impacting nearly every service offered.
Must have:
  • 15+ years experience with large scale distributed systems
  • Technical leadership & global project experience
  • Experience with cloud solutions architecture, development, maintenance
  • Customer-focused iterative product delivery
  • Networking and compute infrastructure expertise
Good to have:
  • Hyperscale cloud technology experience
  • Deep understanding of AI/ML infrastructure (GPUs, TPUs, LLMs)
  • AI/ML use case experience (training, inference, tuning)

Minimum qualifications:

  • 15 years of experience with large scale distributed systems and architectures.
  • Experience in technical leadership, leading global projects and setting technical direction for teams.
  • Experience with customer focused, iterative product and feature delivery.
  • Experience in networking, compute infrastructure, and architecting, developing, or maintaining cloud solutions.

Preferred qualifications:

  • Experience working on or with hyperscale cloud technologies.
  • Deep understanding of AI/ML-related infrastructure technologies (e.g., GPUs, TPUs, LLMs, foundational models) and use cases (e.g., training, inference, tuning etc.).

About the job

Google Compute Engine (GCE) is at the heart of the Google Cloud Platform (GCP). It underlies and powers almost every service (e.g., VMs, databases, data analytics, Kubernetes, AI/ML, batch, cloud functions, monitoring, alerting, etc.) that GCP offers.

As the Principal Software Engineer, you will lead the Compute organization in the ideation, design, and development of numerous simultaneously executed cutting-edge projects and initiatives.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

Responsibilities

  • Develop new easy-to-use AI/ML related offerings leveraging Google’s software stack.
  • Design capacity-aware scheduling capabilities to automatically move workloads between zones and regions.
  • Drive key architectural decisions to ensure reliability, security, performance, and scalability.
  • Drive key implementation decisions to maximize code reuse, leveraging existing frameworks and minimizing accumulation of technical debt.
  • Ensure that APIs and semantics are modular, future proof, and compatible with other parts of GCE and GCP to ensure a consistent user experience.
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

San Francisco, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

San Bruno, California, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Similar Jobs

Luxoft - Lead DevOps Engineer

Luxoft, (Remote)

Tanla Platforms  - Senior Site Reliability Engineer

Tanla Platforms , India (On-Site)

SSC Technologies - Sr. Platform Engineer

SSC Technologies, United States (Remote)

Topsoe - Senior Software Engineer II

Topsoe, India (On-Site)

Luxoft - Avaloq Release Manager

Luxoft, Switzerland (On-Site)

Starkflow - Devops Engineer

Starkflow, India (On-Site)

Aera Technology - Senior Release Engineer

Aera Technology, India (Hybrid)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Skillz - Lead Web Full Stack Engineer (Las Vegas)

Skillz, United States (On-Site)

Saviynt - Senior Engineer

Saviynt, India (Hybrid)

Sonar Source - Solutions Engineer - ANZ

Sonar Source, Australia (Remote)

Alphasense - Join AlphaSense India Talent Community

Alphasense, India (On-Site)

Intrepid Studios,  Inc  - DevOps Engineer (Kubernetes & Cloud Services)

Intrepid Studios, Inc , Canada (On-Site)

BitGo - Staff Backend Engineer

BitGo, India (Hybrid)

ION - Senior Security Architect

ION, Italy (On-Site)

The Walt Disney Company - Software Engineer II, Core Media Manufacturing

The Walt Disney Company, United States (On-Site)

Get notifed when new similar jobs are uploaded

DevOps Jobs

The Walt Disney Company - Senior Real Time Pipeline Engineer (PH)

The Walt Disney Company, United States (On-Site)

Skan AI - Release Manager

Skan AI, India (Hybrid)

Techland - DevOps Engineer

Techland, Poland (On-Site)

Visa - Lead Cloud Network Engineer - GNE

Visa, United States (On-Site)

Axinous - Staff Software Engineer - Risk360

Axinous, United States (Hybrid)

Razer - Lead Site Reliability Engineer

Razer, China (On-Site)

Anthology  Inc  - Principal Software Developer

Anthology Inc , India (On-Site)

Get notifed when new similar jobs are uploaded