Software Engineering Manager, Google Kubernetes AI Infrastructure

1 Month ago • 8-15 Years • Artificial Intelligence • $197,000 PA - $291,000 PA

Job Summary

Job Description

As a Software Engineering Manager on Google Kubernetes Engine (GKE) Artificial Intelligence (AI) Infrastructure, you will lead the strategic expansion of GKE AI infrastructure. Responsibilities include building a reliable, secure, and scalable GKE service, collaborating with multiple feature teams, engaging with customers to drive GKE GPU/TPU roadmap adoption, and managing and coaching a team of engineers. You will participate in Kubernetes community discussions and partner with GCP teams to ship new GPU/TPU management features. The role requires extensive experience in software development, machine learning infrastructure, and technical leadership.
Must have:
  • 8+ years software development experience
  • 4+ years ML infrastructure experience
  • 3+ years technical leadership
  • 2+ years people management
  • Kubernetes expertise
  • Strong communication skills
Good to have:
  • Master's/PhD in CS
  • Experience with distributed systems
  • Open-source contributions
Perks:
  • Bonus
  • Equity
  • Benefits

Job Details

Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, JavaScript).
  • 4 years of experience with Machine Learning Infrastructure (e.g., GPU, cloud TPU), or deep learning frameworks (e.g., TensorFlow, PyTorch, JAX), algorithms and tools (e.g., Kubeflow, Ray.io, MLflow).
  • 3 years of experience in a technical leadership role overseeing projects, with 2 years of experience in a people management or supervision/team leadership role.

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical field.
  • Experience leading, designing, and implementing distributed systems.
  • Experience innovating technology at scale, while motivating others to act by creating a shared sense of purpose.
  • Experience building upon and contributing to open-source software projects.

About the job

Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers not only have the technical expertise to take on and provide technical leadership to major projects, but also manage a team of Engineers. You not only optimize your own code but also make sure Engineers are able to optimize theirs. As a Software Engineering Manager, you manage your project goals, contribute to product strategy, and help develop your team. Teams work all across the company, in areas such as information retrieval, artificial intelligence, natural language processing, distributed computing, large-scale system design, networking, security, data compression, and user interface design; the list goes on and is growing every day. Operating with scale and speed, our exceptional software engineers are just getting started -- and as a manager, you guide the way.

With technical and leadership expertise, you manage engineers across multiple teams and locations, manage a large product budget, and oversee the deployment of large-scale projects across multiple sites internationally.

As an Engineering Manager on Google Kubernetes Engine (GKE) Artificial Intelligence (AI) Infrastructure, you will facilitate the strategic expansion of GKE AI Infrastructure, positioning GKE as the premier IaaS platform for deploying large-scale GenAI workloads. This role presents a distinct opportunity to direct the strategy and execution of developing cloud infrastructure to support leading-edge GenAI applications, and to collaborate closely with our partners in serving several of Google Cloud Platform’s (GCP’s) largest GenAI customers.

The US base salary range for this full-time position is $197,000-$291,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits.

Responsibilities

  • Work with multiple Google Kubernetes Engine (GKE) feature teams to build a reliable, secure, and scalable GKE service.
  • Engage with customers to drive the GKE Graphics Processing Unit (GPU)/Tensor Processing Unit (TPU) roadmap and to increase GKE/Google Cloud Platform (GCP) adoption for Artificial Intelligence/Machine Learning workloads.
  • Manage and coach a team of engineers to grow in their careers and overcome challenges.
  • Participate in Kubernetes community discussions (via SIG meetings and conferences) to guide other products, present our design and work, and collaborate with other companies that are interested in this area.
  • Partner with GCP teams to ship new GPU/TPU management features that advance GKE’s capabilities for running massive-scale GenAI workloads.
