Software Engineering Manager, Google Kubernetes AI Infrastructure

1 Hour ago • 8-15 Years • Artificial Intelligence • $197,000 PA - $291,000 PA

Job Summary

Job Description

As a Software Engineering Manager on Google Kubernetes Engine (GKE) Artificial Intelligence (AI) Infrastructure, you will lead the strategic expansion of GKE AI infrastructure. Responsibilities include building a reliable, secure, and scalable GKE service, collaborating with multiple feature teams, engaging with customers to drive GKE GPU/TPU roadmap adoption, and managing and coaching a team of engineers. You will participate in Kubernetes community discussions and partner with GCP teams to ship new GPU/TPU management features. The role requires extensive experience in software development, machine learning infrastructure, and technical leadership.
Must have:
  • 8+ years software development experience
  • 4+ years ML infrastructure experience
  • 3+ years technical leadership
  • 2+ years people management
  • Kubernetes expertise
  • Strong communication skills
Good to have:
  • Master's/PhD in CS
  • Experience with distributed systems
  • Open-source contributions
Perks:
  • Bonus
  • Equity
  • Benefits

Job Details

Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, JavaScript).
  • 4 years of experience with Machine Learning Infrastructure (e.g., GPU, cloud TPU), or deep learning frameworks (e.g., TensorFlow, PyTorch, JAX), algorithms and tools (e.g., Kubeflow, Ray.io, MLflow).
  • 3 years of experience in a technical leadership role; overseeing projects, with 2 years of experience in a people management, supervision/team leadership role.

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical field.
  • Experience leading, designing, and implementing distributed systems.
  • Experience innovating technology at scale, while motivating others to act by creating a shared sense of purpose.
  • Experience building upon, contributing to open-source software projects.

About the job

Like Google's own ambitions, the work of a Software Engineer goes beyond just Search. Software Engineering Managers have not only the technical expertise to take on and provide technical leadership to major projects, but also manage a team of Engineers. You not only optimize your own code but make sure Engineers are able to optimize theirs. As a Software Engineering Manager you manage your project goals, contribute to product strategy and help develop your team. Teams work all across the company, in areas such as information retrieval, artificial intelligence, natural language processing, distributed computing, large-scale system design, networking, security, data compression, user interface design; the list goes on and is growing every day. Operating with scale and speed, our exceptional software engineers are just getting started -- and as a manager, you guide the way.

With technical and leadership expertise, you manage engineers across multiple teams and locations, a large product budget and oversee the deployment of large-scale projects across multiple sites internationally.

As an Engineering Manager on Google Kubernetes Engine (GKE) Artificial Intellignece (AI) Infrastructure, you will facilitate the strategic expansion of GKE AI Infrastructure, positioning GKE as the premier IaaS platform for deploying large-scale GenAI workloads. You will present a distinct opportunity to direct the strategy and execution of developing cloud infrastructure to support leading-edge GenAI applications, and to engage in close collaboration with our partners in serving several of Google Cloud Platforms (GCP’s) largest GenAI customers.

The US base salary range for this full-time position is $197,000-$291,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Work with multiple Google Kubernetes Engine (GKE) feature teams to build a reliable, secure, and scalable GKE service.
  • Engage with customers to drive GKE Graphics Processing Unit/Tensor Processing Unit roadmap and to increase GKE/Google Cloud Platform adoption on Artificial Intelligence/Machine Learning workloads.
  • Manage and coach a team of engineers to grow in their careers and overcome tests.
  • Participate in Kubernetes community discussions (via SIG meetings and conferences) to guide other products and present our design and work and collaborate with other companies that are interested in this area.
  • Partner with GCP teams to ship new Graphics Processing Unit/Tensor Processing Unit management features, to advance GKE’s capabilities of running massive-scale GenAI workloads.

Similar Jobs

ByteDance - Software Engineer, SRE - Platform Services

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
ByteDance - Android Software Engineer - Global Payment

ByteDance

Singapore (On-Site)
3 Weeks ago
Netflix - Solutions Support Engineer (L5) - Observability

Netflix

United States (Remote)
2 Months ago
ByteDance - Cloud Security Architect

ByteDance

Singapore (On-Site)
2 Days ago
Google - Silicon Architecture/Design Engineer

Google

Bengaluru, Karnataka, India (On-Site)
1 Hour ago
Google - Software Engineer, PhD, Early Career, Campus, Machine Learning, Systems and Cloud AI, 2025 start

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
DraftKings - Senior Manager, Technical Learning & Development

DraftKings

New York, New York, United States (On-Site)
4 Days ago
ByteDance - Research Scientist - Multimodal Foundation Model - 2025 Start

ByteDance

Singapore (On-Site)
5 Months ago
HP - Machine Learning Intern

HP

Austin, Texas, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Push Gaming - Game Mathematician

Push Gaming

Poland (Hybrid)
4 Weeks ago
LeoVegas - Backend Engineer - Gaming

LeoVegas

Stockholm, Stockholm County, Sweden (Hybrid)
3 Months ago
ION - Senior IT Architect, Italy

ION

Italy (Hybrid)
6 Months ago
The Walt Disney Company - Sr. System Reliability Engineer

The Walt Disney Company

Burbank, California, United States (On-Site)
3 Weeks ago
PwC - Senior Associate_Java Full Stack Developer_Data & Analytics_Advisory_PAN India

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Zazz - Java Developer

Zazz

(Remote)
2 Months ago
ByteDance - Backend Software Engineer

ByteDance

San Jose, California, United States (On-Site)
2 Days ago
ByteDance - Video Experience Software Engineer Intern (Global Streaming Media)

ByteDance

San Jose, California, United States (On-Site)
2 Days ago
Next Level Business Services - ServiceNow Architect

Next Level Business Services

Cupertino, California, United States (On-Site)
6 Months ago
Microsoft - Member of Technical Staff, AI - Pre-Training

Microsoft

Redmond, Washington, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Kirkland, Washington, United States

AGS - American Gaming Systems - Social Media Specialist

AGS - American Gaming Systems

Nevada, United States (On-Site)
3 Weeks ago
Next Level Business Services - Sr. SAP WM/Shipping Consultant

Next Level Business Services

Chicago, Illinois, United States (On-Site)
6 Months ago
ByteDance - Senior Software Engineer - Virtualization

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
Twitch - Senior Product Manager - Commerce

Twitch

Irvine, California, United States (On-Site)
4 Days ago
The Walt Disney Company - Manager, Product Design (Disney +)

The Walt Disney Company

Santa Monica, California, United States (On-Site)
5 Months ago
Google - Senior CPU Design Verification Engineer

Google

Austin, Texas, United States (On-Site)
1 Hour ago
The Walt Disney Company - Principal Product Designer

The Walt Disney Company

Glendale, California, United States (On-Site)
1 Month ago
Evolution - Online Casino Dealer - Live In-Studio - Philadelphia

Evolution

Philadelphia, Pennsylvania, United States (On-Site)
10 Months ago
NVIDIA - Senior ATE Hardware Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
Zoox - Systems Engineer - Vehicle Controls Functional Safety

Zoox

Foster City, California, United States (Hybrid)
6 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Microsoft - Senior Researcher – Systems and Networking

Microsoft

Vancouver, British Columbia, Canada (On-Site)
9 Hours ago
NVIDIA - Technical Marketing Engineer - AI Platform Software

NVIDIA

Canada (Hybrid)
1 Month ago
AI Fund - Founder in Residence/CEO (AI for Construction)

AI Fund

United States (Remote)
2 Weeks ago
Microsoft - Research Intern - AI HW/SW Co-design

Microsoft

Redmond, Washington, United States (On-Site)
9 Hours ago
ByteDance - Research Scientist Graduate (Foundation Model, Video Generation) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Salesforce - 2025 PhD Intern - AI Research, Singapore

Salesforce

Singapore, Singapore (On-Site)
6 Months ago
Keywords Studios - Research Associate - Fresher

Keywords Studios

Karnataka, India (On-Site)
1 Week ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Burlingame, California, United States (On-Site)
5 Months ago
NVIDIA - Deep Learning Software Engineer, Performance Optimization

NVIDIA

Tokyo, Japan (On-Site)
2 Months ago
Spell Brush - LLM Engineer

Spell Brush

San Francisco, California, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Portland, Oregon, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Taipei City, Taiwan (On-Site)

Atlanta, Georgia, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug