Software Engineering Manager II, Site Reliability Engineering

1 Month ago • 8-13 Years • Backend Development

Job Summary

Job Description

As a Software Engineering Manager II in Site Reliability Engineering (SRE) at Google, you will lead a team responsible for the global uptime and performance of key services. Responsibilities include designing, building, and maintaining large-scale distributed systems, automating responses to service conditions, mentoring team members, managing on-call rotations, and improving service availability, scalability, latency, and efficiency. You will leverage expertise in coding, algorithms, and large-scale system design to tackle complex challenges inherent in Google's scale. The role demands strong technical leadership, problem-solving skills, and effective communication.
Must have:
  • Bachelor's degree in CS or related field
  • 8+ years experience with data structures/algorithms
  • 5+ years software development experience
  • 3+ years people management experience
  • Experience with distributed systems
  • Lead engineering teams
  • Ensure service availability and performance
Good to have:
  • Experience in computing, distributed systems, storage, or networking
  • Expertise in large-scale distributed systems
  • Debugging, code optimization, and automation skills
  • Systematic problem-solving and communication skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 8 years of experience with data structures or algorithms.
  • 5 years of experience with software development in one or more programming languages.
  • 3 years of people management experience, and experience designing, analyzing, and troubleshooting distributed systems.

Preferred qualifications:

  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Systematic problem-solving approach, coupled with effective communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large-scale system design.

SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

To learn more: check out our books on or read a about why a Software Engineer chose to join SRE.

As an Engineering Manager, you'll lead a team and be responsible for products globally, providing technical leadership to key projects and empowering and developing teams to do the same.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime.
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence. Automate response to all non-exceptional service conditions.
  • Lead by example, mentor the team and establish credibility through quality technical execution.
  • Manage on-call rotations across continents, using a follow-the-sun model.
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services.

Similar Jobs

Grab - Principal Software Engineer, Fulfilment

Grab

Jakarta, Indonesia (On-Site)
3 Weeks ago
Inkittt - Author Experience Manager

Inkittt

San Francisco, California, United States (Hybrid)
6 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Video Generative Model)

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Google - Senior Imaging and On-Device Machine Learning Software Engineer, Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
Numrah - Fullstack Engineer

Numrah

(Remote)
2 Weeks ago
Epic Games - Machine Learning Ops Engineer

Epic Games

London, England, United Kingdom (On-Site)
4 Months ago
Limit Break - Senior Backend Engineer, Core Services

Limit Break

Tokyo, Japan (On-Site)
3 Months ago
ByteDance - Senior Software Development Engineer - Cloud Native Databases

ByteDance

San Jose, California, United States (On-Site)
4 Months ago
SmileGate - Billing/Store Service Developer

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
4 Months ago
Microsoft - Member of Technical Staff – Data Engineer

Microsoft

New York, New York, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Inkittt - PR Manager

Inkittt

San Francisco, California, United States (On-Site)
11 Months ago
Visa - Staff Data Engineer

Visa

Warsaw, Masovian Voivodeship, Poland (Hybrid)
7 Months ago
Ubisoft - Research Internship (F/M/NB) – Crafting NPCs & Bots behaviors with LLM/VLM - La Forge

Ubisoft

Bordeaux, Nouvelle-Aquitaine, France (Hybrid)
2 Weeks ago
Playtika - Data Scientist

Playtika

Israel (On-Site)
2 Months ago
London stock Exchange - Lead Cloud Site Reliability Engineer

London stock Exchange

St. Louis, Missouri, United States (On-Site)
1 Month ago
Inkittt - Director of Engineering

Inkittt

San Francisco, California, United States (Hybrid)
4 Months ago
Argus Labs - Senior Software Engineer (Infrastructure/Backend)

Argus Labs

(Remote)
2 Months ago
Rebellion - AI Gameplay Programmer

Rebellion

Runcorn, England, United Kingdom (Hybrid)
2 Months ago
IBKR External - Software Engineer

IBKR External

Hyderabad, Telangana, India (Hybrid)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Dublin, County Dublin, Ireland

Monzo - Senior Financial Crime Risk Manager

Monzo

Dublin, County Dublin, Ireland (On-Site)
2 Weeks ago
Playrix - Senior Release Automation Engineer (Gardenscapes)

Playrix

Ireland (Remote)
4 Months ago
Affinidi - Engineering Manager, Full Stack

Affinidi

Dublin, County Dublin, Ireland (Hybrid)
2 Weeks ago
PlayStation Global - Staff Software Engineer

PlayStation Global

Dublin, County Dublin, Ireland (On-Site)
1 Month ago
PwC - Senior Manager - International Tax (FDI)

PwC

Dublin, County Dublin, Ireland (On-Site)
8 Months ago
Google - Media Solutions Specialist, LCS, Central Europe (English, German)

Google

Dublin, County Dublin, Ireland (On-Site)
1 Month ago
Putnam Associates - Principal, Health Economic Modelling (HEOR)

Putnam Associates

Dublin, County Dublin, Ireland (Hybrid)
2 Weeks ago
Playrix - Senior/Lead 2D Artist (Generalist)

Playrix

Ireland (Remote)
7 Months ago
Playrix - Senior QA Engineer (Core Team)

Playrix

Ireland (Remote)
1 Month ago
Playrix - Middle C++ Software Engineer (Gameplay)

Playrix

Ireland (Remote)
7 Months ago

Get notifed when new similar jobs are uploaded

Backend Development Jobs

Good Job Games - Software Engineer

Good Job Games

İstanbul, Türkiye (On-Site)
6 Months ago
Balbix - Sr Staff Engineer - Connector

Balbix

Bengaluru, Karnataka, India (On-Site)
7 Months ago
The Walt Disney Company - Senior Software Engineer - Activation

The Walt Disney Company

Santa Monica, California, United States (On-Site)
1 Month ago
Playrix - Senior Python Developer

Playrix

Ireland (Remote)
4 Months ago
Gaming Innovation Group  - Senior .NET Backend Developer

Gaming Innovation Group

St. Julian's, Malta (Hybrid)
2 Months ago
Argus Labs - Software Engineer (Infrastructure/Backend)

Argus Labs

(Remote)
2 Months ago
Google - Software Engineering Manager II, Google Photos

Google

Mountain View, California, United States (On-Site)
1 Month ago
ByteDance - Software Development Engineer - Distributed NoSQL Database Systems

ByteDance

Seattle, Washington, United States (On-Site)
4 Months ago
CloudLinux - Python Developer

CloudLinux

Tbilisi, Tbilisi, Georgia (Remote)
1 Month ago
Microsoft - Software Engineer II / Senior Software Engineer

Microsoft

(Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

London, England, United Kingdom (On-Site)

Bengaluru, Karnataka, India (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Zürich, Zurich, Switzerland (On-Site)

Kirkland, Washington, United States (On-Site)

New Taipei, New Taipei City, Taiwan (On-Site)

Seattle, Washington, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug