Software Engineering Manager II, Site Reliability Engineering

1 Month ago • 8-13 Years • Backend Development

Job Summary

Job Description

As a Software Engineering Manager II in Site Reliability Engineering (SRE) at Google, you will lead a team responsible for the global uptime of key services. Responsibilities include owning end-to-end availability and performance, building automation to prevent issues, mentoring the team, managing on-call rotations, and designing/delivering software to improve Google's service efficiency and scalability. You'll leverage expertise in coding, algorithms, and large-scale system design to address complex scalability challenges unique to Google. The role requires experience in designing, analyzing, and troubleshooting distributed systems, along with strong people management skills.
Must have:
  • 8+ years experience with data structures/algorithms
  • 5+ years software development experience
  • 3+ years people management experience
  • Experience with distributed systems
  • Lead a team of engineers
  • Own service availability and performance
Good to have:
  • Experience in computing, distributed systems, storage, or networking
  • Expertise in large-scale distributed systems
  • Debugging, code optimization, automation skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • Candidates will typically have 8 years of experience with data structures or algorithms.
  • Typically 5 years of experience with software development in one or more programming languages.
  • Typically 3 years of people management experience, and experience designing, analyzing, and troubleshooting distributed systems.

Preferred qualifications:

  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.


Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large-scale system design.

SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

To learn more: check out our books on or read a about why a Software Engineer chose to join SRE.

As an Engineering Manager, you'll lead a team and be responsible for products globally, providing technical leadership to key projects and empowering and developing teams to do the same.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Lead a team of Software/Systems Engineers on projects for users and be directly responsible for uptime.
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence. Automate response to all non-exceptional service conditions.
  • Lead by example, mentor the team and establish credibility through quality technical execution.
  • Manage on-call rotations across continents, using a follow-the-sun model.
  • Design, write and deliver software to improve the availability, scalability, latency and efficiency of Google's services.

Similar Jobs

Qualcomm - Embedded Platform Dev- Lead Engineer, Senior

Qualcomm

Bengaluru, Karnataka, India (On-Site)
1 Week ago
NVIDIA - System Software Architect, Programmable Vision Accelerator

NVIDIA

Pune, Maharashtra, India (On-Site)
3 Months ago
Zynga - Software Engineer

Zynga

Bengaluru, Karnataka, India (On-Site)
3 Days ago
Hitachi - Artificial Intelligence - JBU

Hitachi

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
Genies - Machine Learning Infrastructure Engineer, 3D Model Inference & Deployment

Genies

Los Angeles, California, United States (On-Site)
3 Months ago
Sporty Group - Weekend Backend Engineer

Sporty Group

(On-Site)
10 Months ago
Playtika - PHP Technical Lead

Playtika

Ukraine (On-Site)
1 Month ago
Red Rover Interactive - Senior Server programmer

Red Rover Interactive

Oslo, Oslo, Norway (Hybrid)
1 Year ago
Liquidnitro Games - Software Engineer

Liquidnitro Games

Hyderabad, Telangana, India (On-Site)
6 Months ago
playrix  - Engineering Manager (Golang)

playrix

Ireland (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

CD PROJEKT RED - Senior Gameplay Designer

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
4 Weeks ago
G5 games - C++ Gameplay Programmer

G5 games

Yerevan, Yerevan, Armenia (Remote)
6 Months ago
Genies - Machine Learning Engineer, Character Animation & Motion AI

Genies

Los Angeles, California, United States (On-Site)
3 Months ago
Fictiv - Sr. Product Designer

Fictiv

Pune, Maharashtra, India (On-Site)
1 Week ago
Google - Staff Software Engineer, Storage

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Forcepoint - Software Engineer II - Golang

Forcepoint

Thane, Maharashtra, India (On-Site)
1 Day ago
ManyChat - Technical SEO Specialist

ManyChat

Austin, Texas, United States (Hybrid)
1 Day ago
Balbix - Staff /Sr Staff/ Principal Engineer - Lakehouse

Balbix

Gurugram, Haryana, India (On-Site)
7 Months ago
Falcon X - Senior Software Engineer, Blockchain

Falcon X

New York, New York, United States (On-Site)
4 Weeks ago
Google - Engineering Manager, YouTube Developer Infrastructure

Google

Bengaluru, Karnataka, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Zürich, Zurich, Switzerland

Tesla - Area Parts Supervisor

Tesla

Cham, Zug, Switzerland (On-Site)
3 Months ago
Microsoft - Member of Technical Staff, AI Pre-Training

Microsoft

Zürich, Zurich, Switzerland (On-Site)
1 Month ago
Tesla - Automotive Mechatronics Technician

Tesla

St. Gallen, St. Gallen, Switzerland (On-Site)
3 Months ago
Tesla - Employee Advisor

Tesla

Zug, Zug, Switzerland (On-Site)
3 Months ago
Toku - International Payroll Lead/Analyst (Switzerland)

Toku

Switzerland (Remote)
4 Months ago
PwC - Manager SAP Data Migration Consulting

PwC

Zürich, Zurich, Switzerland (On-Site)
6 Months ago
Haleon - Senior Toxicologist

Haleon

Nyon, Vaud, Switzerland (On-Site)
2 Weeks ago
PwC - Manager / Senior Manager Cyber Technology and Transformation

PwC

Zürich, Zurich, Switzerland (On-Site)
8 Months ago
Tesla - Automotive Mechatronics Technician

Tesla

Cham, Zug, Switzerland (On-Site)
3 Months ago
Flowable - Senior Sales Executive

Flowable

Zürich, Zurich, Switzerland (Hybrid)
1 Day ago

Get notifed when new similar jobs are uploaded

Backend Development Jobs

Hedra - Senior Backend Engineer

Hedra

San Francisco, California, United States (On-Site)
2 Months ago
Rennsportgg - Senior Backend Engineer (f/m/x)

Rennsportgg

Sweden (Remote)
9 Months ago
Argus Labs - Software Engineer (Junior/Fresh Graduate)

Argus Labs

Indonesia (Remote)
2 Months ago
Google - Software Engineer III, Infrastructure, Platforms Infrastructure Engineering

Google

Sunnyvale, California, United States (On-Site)
5 Months ago
Assist software  - C++ Developer

Assist software

Suceava, Suceava County, Romania (On-Site)
6 Months ago
Voodoo - Principal Engineer

Voodoo

Paris, Île-de-France, France (Remote)
1 Month ago
Epic Games - Senior Data Scientist

Epic Games

(On-Site)
3 Months ago
bytedance - Senior Software Development Engineer - Distributed NoSQL Database Systems

bytedance

San Jose, California, United States (On-Site)
4 Months ago
Red Point Labs - Java Backend Developer (Remote OK)

Red Point Labs

Argentina (Remote)
1 Year ago
Maxis Studios - Senior Multiplayer & Online Engineer

Maxis Studios

Victoria, Australia (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded