Staff Software Engineer, Site Reliability Engineering

2 Months ago • 8+ Years • Backend Development • DevOps

Job Summary

Job Description

This Staff Software Engineer, Site Reliability Engineering (SRE) role at Google combines software and systems engineering to build and run large-scale, fault-tolerant systems. Responsibilities include the entire service lifecycle: design, deployment, operation, and refinement. You'll support services before launch (system design, capacity planning), monitor system health post-launch, and scale systems sustainably through automation. The role also involves incident response, blameless postmortems, and improving reliability and velocity. Candidates need significant experience with data structures, algorithms, software development, and distributed systems.
Must have:
  • Bachelor's degree in CS or related field
  • 8+ years experience with data structures/algorithms
  • 5+ years software development experience
  • 3+ years leading projects and designing distributed systems
  • Experience with system design, capacity planning, and monitoring
Good to have:
  • Master's degree in CS or Engineering

Job Details


Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 8 years of experience with data structures or algorithms.
  • 5 years of experience with software development in one or more programming languages.
  • 3 years of experience leading projects and designing, analyzing, and troubleshooting distributed systems.

Preferred qualifications:

  • Master's degree in Computer Science or Engineering.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services (both our internally critical and our externally visible systems) have reliability and uptime appropriate to customers' needs, and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.
