Staff Software Engineer, Turnup Site Reliability Engineering

1 Month ago • 8 Years + • DevOps • Backend Development

Job Description

The Staff Software Engineer, Turnup Site Reliability Engineering role at Google involves improving the entire lifecycle of services, from design through deployment and maintenance. Responsibilities include system design consulting, developing software platforms, capacity planning, monitoring system health, and scaling systems sustainably through automation. The role requires expertise in data structures, algorithms, distributed systems, and large-scale system design. The team ensures the reliability, uptime, and performance of Google Cloud services, supported by a culture of intellectual curiosity and problem solving. The work includes practicing sustainable incident response and conducting blameless postmortems. The position calls for collaboration with a multi-site SRE team and the ability to drive projects independently.
Must have:
  • Bachelor's degree in CS or a related field, or equivalent practical experience
  • 8+ years of experience with data structures or algorithms
  • 5+ years of software development experience
  • 3+ years of experience leading projects and designing distributed systems
  • Experience with system design, capacity planning, and monitoring
Good to have:
  • Experience working across organizational boundaries
  • Experience driving medium to large projects independently
  • Ability to collaborate effectively on a multi-site SRE team

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 8 years of experience with data structures or algorithms.
  • 5 years of experience with software development in one or more programming languages.
  • 3 years of experience leading projects and designing, analyzing, and troubleshooting distributed systems.

Preferred qualifications:

  • Experience working across organizational boundaries.
  • Experience driving medium- to large-sized projects independently.
  • Ability to collaborate effectively on a multi-site SRE team.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services, both our internally critical and our externally visible systems, have reliability, uptime appropriate to customers' needs, and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure, and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale that are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis, and large-scale system design. SRE's culture of intellectual curiosity, problem solving, and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences, and perspectives. We encourage them to collaborate, think big, and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Engage in and improve the whole lifecycle of services, from inception and design, through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.
