Senior Software Developer, Site Reliability Engineering, Google Cloud

9 Months ago • 5-10 Years • Devops • $161,000 PA - $239,000 PA

Job Summary

Job Description

As a Senior Software Developer, Site Reliability Engineering (SRE) at Google Cloud, you'll be responsible for ensuring the reliability, uptime, and performance of our critical systems. You'll work on optimizing existing systems, building infrastructure, and eliminating work through automation. You'll also collaborate with other teams to ensure our services meet the needs of our customers. This role requires a strong understanding of software development, data structures, algorithms, and large-scale system design. You will have the opportunity to work on complex challenges at scale while using your expertise to make a real impact.

Responsibilities:
  • Engage in and improve the whole lifecycle of services, from inception and design through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.
Must have:
  • Bachelor's degree in Computer Science or related field
  • 5 years of experience with software development
  • 5 years of experience with data structures or algorithms
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems
  • 2 years of experience leading projects and providing technical leadership
Good to have:
  • Master's degree in Computer Science or Engineering

Job Details


Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 5 years of experience with software development in one or more programming languages.
  • 5 years of experience with data structures or algorithms.
  • 3 years of experience in designing, analyzing, and troubleshooting large-scale distributed systems, and 2 years of experience leading projects and providing technical leadership.

Preferred qualifications:

  • Master's degree in Computer Science or Engineering.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services, both our internally critical and our externally visible systems, have reliability and uptime appropriate to customers' needs and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

The US base salary range for this full-time position is $161,000-$239,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits.

Responsibilities

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.

About The Company

Durham, North Carolina, United States (On-Site)
