Software Engineer III, Site Reliability Engineering

3 Months ago • 2 Years + • DevOps

Job Summary

Job Description

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.
Must have:
  • Bachelor’s degree in Computer Science or related field
  • 2 years of experience with data structures/algorithms and software development
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and automate routine tasks
  • Systematic problem-solving approach
  • Effective verbal and written communication skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • Candidates will typically have 2 years of experience with data structures/algorithms and software development in one or more programming languages.

Preferred qualifications:

  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Responsibilities

  • Write product or system development code.
  • Review code developed by other engineers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.

Similar Jobs

Google - Software Engineer III, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
Google - Senior Software Engineer, Machine Learning, Google Cloud Compute

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
CloudLinux - Senior C Developer (worldwide remote, work anywhere)

CloudLinux

Sofia, Sofia City Province, Bulgaria (Remote)
2 Months ago
Google - Software Engineer III, Security/Privacy, Google Cloud Security and Privacy

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Unity - Scientifique de données sénior | Senior Data Scientist

Unity

Montreal, Quebec, Canada (On-Site)
2 Months ago
Mattel  Inc  - Manager, Development Live Ops

Mattel Inc

El Segundo, California, United States (On-Site)
3 Months ago
Dentsu - Senior Integration Developer

Dentsu

Pune, Maharashtra, India (On-Site)
4 Months ago
InvenioLSI - MuleSoft Architecht

InvenioLSI

Dubai, Dubai, United Arab Emirates (On-Site)
1 Month ago
Trimble  Inc  - Lead DevOps Engineer

Trimble Inc

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
Surge Technology Solutions  Inc  - DevOps Engineer with AWS

Surge Technology Solutions Inc

Bengaluru, Karnataka, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Senior Staff Software Engineer, Google Cloud

Google

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Netflix - Machine Learning Intern - Spring or Summer 2025

Netflix

Los Gatos, California, United States (On-Site)
3 Months ago
BestEx Research - Senior Software Engineer

BestEx Research

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Luxoft - Senior/Lead Machine Learning and Image Processing Specialist

Luxoft

Poland, Ohio, United States (Remote)
2 Months ago
AI Fund - Artificial Intelligence Engineer

AI Fund

California, United States (Remote)
3 Months ago
ByteDance - Senior Machine Learning Engineer

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
The Walt Disney Company - Manager, Software Engineering(Scala)

The Walt Disney Company

San Francisco, California, United States (On-Site)
2 Months ago
DGS - Telecom Data Scientist

DGS

McLean, Virginia, United States (On-Site)
10 Months ago
Captions - Software Engineer, iOS (3+ years of experience)

Captions

New York, New York, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Dublin, County Dublin, Ireland

PwC - Cyber Governance Risk & Compliance| Manager | Cyber Security | Technology Consulting

PwC

Dublin, County Dublin, Ireland (On-Site)
4 Months ago
Scopely - Senior Site Reliability Engineer - Unannounced Project

Scopely

Dublin, County Dublin, Ireland (Hybrid)
3 Weeks ago
Romero Games - Online Programmer

Romero Games

Galway, County Galway, Ireland (Remote)
3 Months ago
PwC - Marketing Manager

PwC

Dublin, County Dublin, Ireland (On-Site)
1 Month ago
Riot Games - Senior Software Engineer - VALORANT - Foundations Developer Experience & Workflows

Riot Games

Dublin, County Dublin, Ireland (On-Site)
2 Months ago
Intel Corporation - Ireland Site Tax Manager

Intel Corporation

Ireland (Hybrid)
2 Months ago
PwC - Tax (Financial Services) - AWM - Manager

PwC

Dublin, County Dublin, Ireland (On-Site)
4 Months ago
Playrix - Senior Node.js Developer (Server)

Playrix

Ireland (Remote)
1 Week ago
Riot Games - Principal Software Engineer, Gameplay - Teamfight Tactics

Riot Games

Dublin, County Dublin, Ireland (On-Site)
2 Months ago
Riot Games - Senior Manager, Technical Production - Teamfight Tactics, Core Tech

Riot Games

Dublin, County Dublin, Ireland (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Microsoft - Senior Engineering Manager – CI/CD Engineering

Microsoft

Hyderabad, Telangana, India (On-Site)
2 Weeks ago
Ness Digital - Site Reliability Engineer

Ness Digital

Timișoara, Timiș, Romania (Hybrid)
1 Month ago
Britive - ENGINEERING MANAGER

Britive

Bengaluru, Karnataka, India (Remote)
2 Months ago
Intel Corporation - DevOps infra - k8s Engineer

Intel Corporation

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
1 Month ago
ION - Site Reliability Engineer

ION

London, England, United Kingdom (Hybrid)
4 Months ago
Meta - Production Engineer

Meta

London, England, United Kingdom (On-Site)
3 Months ago
ByteDance - Senior Site Reliability Engineer - Data Infrastructure (Seattle)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Aera Technology - Senior Platform Administration Engineer

Aera Technology

Bucharest, Bucharest, Romania (Hybrid)
3 Months ago
Kefir Games - Middle/Senior DevOps Engineer

Kefir Games

Cyprus (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug