Software Engineer III, Site Reliability Engineering

3 Months ago • 2 Years + • DevOps

Job Summary

Job Description

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.
Must have:
  • Bachelor’s degree in Computer Science or related field
  • 2 years of experience with data structures/algorithms and software development
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and automate routine tasks
  • Systematic problem-solving approach
  • Effective verbal and written communication skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • Candidates will typically have 2 years of experience with data structures/algorithms and software development in one or more programming languages.

Preferred qualifications:

  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Responsibilities

  • Write product or system development code.
  • Review code developed by other engineers and provide feedback to ensure best practices (e.g., style guidelines, checking code in, accuracy, testability, and efficiency).
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback.
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality.
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies.

Similar Jobs

NVIDIA - Senior ASIC Verification Engineer - GPU Memory Subsystem

NVIDIA

Durham, North Carolina, United States (On-Site)
1 Month ago
Epic Games - Animation Tools Programmer

Epic Games

Montreal, Quebec, Canada (On-Site)
1 Month ago
The Walt Disney Company - Software Engineer, Tools Internals (Core)

The Walt Disney Company

California, United States (On-Site)
5 Days ago
Google - Software Developer Intern, PhD, Summer 2025

Google

Waterloo, Ontario, Canada (On-Site)
3 Months ago
Moon Active - Software Developer

Moon Active

Warsaw, Masovian Voivodeship, Poland (On-Site)
3 Weeks ago
Enphase Energy - DevOps Engineer

Enphase Energy

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Epic Games - Build Engineer, Fortnite

Epic Games

Cary, North Carolina, United States (On-Site)
4 Days ago
DaySmart - Senior DevOps Engineer

DaySmart

Hyderabad, Telangana, India (On-Site)
5 Months ago
Canva - Senior Software Engineer (Release Engineering/Python) - open to remote across ANZ

Canva

Sydney, New South Wales, Australia (Remote)
1 Month ago
Zones - Azure Backend Developer

Zones

Noida, Uttar Pradesh, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Zuora - Sr Software Engineer

Zuora

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
ByteDance - Research Scientist Graduate (Foundation Model - Vision and Language)

ByteDance

Seattle, Washington, United States (On-Site)
3 Weeks ago
Meta - Research Scientist Intern, Machine Perception for Input and Interaction (PhD)

Meta

Sausalito, California, United States (On-Site)
3 Months ago
Fluxon - Senior Software Engineer

Fluxon

Bengaluru, Karnataka, India (Remote)
4 Months ago
Salesforce - Senior Software Engineer, Search Infrastructure Data Engineering - Slack

Salesforce

Atlanta, Georgia, United States (On-Site)
4 Months ago
Tesla - Cellular Software Development Engineer

Tesla

Brandenburg, Germany (On-Site)
4 Days ago
Google - Senior Software Engineer, Infrastructure, Google Cloud Networking

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Activision - Senior Gameplay Engineer - High Moon Studios

Activision

Carlsbad, California, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Dublin, County Dublin, Ireland

PwC - Cloud Security | Manager | Cyber Security | Technology Consulting

PwC

Dublin, County Dublin, Ireland (On-Site)
4 Months ago
Logitech - Global Digital Marketing Manager – Paid Media

Logitech

Cork, County Cork, Ireland (Hybrid)
1 Month ago
Playrix - Game Designer

Playrix

Ireland (Remote)
4 Months ago
Riot Games - Senior Software Engineer, Services - Shanghai Game Studio

Riot Games

Dublin, County Dublin, Ireland (On-Site)
3 Months ago
Alphasense - Account Executive

Alphasense

Waterford, County Waterford, Ireland (Remote)
2 Months ago
Microsoft - Digital Technology Specialist

Microsoft

Dublin, County Dublin, Ireland (On-Site)
5 Hours ago
Playrix - Lead User Acquisition Manager

Playrix

Ireland (Remote)
4 Months ago
Google - Trust and Safety Risk Operations Analyst, Google Play

Google

Dublin, County Dublin, Ireland (On-Site)
1 Month ago
Riot Games - Sr. Principal Insights Analyst - VALORANT

Riot Games

Dublin, County Dublin, Ireland (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

ByteDance - Site Reliability Engineer, Traffic Platform - 2025 Start

ByteDance

Singapore (On-Site)
3 Months ago
Revolgy - L1 Cloud Associate

Revolgy

(Remote)
3 Weeks ago
ION - Site Reliability Engineer

ION

Milan, Lombardy, Italy (Hybrid)
4 Months ago
Sony Interactive Entertainment - Senior Cloud Security Engineer

Sony Interactive Entertainment

Tokyo, Japan (On-Site)
2 Months ago
Meltwater - Backend & Cloud Engineer – Javascript

Meltwater

Hyderabad, Telangana, India (Hybrid)
4 Months ago
EXUSIA - AWS DevOps Engineer/Admin

EXUSIA

India (Remote)
4 Months ago
ByteDance - SRE and DevOps Tech Lead - Edge Cloud Infrastructure - London

ByteDance

London, England, United Kingdom (On-Site)
3 Months ago
Interface AI - Principal DevOps

Interface AI

Hyderabad, Telangana, India (Remote)
2 Months ago
Microsoft - Sr. Hardware Engineer - DEBUG

Microsoft

Taipei City, Taiwan (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug