Site Reliability Engineer

19 Hours ago • 2 Years + • DevOps • Network Engineering

Job Summary

Job Description

This Site Reliability Engineer (SRE) role at Google combines software and systems engineering to build and run large-scale, fault-tolerant systems for Google Cloud services. Responsibilities include managing project priorities, designing and developing software solutions, ensuring service reliability and uptime, optimizing existing systems, building infrastructure through automation, and collaborating with partner teams and users. The SRE will contribute to projects like automated troubleshooting, improved monitoring and service level objectives (SLOs), and service podification. The ideal candidate will possess strong coding, algorithm, and large-scale system design skills, along with excellent collaboration and leadership abilities. They will also operate Google's production network telemetry systems and proactively identify and propose solutions to improve system reliability.
Must have:
  • Bachelor's degree in CS or related field
  • 2 years experience with data structures/algorithms
  • Software development experience
  • Contribute to automation projects
  • Identify & propose network solutions
  • Collaborate with partner teams
Good to have:
  • Experience with Google production network
  • Excellent leadership skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages.

Preferred qualifications:

  • Experience in software engineering with knowledge of Google production network.
  • Experience with research, propose and launching engineering solutions.
  • Ability to collaborate with current and prospective partner teams, product and users to discover their needs and provide solutions.
  • Excellent collaboration skills with technical goals for the team and partners.
  • Excellent leadership skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

In this role, you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Responsibilities

  • Contribute to land projects like Automated Troubleshooting, Better Monitoring and Service Level Objective (SLOs), Podification of services, etc.
  • Identify needs across network telemetry services. Propose, build and launch cross-service solutions to satisfy those needs.
  • Motivate improvements in the team's systems, infrastructure around them, and network telemetry ecosystem.
  • Engage with partner teams, users to make systems reliable with relatable SLOs. Guide technical plans and goals towards creating reliable systems. Operate the network telemetry systems of Google production network.

Similar Jobs

ByteDance - FPGA Firmware Engineer

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Garena - Intern, Software Engineer

Garena

Singapore (On-Site)
2 Months ago
Warner Bros Games - Staff Software Engineer - Backend (Adtech Team)

Warner Bros Games

Pune, Maharashtra, India (Hybrid)
2 Months ago
NVIDIA - Applied Physics ML Research Intern - Fall 2025

NVIDIA

Santa Clara, California, United States (On-Site)
1 Week ago
Velotio Technologies - Senior GenAI Engineer - .Net

Velotio Technologies

Maharashtra, India (Remote)
1 Week ago
Wipro - Azure AD

Wipro

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Warner Bros Games - Staff Software Engineer - Cloud Support and Operations

Warner Bros Games

Bengaluru, Karnataka, India (Hybrid)
3 Weeks ago
Epic Games - Senior DevOps Programmer

Epic Games

Porto Alegre, State Of Rio Grande Do Sul, Brazil (On-Site)
1 Month ago
Google - Software Developer III, Google Kubernetes Engine, Anthos Networking

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago
Nagarro - Power Platform Developer

Nagarro

Cebu City, Central Visayas, Philippines (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Senior ASIC Verification Engineer - HSIO

NVIDIA

Westford, Massachusetts, United States (On-Site)
3 Months ago
NVIDIA - Senior Developer Technology Engineer - AI

NVIDIA

Westford, Massachusetts, United States (Hybrid)
1 Month ago
Google - Senior Research Engineer, AI/ML

Google

London, England, United Kingdom (On-Site)
1 Week ago
Google - Senior Software Engineer, Core Machine Learning, Google Cloud

Google

Mountain View, California, United States (On-Site)
1 Week ago
NVIDIA - Senior Synthesis Flow Development Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
ByteDance - Research Scientist, Multimodality

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Google - Software Engineer III, Full Stack, Google Play Games

Google

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Google - Software Engineer III, AI/ML GenAI, Google Cloud

Google

Hyderabad, Telangana, India (On-Site)
1 Week ago
Google - Software Engineer III

Google

Mountain View, California, United States (On-Site)
16 Hours ago

Get notifed when new similar jobs are uploaded

Jobs in Dublin, County Dublin, Ireland

PwC - Cyber Governance Risk & Compliance| Manager | Cyber Security | Technology Consulting

PwC

Dublin, County Dublin, Ireland (On-Site)
6 Months ago
Salesforce - Sales Development Representative

Salesforce

Dublin, County Dublin, Ireland (On-Site)
3 Weeks ago
Keywords Studios - IT Senior Support Manager

Keywords Studios

County Dublin, Ireland (Hybrid)
1 Week ago
Google - Accelerated Growth Consultant, GCS

Google

Dublin, County Dublin, Ireland (On-Site)
18 Hours ago
Playrix - AI Motion Designer

Playrix

Ireland (Remote)
1 Month ago
Google - Account Executive, Mid-Market Sales

Google

Dublin, County Dublin, Ireland (On-Site)
1 Week ago
Riot Games - Software Engineer - Platform & Tools (Contractor)

Riot Games

Dublin, County Dublin, Ireland (On-Site)
5 Months ago
PlayStation Global - Data Scientist/Analyst

PlayStation Global

Dublin, County Dublin, Ireland (On-Site)
2 Weeks ago
Romero Games - Data Analyst

Romero Games

Galway, County Galway, Ireland (Remote)
2 Weeks ago
Google - Customer Growth Associate

Google

Dublin, County Dublin, Ireland (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

PlayStation Global - Platform Engineer

PlayStation Global

Adelaide, South Australia, Australia (On-Site)
1 Month ago
CLO Virtual Fashion  Inc  - DevOps Engineer

CLO Virtual Fashion Inc

Bengaluru, Karnataka, India (On-Site)
6 Months ago
CharacterAI - Staff Software Engineer, Site Reliability (SRE)

CharacterAI

San Francisco, California, United States (On-Site)
1 Week ago
Zazz - Data Engineer

Zazz

(Remote)
3 Months ago
Scopely - Senior Security IAM Engineer

Scopely

Lisbon, Lisbon, Portugal (Hybrid)
1 Month ago
Google - Staff Software Engineering Manager, Sustainability and Efficiency

Google

Raleigh, North Carolina, United States (On-Site)
20 Hours ago
Epic Games - Senior DevOps Programmer

Epic Games

Montreal, Quebec, Canada (On-Site)
1 Month ago
Google - Principal Architect III, Retail, Google Cloud

Google

Addison, Texas, United States (On-Site)
1 Week ago
N-iX - Middle DevOps Engineer

N-iX

Colombia (Remote)
2 Weeks ago
Zazz - Java Developer

Zazz

(Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Dublin, County Dublin, Ireland (On-Site)

New York, New York, United States (On-Site)

Waterloo, Ontario, Canada (On-Site)

Taipei City, Taiwan (On-Site)

San Francisco, California, United States (On-Site)

Saint-Ghislain, Wallonia, Belgium (On-Site)

Bengaluru, Karnataka, India (On-Site)

Austin, Texas, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug