Site Reliability Engineer

1 Month ago • 2 Years + • DevOps • Network Engineering

Job Summary

Job Description

This Site Reliability Engineer (SRE) role at Google combines software and systems engineering to build and run large-scale, fault-tolerant systems for Google Cloud services. Responsibilities include managing project priorities, designing and developing software solutions, ensuring service reliability and uptime, optimizing existing systems, building infrastructure through automation, and collaborating with partner teams and users. The SRE will contribute to projects like automated troubleshooting, improved monitoring and service level objectives (SLOs), and service podification. The ideal candidate will possess strong coding, algorithm, and large-scale system design skills, along with excellent collaboration and leadership abilities. They will also operate Google's production network telemetry systems and proactively identify and propose solutions to improve system reliability.
Must have:
  • Bachelor's degree in CS or related field
  • 2 years experience with data structures/algorithms
  • Software development experience
  • Contribute to automation projects
  • Identify & propose network solutions
  • Collaborate with partner teams
Good to have:
  • Experience with Google production network
  • Excellent leadership skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages.

Preferred qualifications:

  • Experience in software engineering with knowledge of Google production network.
  • Experience researching, proposing, and launching engineering solutions.
  • Ability to collaborate with current and prospective partner teams, product and users to discover their needs and provide solutions.
  • Excellent collaboration skills, including aligning on technical goals with the team and partners.
  • Excellent leadership skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services, both our internally critical and our externally visible systems, have reliability and uptime appropriate to customers' needs and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

In this role, you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Responsibilities

  • Contribute to and land projects such as Automated Troubleshooting, Better Monitoring and Service Level Objectives (SLOs), and Podification of services.
  • Identify needs across network telemetry services. Propose, build and launch cross-service solutions to satisfy those needs.
  • Motivate improvements in the team's systems, the infrastructure around them, and the network telemetry ecosystem.
  • Engage with partner teams and users to make systems reliable with relatable SLOs. Guide technical plans and goals towards creating reliable systems. Operate the network telemetry systems of the Google production network.
