Site Reliability Engineer

2 Days ago • 2 Years + • DevOps • Network Engineering

Job Summary

Job Description

This Site Reliability Engineer (SRE) role at Google combines software and systems engineering to build and run large-scale, fault-tolerant systems for Google Cloud services. Responsibilities include managing project priorities, designing and developing software solutions, ensuring service reliability and uptime, optimizing existing systems, building infrastructure through automation, and collaborating with partner teams and users. The SRE will contribute to projects like automated troubleshooting, improved monitoring and service level objectives (SLOs), and service podification. The ideal candidate will possess strong coding, algorithm, and large-scale system design skills, along with excellent collaboration and leadership abilities. They will also operate Google's production network telemetry systems and proactively identify and propose solutions to improve system reliability.
Must have:
  • Bachelor's degree in CS or related field
  • 2 years of experience with data structures/algorithms
  • Software development experience
  • Contribute to automation projects
  • Identify & propose network solutions
  • Collaborate with partner teams
Good to have:
  • Experience with Google production network
  • Excellent leadership skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages.

Preferred qualifications:

  • Experience in software engineering with knowledge of Google production network.
  • Experience researching, proposing, and launching engineering solutions.
  • Ability to collaborate with current and prospective partner teams, product and users to discover their needs and provide solutions.
  • Excellent collaboration skills, with the ability to set technical goals for the team and partners.
  • Excellent leadership skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services (both our internally critical and our externally visible systems) have reliability and uptime appropriate to customers' needs, along with a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

In this role, you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Responsibilities

  • Contribute to landing projects such as Automated Troubleshooting, Better Monitoring and Service Level Objectives (SLOs), and Podification of services.
  • Identify needs across network telemetry services. Propose, build and launch cross-service solutions to satisfy those needs.
  • Motivate improvements in the team's systems, the infrastructure around them, and the network telemetry ecosystem.
  • Engage with partner teams and users to make systems reliable with relatable SLOs. Guide technical plans and goals towards creating reliable systems. Operate the network telemetry systems of Google's production network.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.
