Systems Engineer III, Host Networking Site Reliability Engineering

8 Hours ago • 2 Years + • DevOps

Job Summary

Job Description

This Systems Engineer III role in Site Reliability Engineering (SRE) at Google focuses on maintaining and improving the reliability and performance of large-scale, distributed systems within Google Cloud. Responsibilities include managing the entire service lifecycle, from design and deployment to operation and refinement; providing guidance to team members on availability, performance, and automation; and leading incident response and postmortems. The ideal candidate possesses strong experience with Unix/Linux systems, networking, and programming, along with expertise in designing, troubleshooting, and optimizing large-scale distributed systems. The role requires managing project priorities, developing software solutions, and scaling systems sustainably through automation.
Must have:
  • Bachelor's degree in CS or related field
  • 2+ years programming experience
  • 2+ years Unix/Linux systems or networking experience
Good to have:
  • Experience in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Debugging, code optimization, automation skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 2 years of experience with programming in one or more programming languages.
  • 2 years of experience working with Unix/Linux systems internals and administration (e.g., filesystems, inodes, system calls) or networking (e.g., TCP/IP, routing, network topologies and hardware, SDN).

Preferred qualifications:

  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Excellent problem-solving approach, with effective verbal and written communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services, both our internally critical and our externally visible systems, have reliability, uptime appropriate to customers' needs, and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale that are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

With your technical expertise you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Improve the whole lifecycle of services from inception and design, through deployment, operation, and refinement.
  • Manage support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  • Provide guidance to other team members on managing availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
  • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health. Lead sustainable incident response and blameless postmortems.
  • Scale systems sustainably through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.
