Senior Systems Engineer, Cloud Filestore, Site Reliability Engineering

2 Days ago • 5 Years + • DevOps • Backend Development

Job Summary

Job Description

This Senior Systems Engineer role within Google Cloud's Site Reliability Engineering (SRE) team focuses on Cloud Filestore. Responsibilities include improving the service lifecycle (design, deployment, operation, refinement), guiding team members on availability and performance, building automation to prevent problems, and leading incident response and postmortems. The role requires scaling systems sustainably, managing support services, and collaborating on system design, software platforms, capacity planning, and launch reviews. The ideal candidate will have strong experience with programming languages, distributed systems, and a proven track record of project leadership.
Must have:
  • Bachelor's degree in CS or related field
  • 5+ years programming experience
  • 2+ years project leadership experience
  • Experience with distributed systems
  • Excellent problem-solving skills
Good to have:
  • Experience with Java, C, C++, Go, Python, Shell, Perl, JavaScript
  • Experience in computing, distributed systems, storage, or networking

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 5 years of experience with programming in one or more programming languages.
  • 2 years of experience leading projects.

Preferred qualifications:

  • Experience with managing hosted services/SaaS including one or more of the following programming/scripting languages: Java, C, C++, Go, Python, shell, Perl, JavaScript.
  • Experience working in computing, distributed systems, storage, or networking.
  • Excellent ability in designing, analyzing, and troubleshooting distributed systems.
  • Excellent problem-solving approach, with effective communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Improve the whole life-cycle of services from inception and design, through deployment, operation, and refinement.
  • Provide guidance to other team members on managing availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
  • Maintain services by measuring and monitoring availability, latency, and overall system health. Lead incident response and blameless postmortems.
  • Scale systems sustainably through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity.
  • Manage support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.

Similar Jobs

NVIDIA - Senior Firmware Engineer - Memory Subsystem

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
Trend Micro - Large Language Models (LLM) Expert (VicOne_Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago
Wrike - Mid Senior Backend Engineer

Wrike

Bengaluru, Karnataka, India (Hybrid)
7 Hours ago
Side - Software Engineer - PHP

Side

Hyderabad, Telangana, India (On-Site)
23 Hours ago
Riot Games - Staff Software Engineer, Game Build

Riot Games

Los Angeles, California, United States (On-Site)
1 Day ago
DraftKings - Manager, System DBA Operations

DraftKings

Sofia, Sofia City Province, Bulgaria (On-Site)
5 Months ago
Google - Software Engineer, Site Reliability Engineering

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
Google - Customer Engineer III, Application Modernization, Google Cloud

Google

San Francisco, California, United States (On-Site)
2 Weeks ago
N-iX - Senior Python Engineer (Part-Time)

N-iX

Poland (Remote)
3 Days ago
Epic Games - Senior DevOps Programmer

Epic Games

Montreal, Quebec, Canada (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

The Walt Disney Company - Software Engineer II - Databases

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
1 Month ago
Egnyte - Principal Machine Learning Engineer - AI

Egnyte

India (Remote)
1 Month ago
Stake Logic - Java Back-end Developer

Stake Logic

(Remote)
2 Months ago
Actian - Zen Sustaining Engineer - Bangalore/Pune

Actian

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Netflix - Data Engineer (L4) - Security

Netflix

Los Gatos, California, United States (Hybrid)
2 Weeks ago
ByteDance - Software Engineer Intern, Security Engineering

ByteDance

Singapore (On-Site)
2 Weeks ago
Dun & Bradstreet - 2025 Summer Internship Program - Technology

Dun & Bradstreet

Jacksonville, Florida, United States (On-Site)
6 Months ago
Google - Software Engineer, NetSoft

Google

Sydney, New South Wales, Australia (On-Site)
1 Week ago
SmartBear - Zephyr Enterprise Senior Software Engineer Customer support

SmartBear

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Enverus - Staff Software Engineer

Enverus

Calgary, Alberta, Canada (On-Site)
1 Day ago

Get notifed when new similar jobs are uploaded

Jobs in Dublin, County Dublin, Ireland

Playrix - Senior QA Engineer (Mobile)

Playrix

Ireland (Remote)
6 Months ago
OKX - Privacy Analyst

OKX

Dublin, County Dublin, Ireland (On-Site)
9 Hours ago
Google - Account Strategist, Mid-Market Sales

Google

Dublin, County Dublin, Ireland (On-Site)
1 Week ago
Keywords Studios - Keywords Talent Community

Keywords Studios

Ireland (Remote)
1 Month ago
Whatnot - Senior IT Systems & Ops Engineer

Whatnot

Dublin, County Dublin, Ireland (On-Site)
1 Day ago
Google - Senior Networking Systems Engineer, Site Reliability Engineering

Google

Dublin, County Dublin, Ireland (On-Site)
2 Days ago
Notion - Enterprise Technical Support, German, EMEA

Notion

Dublin, County Dublin, Ireland (On-Site)
6 Months ago
sitecore - VP, Treasury & Tax

sitecore

Dublin, County Dublin, Ireland (On-Site)
1 Month ago
Google - Account Executive, New Business Sales

Google

Dublin, County Dublin, Ireland (On-Site)
2 Weeks ago
Google - Regional Strategy and Operations Lead, Revenue Strategy and Operations

Google

Dublin, County Dublin, Ireland (On-Site)
2 Days ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Playtech - Integration Engineer

Playtech

Kyiv, Kyiv City, Ukraine (On-Site)
1 Month ago
Google - Software Engineer III, Transformative Compute SRE

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago
Epic Games - Senior DevOps Programmer

Epic Games

Montreal, Quebec, Canada (On-Site)
1 Month ago
Google - Senior Software Engineer, Diagnostics, Tools, Google Cloud

Google

Taipei City, Taiwan (On-Site)
2 Weeks ago
Ajmera Infotech - Senior Azure DevOps Engineer (IaaS)

Ajmera Infotech

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Luxoft - Murex Technical Developer - Lead

Luxoft

Toronto, Ontario, Canada (On-Site)
5 Months ago
Google - Site Reliability Manager, Platforms and Devices, SRE

Google

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
SmileGate - AI Cloud Infrastructure Engineer

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
1 Month ago
ByteDance - Cloud Solution Architect (Automotive Industry)

ByteDance

(On-Site)
1 Month ago
One of Us - Tools Developer

One of Us

London, England, United Kingdom (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug