Senior Systems Engineer, Cloud Filestore, Site Reliability Engineering

16 Hours ago • 5 Years + • DevOps • Backend Development

Job Summary

Job Description

This Senior Systems Engineer role within Google Cloud's Site Reliability Engineering (SRE) team focuses on Cloud Filestore. Responsibilities include improving the service lifecycle (design, deployment, operation, refinement), guiding team members on availability and performance, building automation to prevent problem recurrence, and leading incident response and postmortems. The role requires scaling systems sustainably and supporting services before they go live through system design consulting, software platform development, capacity planning, and launch reviews. The ideal candidate will have strong programming experience, a background in distributed systems, and a proven track record of project leadership.
Must have:
  • Bachelor's degree in CS or related field
  • 5+ years programming experience
  • 2+ years project leadership experience
  • Experience with distributed systems
  • Excellent problem-solving skills
Good to have:
  • Experience with Java, C, C++, Go, Python, Shell, Perl, JavaScript
  • Experience in computing, distributed systems, storage, or networking

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 5 years of experience with programming in one or more programming languages.
  • 2 years of experience leading projects.

Preferred qualifications:

  • Experience managing hosted services/SaaS, and with one or more of the following programming/scripting languages: Java, C, C++, Go, Python, shell, Perl, JavaScript.
  • Experience working in computing, distributed systems, storage, or networking.
  • Excellent ability in designing, analyzing, and troubleshooting distributed systems.
  • Excellent problem-solving approach, with effective communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services, both our internally critical and our externally visible systems, have reliability and uptime appropriate to customers' needs and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Improve the whole life-cycle of services from inception and design, through deployment, operation, and refinement.
  • Provide guidance to other team members on managing availability and performance of mission critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
  • Maintain services by measuring and monitoring availability, latency, and overall system health. Lead incident response and blameless postmortems.
  • Scale systems sustainably through mechanisms like automation and evolve systems by driving changes that improve reliability and velocity.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Dublin, County Dublin, Ireland (On-Site)

New York, New York, United States (On-Site)

Waterloo, Ontario, Canada (On-Site)

Taipei City, Taiwan (On-Site)

San Francisco, California, United States (On-Site)

Saint-Ghislain, Wallonia, Belgium (On-Site)

Bengaluru, Karnataka, India (On-Site)

Austin, Texas, United States (On-Site)
