Staff Software Engineer, Site Reliability Engineering

1 Month ago • 8 Years + • Devops • Backend Development

Job Summary

Job Description

The Staff Software Engineer, Site Reliability Engineering (SRE) at Google Cloud in Warsaw, Poland, ensures the reliability and uptime of Google Cloud services. Responsibilities include participating in the entire service lifecycle, from design and deployment to operation and refinement; supporting services before launch; maintaining live services by monitoring availability and system health; scaling systems through automation; practicing sustainable incident response; and contributing to a culture of intellectual curiosity and problem-solving. The role requires expertise in distributed systems, software development, and algorithm optimization. This position demands strong problem-solving skills and effective communication, managing the complexities of large-scale systems.
Must have:
  • Bachelor's degree in CS or related field
  • 8+ years experience with data structures/algorithms
  • 5+ years software development experience
  • 3+ years leading projects and troubleshooting distributed systems
  • Expertise in designing and troubleshooting large-scale distributed systems
Good to have:
  • Experience in computing, distributed systems, storage, or networking
  • Ability to debug, optimize code, and automate tasks
  • Excellent problem-solving and communication skills

Job Details

Minimum qualifications:

  • Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
  • 8 years of experience with data structures or algorithms.
  • 5 years of experience with software development in one or more programming languages.
  • 3 years of experience leading projects and designing, analyzing, and troubleshooting distributed systems.

Preferred qualifications:

  • Experience working in computing, distributed systems, storage, or networking.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Ability to debug, optimize code, and to automate routine tasks.
  • Excellent problem-solving, with effective verbal and written communication skills.

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance.

Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

Responsibilities

  • Engage in and improve the lifecycle of services from inception and design, through to deployment, operation and refinement.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Practice sustainable incident response and blameless postmortems.

Similar Jobs

Unity - Senior Data Scientist

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Voodoo - Head of Data

Voodoo

Paris, Île-de-France, France (Hybrid)
2 Months ago
NVIDIA - Senior GPU Architect

NVIDIA

Santa Clara, California, United States (On-Site)
4 Months ago
Trend Micro - (Sr.) Data Engineer/AI Trainer

Trend Micro

Taipei City, Taiwan (On-Site)
8 Months ago
Motive - Software Engineer - QA

Motive

(Remote)
1 Month ago
ARHS - Application Engineer/Administrator

ARHS

The Hague, South Holland, Netherlands (On-Site)
7 Months ago
Equivalent Jobs - MLOPS ENGINEER

Equivalent Jobs

(Remote)
7 Months ago
Google - Software Engineer III, Performance, Google Cloud

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Inkittt - Senior Machine Learning Engineer, Recommendations

Inkittt

San Francisco, California, United States (Hybrid)
4 Months ago
Varonis  - Technical Support Engineer L2

Varonis

New Delhi, Delhi, India (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Arrise Solutions (India)   - Lead ML Engineer

Arrise Solutions (India)

Hyderabad, Telangana, India (On-Site)
8 Months ago
Aisera Jobs - LLM/ML Engineer

Aisera Jobs

(Remote)
3 Years ago
Google - Software Engineer III, Android, Google Play Apps

Google

Bengaluru, Karnataka, India (On-Site)
1 Month ago
QuinStreet - Sr UI Developer

QuinStreet

(Remote)
1 Month ago
Nintendo - Senior Device Driver Software Engineer (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
1 Year ago
GameJobs - Tools Programmer

GameJobs

Amsterdam, North Holland, Netherlands (On-Site)
1 Year ago
Inkittt - Senior Front-End Engineer

Inkittt

Krakow Am See, Mecklenburg-Vorpommern, Germany (Hybrid)
2 Months ago
Google - Staff Software Engineer, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Suki - Senior Engineering Manager - Backend

Suki

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Inkittt - Senior Product Analyst

Inkittt

San Francisco, California, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Warsaw, Masovian Voivodeship, Poland

Techland - Senior Character Concept Artist

Techland

Wrocław, Lower Silesian Voivodeship, Poland (On-Site)
6 Months ago
SimCorp - Senior DevOps Engineer — Testing Services

SimCorp

Warsaw, Masovian Voivodeship, Poland (Hybrid)
1 Month ago
McDonald's Corporation - Manager, Tax Operations IOM BU

McDonald's Corporation

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
CD PROJEKT RED - Junior Motion Capture Specialist

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Months ago
Google - Senior Software Engineer, Infrastructure Storage, Google Cloud

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Techland - Senior Cinematic Artist

Techland

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Netflix - UI Engineer L5 - UI Component Library & Lifecycle

Netflix

Warsaw, Masovian Voivodeship, Poland (On-Site)
7 Months ago
Google - Software Engineer II, Cloud AI, Early Career

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
N-iX - Senior .NET Engineer

N-iX

Poland (Hybrid)
2 Months ago
Virtuos - Graphics Programmer

Virtuos

Poland (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

The Walt Disney Company - Senior Pipeline Engineer

The Walt Disney Company

Glendale, California, United States (On-Site)
2 Months ago
Anavation - Senior Cloud Developer

Anavation

Huntsville, Alabama, United States (Remote)
1 Month ago
Revolgy - Customer Support Engineer—AWS, Kubernetes (remote Europe)

Revolgy

United Kingdom (Remote)
2 Months ago
Next Level Business Services - Sr. Big Data Engineer in San Francisco, CA  / McLean, VA

Next Level Business Services

San Francisco, California, United States (On-Site)
7 Months ago
Larian Studios - Senior Automation Engineer

Larian Studios

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Months ago
ByteDance - Production System Engineer, Infrastructure Engineering Intern

ByteDance

Singapore (On-Site)
2 Months ago
Hashlist - Senior Data Engineer

Hashlist

Pune, Maharashtra, India (Hybrid)
7 Months ago
Salesforce - Distributed Systems Software Engineer - Public Cloud (Senior/Lead/Principal)

Salesforce

San Francisco, California, United States (On-Site)
8 Months ago
Probably Monsters - Build Engineer, Ecosystems (Core Technology)

Probably Monsters

Dallas, Texas, United States (On-Site)
4 Months ago
Microsoft - Software Engineer

Microsoft

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

San Diego, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Massachusetts, United States (Remote)

United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Chicago, Illinois, United States (Hybrid)

Bengaluru, Karnataka, India (On-Site)

Regensburg, Bavaria, Germany (Remote)

Lanham, Maryland, United States (On-Site)

Toronto, Ontario, Canada (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug