SRE Manager

2 Weeks ago • All levels • DevOps

Job Summary

Job Description

Netflix's N-Tech Site Reliability Engineering (SRE) team seeks a highly experienced SRE Manager to lead a team of 4 engineers. Responsibilities include leading, mentoring, and developing the team; ensuring service reliability and efficiency; collaborating with cross-functional teams; improving incident management; and overseeing monitoring and alerting systems. The ideal candidate will have a strong technical background in cloud computing, automation, and monitoring, along with proven leadership experience in a fast-paced environment. The role requires excellent communication and problem-solving skills, and experience with tools like AWS, Kubernetes, and Terraform.
Must have:
  • Lead and mentor SRE team
  • Ensure service reliability
  • Collaborate with cross-functional teams
  • Improve incident management
  • Strong technical background
  • Excellent communication skills

Job Details

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

The N-Tech Site Reliability Engineering (SRE) team at Netflix ensures the reliability, scalability, and efficiency of our workforce-focused products and services. The SRE team provides services such as Incident Response, Reliability Engineering consulting, and limited embedded SRE engagements. 

We seek a highly experienced and motivated SRE Manager to lead a team of 4 Site Reliability Engineers. You will play a crucial role in maintaining the reliability and efficiency of our services, ensuring that our workforce-enabling products and services are reliable while coordinating with cross-functional teams across various geographical regions. You will have a proven track record of leading top-performing teams in complex, fast-paced environments and will excel in organizing and motivating a team amidst rapid growth and change.

This leadership role is rewarding for people who are passionate about growing talent, building a high-impact team, and leveraging engineering principles to improve reliability. As a key engineering leader in the Netflix Technology Services Organization, you'll contribute to cross-functional initiatives supporting engineering teams across Netflix. If this excites you, we invite you to bring your unique career and life experiences to enrich the culture and diversity of our team.  

RESPONSIBILITIES

  • You will lead, mentor, and develop a team of 4 SREs, fostering a culture of collaboration, innovation, and continuous improvement.

  • You will communicate effectively with stakeholders at all levels, providing updates on team performance, project status, and incident resolutions.

  • You will ensure an appropriate balance exists between incident management's reactive work and the proactive work of reducing future issues. 

  • You will develop and implement strategies to improve the reliability, performance, and scalability of the products and services supported by the SRE team.

  • You will collaborate with cross-functional teams (engineering, product, and operations) to drive critical projects and initiatives.

  • You will influence and improve our incident management lifecycle to identify, mitigate, and learn from reliability risks.

  • You will oversee the design, implementation, and maintenance of monitoring, alerting, and incident response systems.

  • You will ensure the team follows best practices in infrastructure as a code, continuous integration/deployment (CI/CD), and system automation.

  • You will cultivate and maintain high-trust relationships with internal and external partners.

  • You will advocate for the SRE team within the broader organization, representing their needs and concerns.

WE VALUE

  • Curiosity about how complex socio-technical systems successfully operate at scale when failure is inevitable

  • People who see influence as their preferred tool for cultivating relationships and helping the organization improve

  • Collaboration and continuous improvement are fundamental to growing the team’s impact over time

  • A desire to learn and readiness to mentor others both within and outside of the team

SKILLS AND EXPERIENCE

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience)

  • Proven success in leading high-performing SRE or DevOps teams in a large-scale, fast-paced environment

  • Outstanding communication and interpersonal skills, with the ability to build strong relationships with team members and stakeholders

  • Strong technical background with hands-on experience in cloud computing, system architecture, automation, and monitoring

  • Excellent problem-solving skills with a focus on root cause analysis and proactive improvements

  • Exceptional organizational skills, with the ability to manage multiple priorities and projects simultaneously

  • Experience with tools and technologies such as AWS, Kubernetes, Terraform, Prometheus, Grafana, Jenkins, and similar.

is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Similar Jobs

PlayerUnknown Productions - IT Manager (Part-Time)

PlayerUnknown Productions

Amsterdam, North Holland, Netherlands (Hybrid)
6 Months ago
luxsoft - Senior Developer/Devops

luxsoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
13 Hours ago
GoTo Group - Principal SRE Engineer (SE5)

GoTo Group

Gurugram, Haryana, India (On-Site)
6 Months ago
ION - Senior Java Developer - Italy

ION

Rome, Lazio, Italy (On-Site)
6 Months ago
ION - Lead Software Engineer, Italy

ION

Milan, Lombardy, Italy (On-Site)
6 Months ago
Google - Technical Solutions Engineer, Data, Google Cloud

Google

Seoul, South Korea (On-Site)
2 Days ago
PowerSchool - Sr Cloud Ops Eng I

PowerSchool

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Info Stretch - Lead Data Engineer

Info Stretch

Chennai, Tamil Nadu, India (On-Site)
6 Months ago
Google - Staff Software Engineer, Site Reliability Engineering

Google

Poland (On-Site)
2 Weeks ago
Rackspace Technology - AWS Migration Engineer

Rackspace Technology

India (Remote)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Netflix - Senior Software Engineer (L5) - Developer Infrastructure

Netflix

Los Gatos, California, United States (On-Site)
2 Weeks ago
Lighthouse Games - Senior SDET - C++

Lighthouse Games

Royal Leamington Spa, England, United Kingdom (Hybrid)
1 Month ago
Nagarro - Associate Staff Engineer, .Net Fullstack

Nagarro

Mexico (Remote)
6 Months ago
Telastra - Software Engineer II

Telastra

Bengaluru, Karnataka, India (On-Site)
1 Day ago
ION - Principal Software Engineer, Italy

ION

Milan, Lombardy, Italy (On-Site)
6 Months ago
The Walt Disney Company - Sr Software Engineer (Roku/BrightScript/SceneGraph)

The Walt Disney Company

Santa Monica, California, United States (On-Site)
5 Months ago
Epic Games - Senior Test Automation Programmer

Epic Games

Montreal, Quebec, Canada (On-Site)
2 Months ago
Aera Technology - Senior Performance Engineer

Aera Technology

Pune, Maharashtra, India (On-Site)
6 Months ago
Zenoti - Manager - DevOps

Zenoti

Hyderabad, Telangana, India (On-Site)
23 Hours ago
The Walt Disney Company - Lead Software Engineer (Roku Engineer)

The Walt Disney Company

Santa Monica, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Warsaw, Masovian Voivodeship, Poland

Eleven Labs - Design Engineer

Eleven Labs

Poland (Remote)
1 Month ago
PwC - Salesforce Architect

PwC

Warsaw, Masovian Voivodeship, Poland (Hybrid)
7 Months ago
SimCorp - Senior Customer Support Consultant

SimCorp

Warsaw, Masovian Voivodeship, Poland (Hybrid)
1 Day ago
Fool's Theory - Character Artist

Fool's Theory

Poland (Remote)
1 Week ago
Google - Senior Data Scientist, Intelligent Automation and Recommendation

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
Netflix - Full-Stack Engineer (L5)

Netflix

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Months ago
Donkey crew - Technical Artist

Donkey crew

Wrocław, Lower Silesian Voivodeship, Poland (Hybrid)
9 Hours ago
Google - Senior Software Engineer, Performance Infrastructure

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
Playtika - Full Stack Developer

Playtika

Poland (Hybrid)
2 Months ago
Adyen - Implementation Manager

Adyen

Warsaw, Masovian Voivodeship, Poland (On-Site)
8 Hours ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Trend Micro - Cloud Engineer (Golang/Python, Backend Focus) 雲端開發工程師

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago
Nagarro - Senior Engineer

Nagarro

Portugal (Remote)
6 Months ago
Playground Games - Build Engineer - Contract

Playground Games

England, United Kingdom (Hybrid)
1 Month ago
SmileGate - Group Purchasing System and Internal Web System Operation (Development)

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
1 Month ago
Fortis Games - Senior Cloud Security Engineer

Fortis Games

Romania (On-Site)
3 Months ago
Epic Games - Senior DevOps Programmer

Epic Games

Cary, North Carolina, United States (On-Site)
2 Months ago
Microsoft - Technical Support Engineer

Microsoft

Bengaluru, Karnataka, India (Hybrid)
1 Week ago
ByteDance - Software Engineer - Serverless Compute Infrastructure

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
Offworld - DevOps Engineer

Offworld

New Westminster, British Columbia, Canada (On-Site)
1 Month ago
Limit Break - Senior Site Reliability Engineer

Limit Break

Tokyo, Japan (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Netflix is one of the world's leading entertainment services with over 247 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

London, England, United Kingdom (On-Site)

Berlin, Berlin, Germany (On-Site)

Milan, Lombardy, Italy (On-Site)

Paris, Île-de-France, France (On-Site)

Seoul, South Korea (On-Site)

Los Angeles, California, United States (On-Site)

Los Gatos, California, United States (On-Site)

Pennsylvania, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Netflix

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug