Site Reliability Engineer L5 - Open Connect

4 Months ago • All levels • DevOps • $100,000 PA - $720,000 PA

Job Summary

Job Description

As a Site Reliability Engineer L5 at Netflix's Open Connect, you'll design, scale, operate, automate, and analyze the globally distributed CDN, focusing on Edge Accelerator services. Responsibilities include improving resilience, security, observability, QoE, monitoring, and automation. You'll analyze massive datasets using Netflix's Big Data platform to optimize service delivery and system reliability. On-call rotation and handling production issues are also key aspects of this role. Experience with *nix, networking, data analysis, and large-scale service operations is essential, along with proficiency in programming languages like Go, C, or Python.
Must have:
  • CDN and HTTP cache/proxy expertise
  • Deep understanding of internet protocols
  • Building and maintaining highly distributed systems
  • Proficiency in Go, C, or Python
  • Experience with distributed analytics
  • Excellent communication skills

Job Details

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

How do you spark joy in hundreds of millions of people? It starts with a vision - that technology can give voice to stories around the world. In delivering those much-loved stories, Netflix is responsible for a significant portion of global internet traffic. To steward that responsibility, we work collaboratively with ISPs to deploy Open Connect, Netflix’s Content Delivery Network (CDN), our in-house custom-built network and server infrastructure responsible for delivering 100% of Netflix's video traffic.

In addition to streaming video delivery, Open Connect Appliances (OCAs) are ideally situated to also improve the latency between clients and the Netflix services running on AWS. The Open Connect Edge Accelerator is taking advantage of the highly geo-distributed nature of Open Connect to improve the quality of experience. It is the entry point for device and website traffic, putting it on the critical path to delivering and monitoring our product experiences. 

We are seeking a seasoned Reliability Engineer with extensive experience in *nix, networking, data analysis, and large-scale service operations to design, scale, operate, automate, and analyze our globally distributed CDN, with a focus on the Edge Accelerator services. You will work on reliability, resilience, performance, latency measurement, steering solutions, low-latency reverse proxy, failover mechanisms, protocol optimizations, and DDoS protection, to name a few.

Qualifications

  • Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies

  • Deep understanding of Internet protocols like TCP, TLS, HTTP/S, and DNS

  • Experience building and maintaining highly distributed, scalable, low-latency, fault-tolerant production systems with a focus on security and reliability 

  • Proficient in a programming language such as Go, C, or Python

  • Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc.)

  • Great communication and documentation skills targeted at cross-team collaboration

  • Motivated by “the art of possible” and able to balance idealism and pragmatism

  • Cool-headed during production issues, able to focus on problem resolution

  • Preferred - BS in Computer Science, Electrical Engineering, or Computer Engineering (or equivalent professional experience)

Responsibilities 

  • Drive continual improvement in resilience, security, observability, quality of experience (QoE), monitoring, instrumentation, and automation with the primary goal of maintaining highly scalable and reliable CDN services worldwide

  • Aggregate, analyze and correlate large amounts of server and application performance data. Use the innovative Netflix Big Data platform as a highly flexible, specialized, and efficient toolset for service delivery optimization and system reliability improvements

  • Participate in on-call rotation and handle escalations for service delivery production issues

  • Have lots of discussions about all the great content and your favorite movies and series 

Does this sound interesting? Or does it sound interesting but intimidating? Please don’t self-select; let’s figure it out together. Come join us and play a meaningful role in our journey to entertain the world! We’d love to talk to you!

Netflix is a global company with a diverse member base, which is why the content we produce reflects that: global perspectives and global stories. As we grow globally, we must have the most talented employees with diverse backgrounds, cultures, perspectives, and experiences to support our innovation and creativity. We are an equal opportunity employer and strive to build balanced teams from all walks of life.

Our culture is unique, and we tend to live by our values, so it’s worth learning more about Netflix.

At Netflix, we carefully consider a wide range of compensation factors to determine your personal top of market. We rely on market indicators to determine compensation and consider your specific job, skills, and experience to get it right. These considerations can cause your compensation to vary and will also be dependent on your location. The overall market range for roles in this area of Netflix is typically $100,000 - $720,000. This market range is based on total compensation (vs. only base salary), which is in line with our compensation philosophy.

Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.




