Site Reliability Engineer L5 - Open Connect

2 Months ago • All levels • DevOps • $100,000 PA - $720,000 PA

Job Summary

Job Description

As a Site Reliability Engineer L5 at Netflix's Open Connect, you'll design, scale, operate, automate, and analyze the globally distributed CDN, focusing on Edge Accelerator services. Responsibilities include improving resilience, security, observability, QoE, monitoring, and automation. You'll analyze massive datasets using Netflix's Big Data platform to optimize service delivery and system reliability. On-call rotation and handling production issues are also key aspects of this role. Experience with *nix, networking, data analysis, and large-scale service operations is essential, along with proficiency in programming languages like Go, C, or Python.
Must have:
  • CDN and HTTP cache/proxy expertise
  • Deep understanding of internet protocols
  • Building and maintaining highly distributed systems
  • Proficiency in Go, C, or Python
  • Experience with distributed analytics
  • Excellent communication skills

Job Details

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

How do you spark joy in hundreds of millions of people? It starts with a vision - that technology can give voice to stories around the world. In delivering those much-loved stories, Netflix is responsible for a significant portion of global internet traffic. To steward that responsibility, we work collaboratively with ISPs to deploy Open Connect, Netflix’s Content Delivery Network (CDN), our in-house custom-built network and server infrastructure responsible for delivering 100% of Netflix's video traffic.

In addition to streaming video delivery, Open Connect Appliances (OCAs) are ideally situated to also improve the latency between clients and the Netflix services running on AWS. The Open Connect Edge Accelerator is taking advantage of the highly geo-distributed nature of Open Connect to improve the quality of experience. It is the entry point for device and website traffic, putting it on the critical path to delivering and monitoring our product experiences. 

We are seeking a seasoned Reliability Engineer with extensive experience in *nix, networking, data analysis, and large-scale service operations to design, scale, operate, automate, and analyze our globally distributed CDN, with a focus on the Edge Accelerator services. You will be working on reliability, resilience, performance, latency measurement, steering solutions, low-latency reverse proxy, failover mechanisms, protocol optimizations, and DDoS protection, to name a few.

Qualifications

  • Knowledge of and proven experience with CDNs and HTTP cache/proxy technologies

  • Deep understanding of Internet protocols like TCP, TLS, HTTP/S, and DNS

  • Experience building and maintaining highly distributed, scalable, low-latency, fault-tolerant production systems with a focus on security and reliability 

  • Proficient in a programming language such as Go, C, or Python

  • Experience with distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)

  • Great communication and documentation skills targeted at cross-team collaboration

  • Motivated by “the art of possible” and able to balance idealism and pragmatism

  • Cool-headed during production issues, able to focus on problem resolution

  • Preferred - BS in Computer Science, Electrical Engineering, or Computer Engineering (or equivalent professional experience)

Responsibilities 

  • Drive continual improvement in resilience, security, observability, quality of experience (QoE), monitoring, instrumentation, and automation with the primary goal of maintaining highly scalable and reliable CDN services worldwide

  • Aggregate, analyze and correlate large amounts of server and application performance data. Use the innovative Netflix Big Data platform as a highly flexible, specialized, and efficient toolset for service delivery optimization and system reliability improvements

  • Participate in on-call rotation and handle escalations for service delivery production issues

  • Have lots of discussions about all the great content and your favorite movies and series 

Does this sound interesting? Or does it sound interesting but intimidating? Please don’t self-select; let’s figure it out together. Come join us and play a meaningful role in our journey to entertain the world! We’d love to talk to you!

Netflix is a global company with a diverse member base, which is why the content we produce reflects that: global perspectives and global stories. As we grow globally, we must have the most talented employees with diverse backgrounds, cultures, perspectives, and experiences to support our innovation and creativity. We are an equal opportunity employer and strive to build balanced teams from all walks of life.

Our culture is unique, and we tend to live by our values, so it’s worth learning more about Netflix.

At Netflix, we carefully consider a wide range of compensation factors to determine your personal top of market. We rely on market indicators to determine compensation and consider your specific job, skills, and experience to get it right. These considerations can cause your compensation to vary and will also be dependent on your location. The overall market range for roles in this area of Netflix is typically $100,000 - $720,000. This market range is based on total compensation (vs. only base salary), which is in line with our compensation philosophy.

Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.

About The Company

Netflix is one of the world's leading entertainment services with over 247 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.
