Distributed Systems Engineer (L5) - Compute Runtime

6 Months ago • 4-8 Years • DevOps

Job Summary

Job Description

Netflix seeks a skilled Distributed Systems Engineer with experience in evolving large-scale infrastructure systems and container runtimes on Linux. Must have expertise in distributed systems, AWS, Linux application development, Go/Java/C/C++, containers & runtimes, and Linux performance debugging.
Must have:
  • Distributed Systems
  • AWS Experience
  • Linux Development
  • Container Runtimes
Good to have:
  • Container Performance
  • ML/AI Concepts
  • GPU Architecture
  • AMI Management
Perks:
  • Remote Work
  • Netflix Perks

Job Details

Netflix is one of the world’s leading entertainment services with 278 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

The Role

Netflix has been on the leading edge of cloud adoption since migrating to AWS 15 years ago and runs one of the largest Cloud footprints. The Cloud Engineering organization exists to manage that massive scale, constantly innovating to increase fleet-wide agility, efficiency, and reliability of the Netflix cloud infrastructure, while solving scale problems that we are the first to ever hit. We build, operate, and maintain Compute, Network, and Storage services so that developers at Netflix can rely on foundational building blocks when entertaining hundreds of millions of customers globally.

About the Team

The Compute Runtime team is responsible for the data plane runtime environment for our Kubernetes-based orchestrator, which handles millions of container launches per day.  We also provide the base OS and system services to hundreds of thousands of EC2 instances.  We thrive on solving complex problems and love sharing our learnings with our fellow engineers. Here is a short sample: “”, “” and “

About the Role

We are seeking a highly skilled and accomplished engineer with demonstrable experience in evolving large-scale infrastructure systems and container runtimes on Linux.  The ideal candidate will bring a combination of leading innovative solutions across functional teams and hands-on development experience in AWS/cloud, Linux user-space, networking,  GPUs, and Kubernetes.

Key Responsibilities

  • Technical Delivery: Use your expertise to significantly advance the state of Netflix’s compute offerings for our single and multi-tenant partners.  

  • Strategic Planning: Evolve our infrastructure to meet Netflix’s business objectives around Streaming, Live events, and Gaming.  

  • Project Management: Lead your own and cross-functional teams to deliver on highly ambiguous and open-ended projects enforcing each stage of the Software Development Lifecycle framework.

  • Operational Excellence: Contribute to the ever-improving operational standards of our large-scale global services by applying engineering best practices and providing first-class on-call support.

  • Performance: Identify and resolve performance bottlenecks in the Linux networking stack and resource isolation components to optimize network traffic and minimize noisy neighbor issues for containers.

  • System Integration: Integrate Linux OS changes with user-space applications and container runtime, ensuring seamless operation within the Netflix ecosystem.

  • Presentation: Deliver write-ups, blog posts, and presentations at conferences such as Linux Plumbers and eBPF Summit to represent our Netflix engineering teams.

You will excel in this role with…

  • 4+ years of experience evolving Compute infrastructure for an organization and 8+ years of software engineering experience.

  • Technical expertise in:

    • Distributed systems at scale, preferably on AWS

    • Linux application development and related package managers

    • Go, Java, or C/C++

    • Containers & runtimes-as-a-service

    • Linux performance debugging

    • Basic Networking concepts 

  • Demonstrable experience delivering multiple strategic and ambiguous projects at scale.

  • Leading and influencing teams of 10+ peer engineers.

  • Excellent presentation, communication, and collaboration skills.

We are even more excited about…

  • Container Performance and Container Stack Contributions 

  • Familiarity with ML/AI concepts

  • Knowledge of GPU architecture, CUDA, and workload optimizations

  • AMI Management

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity of thought and background builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.

Similar Jobs

PwC - Senior AI Developer - Roma [DIG]

PwC

Rome, Lazio, Italy (On-Site)
7 Months ago
Canva - Senior Engineering Manager (BE) - Visual Suite Platform - Remote across ANZ

Canva

Melbourne, Victoria, Australia (Remote)
5 Months ago
NVIDIA - Product Validation Tools Software Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
Rackspace Technology - Data Engineer III

Rackspace Technology

Vietnam (Remote)
2 Months ago
Dream Sports - Director System IT

Dream Sports

Mumbai, Maharashtra, India (On-Site)
4 Months ago
Rackspace Technology - OpenStack Cloud Engineer IV

Rackspace Technology

(Remote)
2 Months ago
Garena - Senior/Expert Site Reliability Engineer (SRE)

Garena

Singapore (On-Site)
3 Months ago
Ajmera Infotech - Senior Azure DevOps Engineer (IaaS)

Ajmera Infotech

Hyderabad, Telangana, India (On-Site)
1 Month ago
ARHS - DevSecOps Engineer (Automation Specialist)

ARHS

The Hague, South Holland, Netherlands (On-Site)
6 Months ago
Tencent - Technical Account Manager

Tencent

Tokyo, Japan (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

The Walt Disney Company - Senior Java Engineer

The Walt Disney Company

Connecticut, United States (On-Site)
1 Month ago
Nagarro - Senior Staff Engineer, Java

Nagarro

Japan (Remote)
6 Months ago
PwC - Experienced Associate - Forensics Services

PwC

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
7 Months ago
Actian - Zen Quality Assurance Engineer - Bangalore/Pune

Actian

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Highspot - Sr. Full Stack Engineer, Training & Coaching

Highspot

Hyderabad, Telangana, India (Hybrid)
6 Months ago
Scopely - Software Engineer

Scopely

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Canva - Backend Software Engineer (Java) - User Product

Canva

Sydney, New South Wales, Australia (Remote)
3 Months ago
Ness Digital - Senior BI Developer

Ness Digital

Iași, Iași County, Romania (Remote)
1 Month ago
Dream Sports - SDE 2 - ML & Data Platform

Dream Sports

Mumbai, Maharashtra, India (On-Site)
7 Months ago
ByteDance - Backend Software Engineer - Global E-Commerce Supply Chain Billing & Settlement

ByteDance

San Jose, California, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in United States

Interface AI - Sales Development Representative

Interface AI

United States (Remote)
2 Months ago
ByteDance - Linux Kernel Software Engineer

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Crunchyroll - Staff Software Engineer, Content Delivery

Crunchyroll

San Francisco, California, United States (Remote)
5 Months ago
DraftKings - Manager, Product Design

DraftKings

Las Vegas, Nevada, United States (On-Site)
2 Months ago
Rackspace Technology - Program Lead - AWS Strategic Collaboration

Rackspace Technology

United States (Remote)
1 Month ago
IGT - Field Service Technician II

IGT

Washington, United States (On-Site)
5 Months ago
Crunchyroll - Senior Software Engineer, Game Consoles

Crunchyroll

San Francisco, California, United States (On-Site)
3 Months ago
ByteDance - Car Source Management - DCar (Third-party Contractor)

ByteDance

Los Angeles, California, United States (On-Site)
6 Months ago
The Walt Disney Company - Senior Software Engineer - Scala

The Walt Disney Company

New York, New York, United States (On-Site)
2 Months ago
Netflix - Staff Design Program Manager - Live

Netflix

Los Angeles, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Ajmera Infotech - Senior Azure DevOps Engineer (IaaS)

Ajmera Infotech

Ahmedabad, Gujarat, India (On-Site)
1 Month ago
Sperasoft - Release Engineer

Sperasoft

Lesser Poland Voivodeship, Poland (Hybrid)
1 Month ago
The Walt Disney Company - Sr Systems Engineer

The Walt Disney Company

Celebration, Florida, United States (On-Site)
2 Months ago
PwC - ETIC, OCI Technical Support Engineer - Senior Associate

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
5 Months ago
SmileGate - Game Data Engineer

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
1 Month ago
ByteDance - Site Reliability Engineer (Cloud) - Infrastructure Engineering

ByteDance

Singapore (On-Site)
6 Months ago
Ajmera Infotech - Senior DevOps - Azure Infrastructure + DevOps

Ajmera Infotech

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Virtuos - Lead Software Engineer

Virtuos

Singapore (On-Site)
1 Month ago
N-iX - Senior Engineer with AWS Greengrass Expertise

N-iX

Ukraine (Remote)
2 Months ago
Nielsen Holdings - Senior Software Engineer (Java/Scala, Spark, Kubernetes, AWS)

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Netflix is one of the world's leading entertainment services with over 247 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

London, England, United Kingdom (On-Site)

Berlin, Berlin, Germany (On-Site)

Paris, Île-de-France, France (On-Site)

Seoul, South Korea (On-Site)

Los Angeles, California, United States (On-Site)

Los Gatos, California, United States (On-Site)

Pennsylvania, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Netflix

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug