Software Engineer L4/L5, Training Platform, Machine Learning Platform

33 Minutes ago • All levels • Artificial Intelligence

About the job

Job Description

Netflix seeks a Software Engineer to join its Machine Learning Platform (MLP) team. Responsibilities include designing and building a platform for large-scale machine learning model training, fine-tuning, transformation, and evaluation. This role involves co-designing and optimizing systems for scalability and cost-effectiveness, creating user-friendly APIs, and ensuring engineering excellence through best practices in operations. Collaboration with ML engineers and cross-functional teams is essential. The ideal candidate possesses experience in ML engineering on production systems, large-scale infrastructure for ML, cloud computing (preferably AWS), and excellent communication skills.
Must have:
  • Experience in ML engineering on production systems
  • Building and operating large-scale ML infrastructure
  • Cloud computing experience (AWS preferred)
  • API design and development
  • Excellent communication skills
Good to have:
  • Familiarity with cloud-based AI/ML services
  • Large-scale distributed training expertise
  • Generative AI experience
  • Experience partnering with ML modeling engineers
Perks:
  • Comprehensive benefits including health plans, mental health support, 401k, stock options, disability programs, family-forming benefits, paid time off

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.

Machine Learning/Artificial Intelligence powers innovation in all areas of the business, from helping members choose the right title for them through personalization, to better understanding our audience and our content slate, to optimizing our payment processing and other revenue-focused initiatives. Building highly scalable and differentiated ML infrastructure is key to accelerating this innovation.

The Opportunity

We are looking for a driven Software Engineer to join the Training Platform team under our Machine Learning Platform (MLP) org. MLP’s charter is to maximize the business impact of all ML use cases at Netflix through highly reliable and flexible ML tooling and infrastructure that supports key product functions such as personalized recommendations, studio algorithms, virtual productions, growth intelligence, and content demand modeling among others.

In this role you will get to: 

  • Design and build the platform that powers large-scale machine learning model training, fine-tuning, model transformation, and evaluation workflows for use cases across the entire company

  • Co-design and optimize the systems and models to scale up and increase the cost-effectiveness of machine learning model training

  • Design easy-to-use APIs and interfaces that let experienced ML practitioners, as well as non-experts, easily access the training platform

Minimum Job Qualifications

  • Experience in ML engineering on production systems dealing with training or inference of deep learning models.

  • Proven track record of building and operating large-scale infrastructure for machine learning use cases

  • Experience with cloud computing providers, preferably AWS

  • Comfortable with ambiguity and working across multiple layers of the tech stack to execute on both 0-to-1 and 1-to-100 projects

  • Adopt and promote best practices in operations, including observability, logging, reporting, and on-call processes to ensure engineering excellence.

  • Excellent written and verbal communication skills

  • Comfortable working in a team with peers and partners distributed across (US) geographies & time zones.

Preferred Qualifications

  • Understanding of modern, real-world machine learning model development workflows, and experience partnering closely with ML modeling engineers

  • Familiarity with cloud-based AI/ML services (e.g., SageMaker, Bedrock, Databricks, OpenAI, etc.)

  • Experience with large-scale distributed training and different parallelism techniques for scaling up training, such as FSDP and tensor/pipeline parallelism

  • Expertise in Generative AI, specifically in training foundation models, fine-tuning them, and distilling them into smaller models

What do we offer?

Netflix's culture is an integral part of our success, and we approach diversity and inclusion seriously and thoughtfully. We are an equal opportunity employer and celebrate diversity, recognizing that bringing together different perspectives and backgrounds helps build stronger teams. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top-of-market compensation, we rely on market indicators and consider your specific job family, background, skills, and experience to determine your compensation in the market range. The range for this role is $100,000 - $720,000.

Netflix provides comprehensive benefits including Health Plans, Mental Health support, a 401(k) Retirement Plan with employer match, Stock Option Program, Disability Programs, Health Savings and Flexible Spending Accounts, Family-forming benefits, and Life and Serious Injury Benefits. We also offer paid leave of absence programs. Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off. Full-time salaried employees are immediately entitled to flexible time off. See more details about our Benefits.

Netflix has a unique culture and environment. Learn more.

Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.

We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.

Job is open for no less than 7 days and will be removed when the position is filled.

$100.0K - $720.0K/yr (Outscal est.)
$410.0K/yr avg.
California, United States


About The Company

Netflix is one of the world's leading entertainment services with over 247 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.
