Staff MLOps Engineer

2 Months ago • All levels • DevOps

About the job

Job Description

We are searching for a skilled Staff MLOps Engineer to build and maintain our ML infrastructure. You'll design robust MLOps platforms, manage monitoring solutions, and lead DevOps practices. Expertise in AWS, Kubernetes, and ML orchestration tools is essential.
Must have:
  • Strong Python Proficiency
  • AWS Cloud Engineering
  • Kubernetes Expertise
  • ML Orchestration Tools
Good to have:
  • Statically-typed Languages
  • Terraform Proficiency
  • Kafka Experience
  • NLP/LLMs Experience
Perks:
  • Hybrid Work Model
  • ML Engineering Team
We’re looking for a Staff MLOps Engineer to join our Machine Learning team. You’ll work closely with a team of engineers to create a platform on top of our data that will be leveraged by virtually every other product and system we have built or will build in the future. You’ll be responsible for building and maintaining the infrastructure and tooling that enables our ML Engineers and Data Scientists to focus on model development and feature engineering.

Key Responsibilities:

    • Design, implement, and maintain robust MLOps platforms and tooling for both batch and streaming ML pipelines.
    • Develop and manage monitoring and observability solutions for ML systems.
    • Lead DevOps practices, including CI/CD pipelines and Infrastructure as Code (IaC).
    • Architect and implement cloud-based solutions on AWS.
    • Collaborate with ML Engineers and Data Scientists to develop, train, and deploy machine learning models.
    • Engage in feature engineering and model optimization to improve ML system performance.
    • Participate in the full ML lifecycle, from data preparation to model deployment and monitoring.
    • Optimize and refactor existing systems for improved performance and reliability.
    • Drive technical initiatives and best practices in both MLOps and ML Engineering.

Required Skills and Experience:

    • Strong Python Proficiency: Excellent skills for developing, deploying, and maintaining our machine learning systems.
    • Language Versatility: Experience with statically-typed or JVM languages. Willingness to learn Scala is highly desirable.
    • Cloud Engineering Skills: Extensive experience with Cloud Platforms & Services, ideally AWS (e.g., Lambda, ECS, ECR, CloudWatch, MSK, SNS, SQS).
    • Infrastructure as Code: Proficiency in IaC, particularly Terraform.
    • Kubernetes Expertise: Strong hands-on experience with managing clusters and deploying services.
    • Data Orchestration: Experience with ML orchestration tools (e.g., Flyte, Airflow, Kubeflow, Luigi, or Prefect).
    • CI/CD: Expertise in CI/CD pipelines, especially with GitHub Actions and Jenkins.
    • Networking: Knowledge of networking concepts and their implementation.
    • Streaming: Experience with Kafka and other streaming technologies.
    • ML Monitoring: Familiarity with observability tools (e.g., Arize AI, Weights & Biases).
    • NLP/LLMs: Experience with NLP, LLMs, and RAG systems in production, or strong desire to learn.
    • CLI & Shell Scripting: Proficiency in scripting and command-line tools.
    • APIs: Experience with deploying and managing production APIs.
    • Software Engineering Best Practices: Knowledge of industry standards and practices.

About The Company

Bazaar Voice
