Cloud MLOps Engineer – AWS Focus

1 Month ago • 3 Years +

Job Summary

Job Description

The Cloud MLOps Engineer builds, deploys, and scales machine learning models in a secure AWS production environment. The role develops and manages end-to-end MLOps infrastructure so that models move from research into reliable, scalable production systems. Key responsibilities include architecting and maintaining automated machine learning deployment pipelines in AWS, collaborating with Data Scientists and ML Engineers to productionize models, implementing continuous training and CI/CD practices, monitoring model performance, and building scalable, secure, and fault-tolerant infrastructure using services like AWS SageMaker, EKS, Lambda, S3, EC2, CloudFormation, and/or Terraform. The engineer serves as a bridge between DevOps, Data Science, and Cloud Engineering teams, ensuring operational excellence.
Must have:
  • Experience in DevOps, Cloud Engineering, or MLOps roles.
  • Experience supporting machine learning production environments.
  • Hands-on experience with AWS cloud services (SageMaker, EKS, S3, IAM, CloudWatch).
  • Strong proficiency with Python and scripting for automation (Bash, etc.).
  • Experience with containerization (Docker) and orchestration (Kubernetes).
  • Expertise in building CI/CD pipelines for ML models.
  • Familiarity with ML lifecycle management tools (MLflow, SageMaker Pipelines, Kubeflow).
  • Deep understanding of model monitoring, drift detection, and retraining workflows.
  • Appreciation for security, reliability, and scalability in production systems.
Good to have:
  • Experience with Infrastructure as Code (Terraform, AWS CloudFormation).
  • Knowledge of data versioning tools (e.g., DVC).
  • Experience with event-driven architectures using AWS Lambda and SQS.
  • Familiarity with monitoring stacks (Prometheus, Grafana, CloudWatch Insights).

Job Details

At Zelis, we Get Stuff Done. So, let’s get to it! 

  

A Little About Us 

Zelis is modernizing the healthcare financial experience for all by providing a connected platform that bridges the gaps and aligns interests across payers, providers, and healthcare consumers. This platform serves more than 750 payers, including the top 5 national health plans, BCBS insurers, regional health plans, TPAs and self-insured employers, and millions of healthcare providers and consumers. Zelis sees across the system to identify, optimize, and solve problems holistically with technology built by healthcare experts—driving real, measurable results for clients. 

  

A Little About You 

You bring a unique blend of personality and professional expertise to your work, inspiring others with your passion and dedication. Your career is a testament to your diverse experiences, community involvement, and the valuable lessons you've learned along the way. You are more than just your resume; you are a reflection of your achievements, the knowledge you've gained, and the personal interests that shape who you are.

Position Overview

We are seeking a Cloud MLOps Engineer to build, deploy, and scale machine learning models in a robust, secure AWS production environment. This role will focus on developing and managing end-to-end MLOps infrastructure, enabling our Data Science and AI teams to move models seamlessly from research into reliable, scalable production systems. You will blend DevOps best practices with ML lifecycle management and serve as a key partner to our Data Scientists, Machine Learning Engineers, and Cloud Infrastructure teams.

If you have a passion for operationalizing AI, driving model deployment automation, and scaling production-grade ML systems — this is the opportunity for you.

What You’ll Do

Key Responsibilities:

  • Architect, build, and maintain automated machine learning deployment pipelines in AWS.

  • Collaborate with Data Scientists and ML Engineers to productionize models and manage the full ML model lifecycle (build, deploy, monitor, retrain).

  • Implement continuous training (CT) and continuous integration/continuous deployment (CI/CD) practices for machine learning.

  • Monitor model performance, detect drift, and automate alerts and retraining workflows.

  • Build scalable, secure, and fault-tolerant infrastructure using services like AWS SageMaker, EKS, Lambda, S3, EC2, CloudFormation, and/or Terraform.

  • Develop and maintain model versioning, governance, and auditing processes.

  • Implement best practices in monitoring, logging, and security for machine learning applications.

  • Serve as a bridge between DevOps, Data Science, and Cloud Engineering teams, ensuring alignment and operational excellence.

What You’ll Bring to Zelis

Required Skills and Experience:

  • 3+ years of experience working in DevOps, Cloud Engineering, or MLOps roles.

  • 2+ years specifically supporting machine learning production environments.

  • Hands-on experience with AWS cloud services — particularly SageMaker, EKS, S3, IAM, CloudWatch.

  • Strong proficiency with Python and scripting for automation (Bash, etc.).

  • Experience with containerization (Docker) and orchestration (Kubernetes).

  • Expertise in building CI/CD pipelines for ML models using tools like GitHub Actions, CodePipeline, Jenkins, or similar.

  • Familiarity with ML lifecycle management tools such as MLflow, SageMaker Pipelines, Kubeflow, or equivalent.

  • Deep understanding of model monitoring, model drift detection, and retraining workflows.

  • Strong appreciation for security, reliability, and scalability in production systems.

Preferred Qualifications:

  • Experience with Infrastructure as Code (Terraform, AWS CloudFormation).

  • Knowledge of data versioning tools (e.g., DVC).

  • Experience with event-driven architectures using AWS Lambda and SQS.

  • Familiarity with monitoring stacks (Prometheus, Grafana, CloudWatch Insights).

What This Role Is Not:

  • Not a traditional DevOps engineer role focused purely on application development pipelines or server maintenance.

  • Not a Data Scientist or ML research role; this is about operationalizing models — not building or training them.

  • Not an AWS SysAdmin or "cloud generalist" position; hands-on experience specifically supporting machine learning deployments is required.

  • Not an entry-level cloud role; we require experience with production systems supporting AI/ML workflows.

If you're passionate about scaling machine learning systems and building world-class MLOps capabilities, we would love to hear from you.

Location and Workplace Flexibility

We have offices in Atlanta GA, Boston MA, Morristown NJ, Plano TX, St. Louis MO, St. Petersburg FL, and Hyderabad, India. We foster a hybrid and remote-friendly culture, and all our employees' work locations are based on the needs of the position and determined by the Leadership team. In-office work and activities, if applicable, vary based on the work and team objectives in accordance with Company policies.

  

Equal Employment Opportunity  
Zelis is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. 
 
We welcome applicants from all backgrounds and encourage you to apply even if you don’t meet 100% of the qualifications for the role. We believe in the value of diverse perspectives and experiences and are committed to building an inclusive workplace for all. 

 

Accessibility Support 
We are dedicated to ensuring our application process is accessible to all candidates. If you are a qualified individual with a disability or a disabled veteran and require a reasonable accommodation with any part of the application and/or interview process, please email TalentAcquisition@zelis.com. 

  

Disclaimer 

We are an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law. 

The above statements are intended to describe the general nature and level of work being performed by people assigned to this classification. They are not to be construed as an exhaustive list of all responsibilities, duties, and skills required of personnel so classified. All personnel may be required to perform duties outside of their normal responsibilities, duties, and skills from time to time. 
