Build and maintain MLOps platforms and tooling for both batch and streaming ML pipelines. Develop and manage monitoring and observability solutions for ML systems. Lead DevOps practices, including CI/CD pipelines and Infrastructure as Code (IaC). Architect and implement cloud-based solutions on AWS. Collaborate with ML Engineers and Data Scientists to develop, train, and deploy machine learning models.
Must have:
Strong Python Proficiency
Cloud Engineering Skills
Infrastructure as Code
Kubernetes Expertise
Good to have:
Language Versatility
Streaming
ML Monitoring
NLP/LLMs
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.
We’re looking for a Staff MLOps Engineer to join our Machine Learning team. You’ll work closely with a team of engineers to create a platform on top of that data that will be leveraged by virtually every other product and system we have built or will build in the future. You’ll be responsible for building and maintaining the infrastructure and tooling that enables our ML Engineers and Data Scientists to focus on model development and feature engineering.
Key Responsibilities:
Design, implement, and maintain robust MLOps platforms and tooling for both batch and streaming ML pipelines.
Develop and manage monitoring and observability solutions for ML systems.
Lead DevOps practices, including CI/CD pipelines and Infrastructure as Code (IaC).
Architect and implement cloud-based solutions on AWS.
Collaborate with ML Engineers and Data Scientists to develop, train, and deploy machine learning models.
Engage in feature engineering and model optimization to improve ML system performance.
Participate in the full ML lifecycle, from data preparation to model deployment and monitoring.
Optimize and refactor existing systems for improved performance and reliability.
Drive technical initiatives and best practices in both MLOps and ML Engineering.
Required Skills and Experience:
Strong Python Proficiency: Excellent skills for developing, deploying, and maintaining our machine learning systems.
Language Versatility: Experience with statically-typed or JVM languages. Willingness to learn Scala is highly desirable.