Senior Software Engineer (ML Operations)

2 Days ago • 5 Years + • Research Development

Job Summary

Job Description

WHOOP is seeking a Senior Software Engineer for their MLOps team to develop and optimize ML cloud infrastructure. The role involves building scalable systems for deploying machine learning models, automating CI/CD pipelines, and collaborating with Data Science and AI teams. Responsibilities include designing cloud infrastructure, developing APIs and microservices for real-time inference, optimizing model performance using AWS services like SageMaker and Lambda, and troubleshooting technical challenges. The engineer will also provide guidance on best practices for model deployment and stay updated on ML infrastructure advancements.
Must have:
  • Design, develop, and maintain cloud-based ML infrastructure.
  • Implement CI/CD pipelines for ML models.
  • Collaborate with Data Scientists and AI teams.
  • Develop APIs and microservices for real-time inference.
  • Optimize ML model deployment and performance using cloud services (AWS SageMaker, Lambda, ECS).
  • Monitor and optimize ML model performance in production.
  • Provide guidance on model deployment and infrastructure design.
  • Troubleshoot technical challenges related to model deployment.
  • Stay up-to-date with ML infrastructure advancements.
  • Bachelor’s Degree in Computer Science, Software Engineering, or equivalent experience.
  • 5+ years of experience in software engineering with a focus on ML infrastructure.
  • Deep expertise in AWS services (SageMaker, Lambda, ECS, S3, IAM).
  • Strong programming skills in Python or Java.
  • Proven experience in productionalizing ML models.
  • Expertise in designing scalable, resilient cloud architectures.
  • Strong understanding of microservices and distributed systems.
  • Excellent collaboration skills.
  • Experience working in Agile/Scrum environments.

Job Details

At WHOOP, we're on a mission to unlock human performance and healthspan. WHOOP empowers members to perform at a higher level through a deeper understanding of their bodies and daily lives.

We are looking for a highly skilled Senior Software Engineer to join our MLOps team, focusing on the development and optimization of ML cloud infrastructure. In this role, you will play a critical part in supporting our Data Science and AI teams by building robust, scalable systems for the productionalization of machine learning models. Your work will be at the heart of bringing advanced AI solutions into production, ensuring they are reliable, scalable, and ready to drive value across WHOOP.

RESPONSIBILITIES:

    • Design, develop, and maintain cloud-based infrastructure to support the deployment and scaling of machine learning models. Implement automated pipelines for continuous integration and continuous deployment (CI/CD) of ML models, ensuring seamless transitions from development to production environments.
    • Collaborate closely with Data Scientists and AI teams to understand model requirements and facilitate the transition from prototype to production. 
    • Develop APIs, microservices, and other components necessary to integrate ML models into existing systems, enabling real-time inference and decision-making.
    • Leverage cloud services to optimize the deployment and performance of machine learning models and associated infrastructure. Utilize services such as AWS SageMaker, Lambda, and ECS to build scalable, cost-effective solutions that support real-time ML/AI workloads.
    • Monitor and optimize the performance of ML models in production, addressing issues related to latency, scalability, and resource utilization.
    • Act as a key technical partner to Data Scientists, providing guidance on best practices for model deployment, versioning, and infrastructure design.
    • Support AI teams by troubleshooting and resolving technical challenges related to model deployment and performance in production.
    • Stay up-to-date with the latest advancements in ML infrastructure, cloud computing, and AI deployment strategies. Proactively suggest and implement improvements to enhance the efficiency, reliability, and scalability of ML operations within the organization.

QUALIFICATIONS:

    • Bachelor’s Degree: A degree in Computer Science, Software Engineering, or a related field; or equivalent practical experience.
    • 5+ years of experience in software engineering, with a significant focus on building and maintaining ML infrastructure in cloud environments.
    • Deep expertise in AWS services, including but not limited to SageMaker, Lambda, ECS, S3, and IAM, with the ability to design and optimize cloud-based ML infrastructure.
    • Strong programming skills in languages such as Python or Java, with a focus on building robust, maintainable code.
    • Proven experience in productionalizing ML models, including building APIs and services that enable real-time inference.
    • Expertise in designing scalable, resilient cloud architectures that support large-scale ML operations.Strong understanding of microservices, distributed systems, and the challenges of deploying and maintaining ML models in production environments.
    • Excellent collaboration skills, with the ability to work closely with Data Scientists, AI and Software teams, and other cross-functional stakeholders.
    • Agile Methodologies: Experience working in Agile/Scrum environments, with a focus on rapid iteration and continuous improvement.
This role is based in the WHOOP office located in Boston, MA. The successful candidate must be prepared to relocate if necessary to work out of the Boston, MA office. 

Interested in the role, but don’t meet every qualification? We encourage you to still apply! At WHOOP, we believe there is much more to a candidate than what is written on paper, and we value character as much as experience. As we continue to build a diverse and inclusive environment, we encourage anyone who is interested in this role to apply.

WHOOP is an Equal Opportunity Employer and participates in E-verify to determine employment eligibility.  It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Boston, Massachusetts, United States

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Research Development Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

View All Jobs

Get notified when new jobs are added by whoop

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug