Software Engineer, Machine Learning

9 Hours ago • All levels • Research Development • $157,500 PA - $175,000 PA

Job Summary

Job Description

AssemblyAI is building at the forefront of Speech AI, creating powerful models for speech-to-text and speech understanding. They are seeking a Machine Learning Engineer to accelerate their AI research-to-production pipeline. This role involves building infrastructure for rapid model deployment, safe testing, and efficient, scalable production inference systems. The ideal candidate will have a strong backend engineering background in distributed systems and containerization, with a deep interest in optimizing the path from research innovation to production value. This is a cross-functional role requiring close collaboration with research and engineering teams. The company is a remote team funded by leading investors and is focused on becoming a great AI company.
Must have:
  • Strong backend engineering experience with Python
  • Experience building/operating distributed, containerized apps on AWS
  • Proficiency in observability solutions
  • Ability to design resilient, scalable architectures
Good to have:
  • MLOps experience with PyTorch and Kubernetes
  • Startup environment experience
  • Experience with globally distributed teams
  • Comfort across the ML lifecycle
  • Experience in audio-related domains
  • Experience with other cloud providers
  • Familiarity with Ray.io, Bazel, monorepos
  • Experience with alternative ML inference frameworks
  • Experience optimizing for low-latency inference

Job Details

About AssemblyAI

At AssemblyAI, we’re building at the forefront of Speech AI, creating powerful models for speech-to-text and speech understanding available through a straightforward API. With more than 200,000 developers building on our API and over 5,000 paying customers, AssemblyAI is helping unlock and support the next generation of powerful, meaningful products built with AI. 

Progress in AI is moving at an unprecedented pace– and our team is made up of experts in AI research that are focused on making sure that our customers are able to stay on the cutting edge, with production-ready AI models that are constantly updating and improving as our team continues to improve accuracy, latency, and what’s possible with Speech AI. Our models consistently rank highest in industry benchmarks for accuracy, outperforming models from Google and Amazon, and up to 30% fewer hallucinations than OpenAI’s Whisper. Our models power more than 2 billion end-user experiences each day, helping companies better understand customer feedback, run more productive meetings with automated meeting notes, and helping improve childhood literacy via ed tech tools. 

We’ve raised funding by leading investors including Accel, Insight Partners, Y Combinator’s AI Fund, Patrick and John Collision, Nat Friedman, and Daniel Gross. We’re a remote team looking to build one of the next great AI companies, and are looking for driven, talented people to help us get there!

About the role:

We're looking for a Machine Learning Engineer to accelerate our AI research-to-production pipeline. This person will build infrastructure enabling our research team to rapidly deploy and safely test new models while maintaining efficient, scalable production inference systems. This person should have a strong backend engineering background in distributed systems and containerization, and be deeply interested in optimizing the path from research innovation to production value. This is a cross-functional role that requires close collaboration with both research teams developing models and engineering teams supporting the broader platform.

What You’ll Do:

  • Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production 
  • Build and maintain high-performance, cost-efficient inference pipelines in production
  • Optimize infrastructure for both iteration speed and production reliability
  • Develop and maintain user-facing APIs that interact with our ML systems
  • Implement comprehensive observability solutions to monitor model performance and system health
  • Troubleshoot complex production issues across distributed systems
  • Continuously improve our MLOps practices to reduce friction between research and production

What You’ll Need:

  • Strong backend engineering experience with Python
  • Experience building and operating distributed, containerized applications, preferably on AWS 
  • Proficiency implementing observability solutions (monitoring, logging, alerting) for production systems
  • Ability to design and implement resilient, scalable architectures

An ideal candidate should also have some of the following:

  • MLOps experience, including familiarity with PyTorch and Kubernetes
  • Experience working in startup environments demonstrating ownership, decisiveness, and rapid iteration
  • Experience collaborating with remote, globally distributed teams
  • Comfort working across the entire ML lifecycle from model serving to API development
  • Experience in audio-related domains (ASR, TTS, or other domains involving audio processing)
  • Experience with other cloud providers
  • Familiarity with Ray.io, Bazel, and monorepos
  • Experience with alternative ML inference frameworks beyond PyTorch
  • Experience optimizing for low-latency, real-time inference

Pay Transparency:

AssemblyAI strives to recruit and retain exceptional talent from diverse backgrounds while ensuring pay equity for our team. Our salary ranges are based on paying competitively for our size, stage, and industry, and are one part of many compensation, benefit, and other reward opportunities we provide.

There are many factors that go into salary determinations, including relevant experience, skill level, qualifications assessed during the interview process, and maintaining internal equity with peers on the team. The range shared below is a general expectation for the function as posted, but we are also open to considering candidates who may be more or less experienced than outlined in the job description. In this case, we will communicate any updates in the expected salary range.

The provided range is the expected salary for candidates in the U.S. Outside of those regions, there may be a change in the range which will be communicated to candidates throughout the interview process.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in United States

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Research Development Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!