Software Engineer, Machine Learning

3 Months ago • All levels • $157,500 PA - $175,000 PA

Job Summary

Job Description

AssemblyAI is building Speech AI models for speech-to-text and speech understanding through an API. The Machine Learning Engineer will accelerate the AI research-to-production pipeline, build infrastructure for rapid model deployment and testing, maintain efficient production inference systems, and optimize the path from research to production. This role requires collaboration with research and engineering teams, focusing on backend engineering, distributed systems, and containerization, to develop and implement tooling for researchers, build and maintain high-performance inference pipelines, optimize infrastructure, develop APIs, and implement observability solutions. The role involves troubleshooting production issues and improving MLOps practices.

Must have:

Strong backend engineering experience with Python
Experience building and operating containerized applications
Implement observability solutions for production systems
Design and implement scalable architectures

Good to have:

MLOps experience with PyTorch and Kubernetes
Experience in startup environments
Experience with remote, globally distributed teams
Experience across the ML lifecycle
Experience in audio-related domains
Experience with alternative ML inference frameworks

8 skills required

8 skills required for this role

Add these skills to join the top 1% applicants for this job

kubernetes

aws

python

pytorch

model-serving

innovation

cross-functional

game-texts

Job Details

About AssemblyAI

At AssemblyAI, we’re building at the forefront of Speech AI, creating powerful models for speech-to-text and speech understanding available through a straightforward API. With more than 200,000 developers building on our API and over 5,000 paying customers, AssemblyAI is helping unlock and support the next generation of powerful, meaningful products built with AI.

Progress in AI is moving at an unprecedented pace– and our team is made up of experts in AI research that are focused on making sure that our customers are able to stay on the cutting edge, with production-ready AI models that are constantly updating and improving as our team continues to improve accuracy, latency, and what’s possible with Speech AI. Our models consistently rank highest in industry benchmarks for accuracy, outperforming models from Google and Amazon, and up to 30% fewer hallucinations than OpenAI’s Whisper. Our models power more than 2 billion end-user experiences each day, helping companies better understand customer feedback, run more productive meetings with automated meeting notes, and helping improve childhood literacy via ed tech tools.

We’ve raised funding by leading investors including Accel, Insight Partners, Y Combinator’s AI Fund, Patrick and John Collision, Nat Friedman, and Daniel Gross. We’re a remote team looking to build one of the next great AI companies, and are looking for driven, talented people to help us get there!

About the role:

We're looking for a Machine Learning Engineer to accelerate our AI research-to-production pipeline. This person will build infrastructure enabling our research team to rapidly deploy and safely test new models while maintaining efficient, scalable production inference systems. This person should have a strong backend engineering background in distributed systems and containerization, and be deeply interested in optimizing the path from research innovation to production value. This is a cross-functional role that requires close collaboration with both research teams developing models and engineering teams supporting the broader platform.

What You’ll Do:

Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production
Build and maintain high-performance, cost-efficient inference pipelines in production
Optimize infrastructure for both iteration speed and production reliability
Develop and maintain user-facing APIs that interact with our ML systems
Implement comprehensive observability solutions to monitor model performance and system health
Troubleshoot complex production issues across distributed systems
Continuously improve our MLOps practices to reduce friction between research and production

What You’ll Need:

Strong backend engineering experience with Python
Experience building and operating distributed, containerized applications, preferably on AWS
Proficiency implementing observability solutions (monitoring, logging, alerting) for production systems
Ability to design and implement resilient, scalable architectures

An ideal candidate should also have some of the following:

MLOps experience, including familiarity with PyTorch and Kubernetes
Experience working in startup environments demonstrating ownership, decisiveness, and rapid iteration
Experience collaborating with remote, globally distributed teams
Comfort working across the entire ML lifecycle from model serving to API development
Experience in audio-related domains (ASR, TTS, or other domains involving audio processing)
Experience with other cloud providers
Familiarity with Ray.io, Bazel, and monorepos
Experience with alternative ML inference frameworks beyond PyTorch
Experience optimizing for low-latency, real-time inference

Pay Transparency:

AssemblyAI strives to recruit and retain exceptional talent from diverse backgrounds while ensuring pay equity for our team. Our salary ranges are based on paying competitively for our size, stage, and industry, and are one part of many compensation, benefit, and other reward opportunities we provide.

There are many factors that go into salary determinations, including relevant experience, skill level, qualifications assessed during the interview process, and maintaining internal equity with peers on the team. The range shared below is a general expectation for the function as posted, but we are also open to considering candidates who may be more or less experienced than outlined in the job description. In this case, we will communicate any updates in the expected salary range.

The provided range is the expected salary for candidates in the U.S. Outside of those regions, there may be a change in the range which will be communicated to candidates throughout the interview process.

Working at AssemblyAI

We are a small but mighty group of startup veterans and experienced AI researchers with over 20 years of expertise in Machine Learning, Speech Recognition, and NLP. As a fully remote team, we’re looking for people to join our team who are ambitious, curious, and lead with integrity. We’re still in the early days of AI and of AssemblyAI’s journey, and are looking for teammates who won’t just fit in, but will help us define and build our company culture.

We’re committed to creating a space where our employees can bring their full selves to work and have equal opportunity to succeed. No matter your race, gender identity or expression, sexual orientation, religion, origin, ability, age, veteran status, if joining this mission speaks to you, we encourage you to apply!

Keep Exploring AssemblyAI:

Check us out on YouTube!

Learn more about AI models for speech recognition

Core Transcription | Audio Intelligence | LeMUR | Try the Playground

Our $50M Series C fundraise