Software Engineer, Machine Learning

1 Month ago • All levels • $157,500 PA - $175,000 PA

Job Summary

Job Description

AssemblyAI builds Speech AI models for speech-to-text and speech understanding, available through an API. The Machine Learning Engineer will accelerate the AI research-to-production pipeline by building infrastructure for rapid model deployment and testing and by maintaining efficient production inference systems. The role calls for a backend engineering background in distributed systems and containerization, and for close collaboration with research and engineering teams. Responsibilities include developing tooling for researchers, building and maintaining high-performance inference pipelines, optimizing infrastructure, developing APIs, implementing observability solutions, troubleshooting production issues, and improving MLOps practices.
Must have:
  • Strong backend engineering experience with Python
  • Experience building and operating containerized applications
  • Proficiency implementing observability solutions for production systems
  • Ability to design and implement scalable architectures
Good to have:
  • MLOps experience with PyTorch and Kubernetes
  • Experience in startup environments
  • Experience with remote, globally distributed teams
  • Experience across the ML lifecycle
  • Experience in audio-related domains
  • Experience with alternative ML inference frameworks

Job Details

About AssemblyAI

At AssemblyAI, we’re building at the forefront of Speech AI, creating powerful models for speech-to-text and speech understanding available through a straightforward API. With more than 200,000 developers building on our API and over 5,000 paying customers, AssemblyAI is helping unlock and support the next generation of powerful, meaningful products built with AI. 

Progress in AI is moving at an unprecedented pace, and our team is made up of experts in AI research who are focused on keeping our customers on the cutting edge, with production-ready AI models that are constantly updated as our team continues to improve accuracy, latency, and what’s possible with Speech AI. Our models consistently rank highest in industry benchmarks for accuracy, outperforming models from Google and Amazon and producing up to 30% fewer hallucinations than OpenAI’s Whisper. Our models power more than 2 billion end-user experiences each day, helping companies better understand customer feedback, run more productive meetings with automated meeting notes, and improve childhood literacy via ed tech tools.

We’ve raised funding from leading investors including Accel, Insight Partners, Y Combinator’s AI Fund, Patrick and John Collison, Nat Friedman, and Daniel Gross. We’re a remote team looking to build one of the next great AI companies, and we are looking for driven, talented people to help us get there!

About the role:

We're looking for a Machine Learning Engineer to accelerate our AI research-to-production pipeline. This person will build infrastructure enabling our research team to rapidly deploy and safely test new models while maintaining efficient, scalable production inference systems. This person should have a strong backend engineering background in distributed systems and containerization, and be deeply interested in optimizing the path from research innovation to production value. This is a cross-functional role that requires close collaboration with both research teams developing models and engineering teams supporting the broader platform.

What You’ll Do:

  • Design and implement tooling that enables researchers to quickly deploy and evaluate new models in production 
  • Build and maintain high-performance, cost-efficient inference pipelines in production
  • Optimize infrastructure for both iteration speed and production reliability
  • Develop and maintain user-facing APIs that interact with our ML systems
  • Implement comprehensive observability solutions to monitor model performance and system health
  • Troubleshoot complex production issues across distributed systems
  • Continuously improve our MLOps practices to reduce friction between research and production

What You’ll Need:

  • Strong backend engineering experience with Python
  • Experience building and operating distributed, containerized applications, preferably on AWS 
  • Proficiency implementing observability solutions (monitoring, logging, alerting) for production systems
  • Ability to design and implement resilient, scalable architectures

An ideal candidate should also have some of the following:

  • MLOps experience, including familiarity with PyTorch and Kubernetes
  • Experience working in startup environments demonstrating ownership, decisiveness, and rapid iteration
  • Experience collaborating with remote, globally distributed teams
  • Comfort working across the entire ML lifecycle from model serving to API development
  • Experience in audio-related domains (ASR, TTS, or other domains involving audio processing)
  • Experience with other cloud providers
  • Familiarity with Ray.io, Bazel, and monorepos
  • Experience with alternative ML inference frameworks beyond PyTorch
  • Experience optimizing for low-latency, real-time inference

Pay Transparency:

AssemblyAI strives to recruit and retain exceptional talent from diverse backgrounds while ensuring pay equity for our team. Our salary ranges are based on paying competitively for our size, stage, and industry, and are one part of many compensation, benefit, and other reward opportunities we provide.

There are many factors that go into salary determinations, including relevant experience, skill level, qualifications assessed during the interview process, and maintaining internal equity with peers on the team. The range shared below is a general expectation for the function as posted, but we are also open to considering candidates who may be more or less experienced than outlined in the job description. In this case, we will communicate any updates in the expected salary range.

The provided range is the expected salary for candidates in the U.S. Outside the U.S., the range may differ, and any change will be communicated to candidates during the interview process.

Working at AssemblyAI

We are a small but mighty group of startup veterans and experienced AI researchers with over 20 years of expertise in Machine Learning, Speech Recognition, and NLP. As a fully remote team, we’re looking for people to join our team who are ambitious, curious, and lead with integrity. We’re still in the early days of AI and of AssemblyAI’s journey, and are looking for teammates who won’t just fit in, but will help us define and build our company culture. 

We’re committed to creating a space where our employees can bring their full selves to work and have equal opportunity to succeed. No matter your race, gender identity or expression, sexual orientation, religion, origin, ability, age, or veteran status, if joining this mission speaks to you, we encourage you to apply!

Keep Exploring AssemblyAI:

Check us out on YouTube!

Learn more about AI models for speech recognition

Core Transcription | Audio Intelligence | LeMUR | Try the Playground

Our $50M Series C fundraise
