This role is remote, so it can be executed globally. If you prefer, you can work from our offices in New York, London and Warsaw.
At ElevenLabs, we are pioneering voice technology with our cutting-edge research and products.
We launched in January 2023 and have since reached over 1 million users globally and have partnered with the world’s biggest names (see customer stories). We have closed our Series-B funding at 1.1B valuation earlier this year and are backed by the leading names in tech and AI (Nat Fridman, Daniel Gross, Andreessen Horowitz, Instagram co-founder Mike Krieger, Oculus VR co-founder Brendan Iribe, Deepmind & Inflection co-founder Mustafa Suleyman, and many others).
We are at an exciting phase of our growth and innovation and are looking for ambitious people to help us further push the boundaries of voice AI. This is a rare chance to be an early member of a company on the rise. If this excites you, we want to meet you!
A global team of passionate and innovative individuals united by curiosity and a shared goal: to be the first choice for AI audio solutions. Together, we are shaping a new technology and market from the ground up. We innovate quickly and take pride in getting things right, from the big picture initiatives to the details that keep us moving smoothly every day. We work with high autonomy and accountability where the best idea wins at any time and from anyone.
We are looking for an ML Researcher to join the research team at ElevenLabs. You will thrive in the role if you enjoy doing the following:
Creating and upholding a reliable and expandable data management system specialized for text-to-speech projects. This includes establishing guidelines for versioning and ensuring data quality.
Establishing a streamlined process for autonomously training, assessing, and launching text-to-speech models. This encompasses implementing procedures for dynamic learning, as well as routines for fine-tuning and refreshing validation data.
Investigating cutting-edge approaches and strategies in machine learning, deep learning, and algorithms pertaining to text-to-speech technology.
We're looking for exceptional individuals who combine technical excellence with ethical awareness, who are excited by hard problems and motivated by human impact. You’ll strive with us if you:
Are passionate about audio AI driven by a desire to make content universally accessible and breaking the frontiers of new tech.
Are a highly motivated and driven individual with a strong work ethic. Our team is aware of this critical moment of audio AI evolution and is committed to going the extra mile to lead.
Are analytical, efficient, and strive on solving complex challenges with a first principles mindset.
Consistently strive for excellence, delivering high-quality work quickly and exceeding expectations.
Take initiative and work autonomously from day one, prioritizing learning and contribution while leaving ego aside.
We do not require any formal certifications, or degrees. Instead, we are seeking enthusiastic software engineers who can showcase solving impressively hard problems with artifacts such as past projects, designs, or GitHub contributions. We do require:
3+ years industry experience as a Machine Learning Engineer, with a key emphasis on constructing data pipelines, as well as developing and implementing machine learning models.
Demonstrating the capacity to autonomously evaluate novel concepts or enhance current machine learning projects, with the potential outcome of contributing to published works.
Extensive background in conducting exploratory research to enhance the excellence of gathered data, particularly within the realm of audio and text-to-speech domains.
High-velocity innovation: Rapid experimentation, lean autonomous teams, and minimal bureaucracy.
A truly global team: Collaboration with teammates across 30+ countries, a global customer footprint and office hubs in New York, London and Warsaw. Annual company offsite for the whole team to get together (the last one in Croatia!)
Remote first: We prioritize your talent, not your location, with structured asynchronous workflows for maximum impact and minimal meetings.
Continuous growth: Collaborate with AI leaders, shape your path, and contribute where you excel most.
#LI-remote
Overview
ElevenLabs is an AI Audio research and deployment company.
Our research team develops AI Audio models that generate realistic, versatile and contextually-aware speech and sound effects. Our product team makes these models accessible for everyday users, prosumers, and businesses to create & localize content.
Our technology is used to voice audiobooks and news articles, animate video game characters, help in film pre-production, automate localization processes in entertainment, create dynamic audio content for social media and advertising, and train medical professionals. It has also given back voices to those who have lost them and helped individuals with accessibility needs in their daily lives.
For more information, visit www.elevenlabs.io