AI Researcher (Voice)

Tavus

| San Francisco, California, United States (Remote) | Full Time | 4 months ago

Apply Now

Job Summary

Tavus is a research lab building AI Humans, a new interface for human-machine interaction. These real-time human simulation models enable meaningful, face-to-face conversations, combining emotional intelligence with machine reliability. The company is Series A backed by Sequoia Capital, Y Combinator, and Scale Venture Partners. This Senior Researcher role involves leading research on generative video and audio models, collaborating with the Applied ML team for production, and staying current with AI advancements. The ideal candidate thrives in a fast-paced startup environment, prioritizes effectively, and takes calculated risks.

Must Have

Lead research efforts on generative video and audio models (ex: text-to-speech, speech-to-speech, audio-to-expression and other speech and multimodal AI topics)
Work with the Applied ML team to help productionize our research
Stay relevant with the latest advancements (and help us create the latest advancements!)

Good to Have

Skills in 3D graphics
Gaussian splatting
Other, additional experience with generative models
PhD or equivalent experience
Experience leading research teams
Knowledge of best practices in Software Development

Perks & Benefits

flexible work schedule
unlimited PTO
competitive healthcare
gear stipends
opportunity to learn
directly drive impact

Job Description

About Us

Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, free from the friction of today’s systems. Our real-time human simulation models let machines see, hear, respond, and even look real—enabling meaningful, face-to-face conversations. AI Humans combine the emotional intelligence of humans with the reach and reliability of machines, making them capable, trusted agents available 24/7, in every language, on our terms.

Imagine a therapist anyone can afford. A personal trainer that adapts to your schedule. A fleet of medical assistants that can give every patient the attention they need. With Tavus, individuals, enterprises, and developers can all build AI Humans to connect, understand, and act with empathy at scale.

We’re a Series A company backed by world-class investors including Sequoia Capital, Y Combinator, and Scale Venture Partners.

Be part of shaping a future where humans and machines truly understand each other.

The Role

We’re looking for a Senior Researcher to join our core AI team. Our ideal partner-in-crime works well in startup environments, is comfortable prioritizing for themselves, and is always down to take calculated risks. We’re moving fast and not looking for people to come along for the ride - we’re looking for people to pave the path.

Your Mission 🚀

Lead research efforts on generative video and audio models (ex: text-to-speech, speech-to-speech, audio-to-expression and other speech and multimodal AI topics)
Work with the Applied ML team to help productionize our research
Stay relevant with the latest advancements (and help us create the latest advancements!)

Requirements

Have proven experience with flow matching, diffusion models, auto regressive networks in the audio domain.
Have experience training deep learning models: from medium-sized to large models.
Have experience building streaming text-to-speech models or speech-to-speech models
Have strong foundations in audio modeling and demonstrated ability to innovate rapidly through prototyping.
Know state-of-the-art architectures in representation learning: audio or image domain, face animation (in addition to having a deep understanding of the direct field of expertise above)
Have excellent programming skills and be fluent in PyTorch
Show evidence of original research, with publications in top-tier or solid second-tier venues (e.g., CVPR, NeurIPS, BMVC or equivalent).
Be excited about building lifelike, expressive avatars for real-time applications.

Additionally, having some of the following experiences may help you be successful in this position:

Skills in 3D graphics, Gaussian splatting
Other, additional experience with generative models
PhD or equivalent experience preferred
Experience leading research teams
Knowledge of best practices in Software Development

Please note that this position is preferably hybrid in San Francisco and we offer relocation. However we are open to remote candidates as well.

Benefits & Culture

When you join Tavus, you’re joining a diverse and supportive team. Our work is driven by our people, and our success is shared by all. This position has a flexible work schedule, unlimited PTO, competitive healthcare, and gear stipends, as well as plenty of fun. At the end of the day, we want Tavus to be a place for you to learn, directly drive impact, and work with a team you love.

To learn more about our team culture and benefits, check out our hiring page.

Tavus is growing fast, and we’d like you to grow with us. If you’re excited to get your hands dirty and help make machines more human, drop your resume and we’ll be in touch.

We are not looking for cultural fits, we are looking for culture creators. Diversity is what drives our success – it’s at the core of how we hire, communicate, and work. We are inclusive to all and combine our diverse backgrounds, skill sets, and perspectives to build the best experiences for our clients.