AI Researcher (Voice)

1 Month ago • All levels • $160,000 PA - $250,000 PA
Research Development

Job Description

Tavus is a research lab building AI Humans, a new interface for human-machine interaction. These real-time human simulation models enable meaningful, face-to-face conversations, combining emotional intelligence with machine reliability. The company is Series A backed by Sequoia Capital, Y Combinator, and Scale Venture Partners. This Senior Researcher role involves leading research on generative video and audio models, collaborating with the Applied ML team for production, and staying current with AI advancements. The ideal candidate thrives in a fast-paced startup environment, prioritizes effectively, and takes calculated risks.
Good To Have:
  • Skills in 3D graphics
  • Gaussian splatting
  • Additional experience with generative models
  • PhD or equivalent experience
  • Experience leading research teams
  • Knowledge of best practices in Software Development
Must Have:
  • Lead research efforts on generative video and audio models (e.g., text-to-speech, speech-to-speech, audio-to-expression, and other speech and multimodal AI topics)
  • Work with the Applied ML team to help productionize our research
  • Stay current with the latest advancements (and help us create them!)
Perks:
  • flexible work schedule
  • unlimited PTO
  • competitive healthcare
  • gear stipends
  • opportunity to learn
  • directly drive impact

About Us

Tavus is a research lab pioneering human computing. We’re building AI Humans: a new interface that closes the gap between people and machines, free from the friction of today’s systems. Our real-time human simulation models let machines see, hear, respond, and even look real—enabling meaningful, face-to-face conversations. AI Humans combine the emotional intelligence of humans with the reach and reliability of machines, making them capable, trusted agents available 24/7, in every language, on our terms.

Imagine a therapist anyone can afford. A personal trainer that adapts to your schedule. A fleet of medical assistants that can give every patient the attention they need. With Tavus, individuals, enterprises, and developers can all build AI Humans to connect, understand, and act with empathy at scale.

We’re a Series A company backed by world-class investors including Sequoia Capital, Y Combinator, and Scale Venture Partners.

Be part of shaping a future where humans and machines truly understand each other.

The Role

We’re looking for a Senior Researcher to join our core AI team. Our ideal partner-in-crime works well in startup environments, is comfortable prioritizing for themselves, and is always down to take calculated risks. We’re moving fast, and we’re not looking for people to come along for the ride; we’re looking for people to pave the path.

Your Mission 🚀

  • Lead research efforts on generative video and audio models (e.g., text-to-speech, speech-to-speech, audio-to-expression, and other speech and multimodal AI topics)
  • Work with the Applied ML team to help productionize our research
  • Stay current with the latest advancements (and help us create them!)

Requirements

  • Have proven experience with flow matching, diffusion models, and autoregressive networks in the audio domain.
  • Have experience training deep learning models: from medium-sized to large models.
  • Have experience building streaming text-to-speech models or speech-to-speech models
  • Have strong foundations in audio modeling and demonstrated ability to innovate rapidly through prototyping.
  • Know state-of-the-art architectures in representation learning (audio or image domain) and face animation, in addition to having a deep understanding of the core areas of expertise above
  • Have excellent programming skills and be fluent in PyTorch
  • Show evidence of original research, with publications in top-tier or solid second-tier venues (e.g., CVPR, NeurIPS, BMVC or equivalent).
  • Be excited about building lifelike, expressive avatars for real-time applications.

Additionally, having some of the following experiences may help you be successful in this position:

  • Skills in 3D graphics, Gaussian splatting
  • Additional experience with generative models
  • PhD or equivalent experience preferred
  • Experience leading research teams
  • Knowledge of best practices in Software Development

Please note that this position is preferably hybrid in San Francisco, and we offer relocation. However, we are open to remote candidates as well.

Benefits & Culture

When you join Tavus, you’re joining a diverse and supportive team. Our work is driven by our people, and our success is shared by all. This position has a flexible work schedule, unlimited PTO, competitive healthcare, and gear stipends, as well as plenty of fun. At the end of the day, we want Tavus to be a place for you to learn, directly drive impact, and work with a team you love.

To learn more about our team culture and benefits, check out our hiring page.

Tavus is growing fast, and we’d like you to grow with us. If you’re excited to get your hands dirty and help make machines more human, drop your resume and we’ll be in touch.

We are not looking for cultural fits; we are looking for culture creators. Diversity is what drives our success – it’s at the core of how we hire, communicate, and work. We are inclusive to all and combine our diverse backgrounds, skill sets, and perspectives to build the best experiences for our clients.
