Research Engineer, Multimodal Audio

3 Weeks ago • All levels • Audio Engineering • $225,000 PA - $400,000 PA

Job Summary

Job Description

As a Research Engineer on the Multimodal team, you will be a self-motivated, resourceful contributor with full-stack machine learning expertise. You will own the end-to-end development of new multimodal audio capabilities, covering data collection, training state-of-the-art models, building evaluations, optimizing inference algorithms, and refining systems based on user feedback. Responsibilities include determining training data needs, collecting data, writing distributed data gathering pipelines, developing new model architectures, creating new evaluations, writing fast inference algorithms, integrating feedback mechanisms, and working with large-scale multimodal datasets.
Must have:
  • PhD or equivalent research experience
  • Experience with large-scale audio data processing
  • Experience with transformer model training
  • Strong engineering skills in PyTorch
  • Track record of releases, publications, or open-source projects related to transformer model training
  • Deep understanding of the ML whole stack
  • Proven ability to own ML projects from start to finish
Perks:
  • Offers Equity

Job Details

About the role and team

As a Research Engineer on our Multimodal team, you’ll be a self-motivated, resourceful contributor with full-stack machine learning expertise—from data collection and training state-of-the-art models to building evaluations, optimizing inference algorithms, and refining systems based on user feedback.

In this role, you'll take end-to-end ownership of developing new multimodal Audio capabilities, requiring you to think holistically about every stage of the ML pipeline. You'll need to be comfortable working across the full stack, tackling challenges at any level—from designing data pipelines to fine-tuning models and deploying real-time inference. If you thrive in an environment where you can wear multiple hats and solve complex problems with a hands-on approach, we’d love to hear from you.

What you'll do

  • Determine the type of training data we need, finding where we can collect it, and writing distributed data gathering pipelines to ingest data.

  • Develop new model architectures that push the state-of-the-art in terms of quality, scale, and inference speed.

  • Create new evaluations that capture different aspects of generative outputs

  • Write fast inference algorithms to serve these models at scale.

  • Work with product teams to integrate feedback mechanisms into the product, which we use to improve the model.

  • Work with large scale multimodal datasets.

Who you are

  • "All Industry Levels": at least PhD (or equivalent) research experience.

  • Experiences in large scale audio data processing, and transformer model training.

  • Strong engineering skills in deep learning frameworks of PyTorch.

  • Track record of releases, publications, and/or open source projects related to transformer model training.

  • Have a deep understanding of the “whole stack” and track record of successfully owning projects from start to finish when it comes to designing, training, evaluating and deploying machine learning models, especially large language models.

About Character.AI

Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20 million people visit Character.AI every month, using our technology to supercharge their creativity and imagination. Our platform lets users engage with tens of millions of characters, enjoy unlimited conversations, and embark on infinite adventures.


In just two years, we achieved unicorn status and were honored as Google Play's AI App of the Year—a testament to our innovative technology and visionary approach.


Join us and be a part of establishing this new entertainment paradigm while shaping the future of Consumer AI!

At Character, we value diversity and welcome applicants from all backgrounds. As an equal opportunity employer, we firmly uphold a non-discrimination policy based on race, religion, national origin, gender, sexual orientation, age, veteran status, or disability. Your unique perspectives are vital to our success.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Redwood City, California, United States

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Audio Engineering Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Character is one of the world's leading personal AI platforms. Founded in 2021 by AI pioneers Noam Shazeer and Daniel De Freitas, Character is a full-stack AI company with a globally scaled direct-to-consumer platform. 

Redwood City, California, United States (Hybrid)

Redwood City, California, United States (On-Site)

Redwood City, California, United States (On-Site)

New York, New York, United States (On-Site)

San Francisco, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

Menlo Park, California, United States (Remote)

View All Jobs

Get notified when new jobs are added by Character.AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug