Scale partners with the world's leading AI labs to accelerate progress in frontier generative AI. We are building a dedicated research team focused on pushing the boundaries of speech generation, speech recognition, and speech-to-speech transformation. We are hiring Research Scientists and Research Engineers with deep expertise across text-to-speech (TTS), speech-to-speech (STS), and automatic speech recognition (ASR) to help define the next era of human-AI communication.
In this role, you will invent and deploy new algorithms that expand the capabilities, fidelity, and generalization of large-scale audio models. You will shape research directions at the intersection of modeling, data, and evaluation for speech systems. You will collaborate with world-class researchers and partner with top foundation model labs, contributing technical and strategic insight to the development of the next generation of open and proprietary foundation models in audio. We encourage collaboration across industry and academia and support the publication of research findings.
You will:
We’re looking for: