Research Intern - Speech and Audio Processing

53 Minutes ago • 1 Years + • Artificial Intelligence

About the job

Job Description

Microsoft's Azure AI team seeks Research Interns to contribute to cutting-edge research in speech and audio processing. Interns will collaborate with researchers and scientists on projects involving end-to-end speech recognition and translation, speech and audio generation, and multimodal work with LLMs. The internship involves prototyping, demonstrating, and potentially publishing findings within Microsoft's Azure AI and Azure Open AI services. Responsibilities include conducting both fundamental and applied research, collaborating with teams, and presenting findings. The internship offers a dynamic environment within a world-class research lab.
Must have:
  • PhD candidate in related field
  • 1+ years research experience
  • Experience in speech recognition/generation
  • Proficiency in deep learning/machine learning
  • Publication in top-tier conferences/journals
Good to have:
  • Experience with PyTorch/TensorFlow
  • Strong communication and writing skills
Perks:
  • Industry leading healthcare
  • Educational resources
  • Product and service discounts
  • Savings and investment programs
  • Maternity/paternity leave
  • Generous time off
  • Giving programs
  • Networking opportunities

Overview

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

The Azure AI team is on a mission to advance the state of the art in AI and deliver on our company’s vision for how intelligent cloud and intelligent edge will shape the next phase of innovation. The team includes top scientists and researchers from across Microsoft who are creating a center of excellence in speech, computer vision, and natural language.

 

Speech and audio AI technology is one of the key drivers for advancing natural user interfaces with natural spoken language. The Speech and Audio Group is on a mission to develop the core speech technologies that empower millions of users to achieve more. Our group brings together talent in the areas of signal processing, speech recognition, speech translation, and speech and audio generation to develop and deliver robust, natural, and scalable speech experience across a rich set of scenarios and languages.

 

We are seeking interns to contribute to pioneering research in speech and audio. This includes areas such as end-to-end speech recognition and translation, speech and audio generation, and multimodal with LLM. As a Research Intern, you will have the opportunity to conduct both fundamental and applied research in collaboration with our researchers and scientists.

 

This position offers the opportunity to prototype, demonstrate, and publish your findings within Microsoft’s privileged environment for Azure AI and Azure Open AI services. Most importantly, you will have the chance to push the boundaries of current technologies, making a significant impact on millions of users.

Qualifications

Required Qualifications

  • Current PhD candidate in speech recognition, speech translation, separation, and enhancement, speaker recognition and diarization, speech and audio generation, audio processing, deep learning, machine learning, AI or a related field.
  • At least 1 year of research experience in speech recognition, speech translation, speech and audio generation, speech LLM, deep learning, machine learning, AI or a related field.

Other Requirements

  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter. 

Preferred Qualifications

  • Experience with open-source tools such as PyTorch, Tensorflow, etc.
  • Publication(s) in top-tier conferences or journals in related fields (e.g., ICASSP, Interspeech, ASRU, SLT, IEEE/ACM Transactions on Audio, Speech and Language Processing, Speech Communication, Computer Speech and Language, etc.).
  • Proficient communication and writing skills.

The base pay range for this internship is USD $6,550 - $12,880 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,480 - $13,920 per month.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: 

Microsoft accepts applications and processes offers for these roles on an ongoing basis.

Responsibilities

Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
View Full Job Description
$78.6K - $154.6K/yr (Outscal est.)
$116.6K/yr avg.
Redmond, Washington, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Microsoft is a tech giant that develops, licenses, and supports a range of software products, services, and devices.

Redmond, Washington, United States (On-Site)

Redmond, Washington, United States (On-Site)

Barcelona, Catalonia, Spain (Hybrid)

Madrid, Community Of Madrid, Spain (Hybrid)

Redmond, Washington, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Microsoft

Similar Jobs

PlayStation Global - Senior Machine Learning Engineer, Anomaly Detection

PlayStation Global, United Kingdom (Hybrid)

Netflix - Machine Learning Intern - Spring or Summer 2025

Netflix, United States (On-Site)

Avantor - Data Scientist

Avantor, India (On-Site)

Salesforce - Principal Data Scientist

Salesforce, United States (On-Site)

Microsoft - Software Engineer

Microsoft, India (On-Site)

Rec Room - Machine Learning Engineer

Rec Room, United States (Remote)

Keywords Studios (Player Support) - Technical Research Associate - AI

Keywords Studios (Player Support), Poland (Hybrid)

Nextbrain - Computer Vision Engineer

Nextbrain, India (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Truecaller - Senior MLOps Engineer

Truecaller, Sweden (On-Site)

Unity - Principal Machine Learning Engineer

Unity, United States (On-Site)

Microsoft - Principal Software Engineer - AI Platform

Microsoft, United States (On-Site)

Arrise Solutions (India)   - Data Scientist - Recommender S/m's

Arrise Solutions (India) , India (On-Site)

NPS Prism - Data Science Entry Level Role

NPS Prism, India (On-Site)

Epic Games - Principal Research Scientist

Epic Games, United States (On-Site)

The Walt Disney Company - Principal Machine Learning Engineer, Research - Ad Platforms

The Walt Disney Company, United States (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Redmond, Washington, United States

Bit Reactor, LLC - SENIOR GRAPHICS ENGINEER

Bit Reactor, LLC, United States (On-Site)

Glean - Channel Manager, AMER - East+Canada

Glean, United States (On-Site)

Meta - 3D Artist, Horizon

Meta, United States (On-Site)

Microsoft - Senior Engineer Circuit Designer

Microsoft, United States (On-Site)

Fluence - Sr. Manager People Platforms

Fluence, United States (Hybrid)

Warner Bros Discovery - Director, Global Long Form Planning

Warner Bros Discovery, United States (On-Site)

PENN Interactive - Motion Designer

PENN Interactive, United States (On-Site)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Talentica Software - Data Scientist

Talentica Software, India (Remote)

Microsoft - Member of Technical Staff, AI - Multimodal

Microsoft, United States (On-Site)

Thatgamecompany - Machine Learning Engineer

Thatgamecompany, (Remote)

Sphere Entertainment Co - Director Production Technology Innovation

Sphere Entertainment Co, United States (On-Site)

Wargaming - AI Vendor Manager

Wargaming, Cyprus (On-Site)

Get notifed when new similar jobs are uploaded