Senior Researcher: Artificial General Intelligence (Audio, Speech and Multimodal Processing)

3 Months ago • 4-8 Years • Artificial Intelligence • $129,600 PA - $219,600 PA

Job Summary

Job Description

Tencent seeks Senior Researchers in Artificial General Intelligence (AGI), focusing on audio, speech, and multimodal processing. Responsibilities include developing novel model architectures and algorithms for tasks like speech enhancement, recognition, synthesis, and music processing within unified multimodal foundation models. The role involves identifying research areas, setting long-term goals, designing experiments, writing code, analyzing results, and collaborating with engineers. Successful candidates will prioritize research applicable to Tencent's products, deploy promising ideas, and publish research findings. Strong communication and collaboration skills are essential, along with expertise in deep learning frameworks like PyTorch and experience with large-scale datasets.
Must have:
  • PhD in relevant field
  • Publications in top AI/speech conferences
  • Expertise in speech/audio processing
  • Deep learning model building/optimization
  • Strong communication skills
Good to have:
  • Familiarity with transformer models
  • Self-supervised learning experience
  • Experience with large datasets and big models
  • Multimodal foundation model experience
  • Model optimization for production
Perks:
  • Medical, dental, vision insurance
  • 401(k) plan
  • Paid vacation and holidays
  • Paid sick leave
  • Relocation package (potential)

Job Details

Business Unit

Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. TEG provides users with a full range of customer services. As the operator of the largest network, device, and data center infrastructure in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.

What the Role Entails

About the job
Tencent is seeking researchers in artificial general intelligence (AGI) with a focus on audio, speech and multimodal processing at the senior and principal levels to join our AI Lab in Seattle, Beijing, and Shenzhen. We are looking for recognized experts and thought leaders specializing in speech, audio and multimodal processing to tackle a variety of tasks, including (but not limited to) speech enhancement, speech recognition, audio/speech synthesis, speech codec, music processing, and spatial audio in unified multimodal foundation models. The ideal candidates are self-motivated and passionate about advancing the state of the art of AGI by developing novel model architectures and algorithms and solving real-world problems. The job level will be determined based on the experience and accomplishments of the candidate.

  • Work with other researchers to identify new and upcoming research areas, long-term ambitious research goals, and intermediate milestones by interacting with potential external and internal collaborators. Own long-term research strategy and plans to expand the impact of Tencent AI Lab.
  • Identify undefined problems in existing technology and develop theoretically sound novel models and algorithms to address them.
  • Design experiments, write reusable code, run evaluations, and analyze results.
  • Collaborate with other researchers and engineers across functional groups to push forward the state of the art of AGI.
  • Prioritize research that can be applied to Tencent's products. Deploy promising ideas quickly and broadly.
  • Author research papers to share research results and extend their impact across organizations and in the research community.
  • Share research trends and best practices in the community by reviewing academic papers, serving on program committees and grant panels, speaking at Tencent events or research conferences, or organizing research conferences and visioning activities.

Who We Look For

  • Currently has or is in the process of obtaining a PhD degree in AI, computer science, electrical engineering, math, physics, or related technical fields.
  • Proven record of influential publications in AI or speech, music and audio-specific conferences/journals (e.g., NeurIPS, ICML, IEEE Trans. ASLP, ICASSP, Interspeech, ISMIR, AES).
  • Expertise in speech, music and audio processing from both a signal processing and a machine learning standpoint, and the ability to integrate traditional signal processing techniques with deep learning models to advance current speech, music and audio systems.
  • Proficient in building and optimizing models for speech recognition, synthesis, enhancement, or other audio-related tasks.
  • Hands-on experience with deep learning frameworks such as PyTorch. Proven ability to design, train, and deploy deep learning models for speech, music and audio processing tasks, and to write efficient, reusable code for processing large volumes of high-dimensional audio data.
  • Strong communication skills for articulating research ideas, results, and the impact of innovations both within the organization and in the broader research community.
  • Holds work authorization in the country of employment at the time of hire and maintains it throughout employment.

Qualifications (Preferred):

  • Familiarity with state-of-the-art (SOTA) approaches in speech, music and audio processing, such as transformer-based models, self-supervised learning (SSL) for speech, or end-to-end speech recognition and text-to-speech systems.
  • Understanding of related fields such as acoustics, auditory perception, computer vision, natural language processing, or neuroscience as they apply to speech, music and audio processing. Ability to incorporate insights from these fields into the development of novel speech, music and audio technologies.
  • Experience working with large-scale speech, music, audio and video datasets and developing big models that scale across multiple GPUs or cloud-based systems.
  • Experience in multi-modal foundation models.
  • Experience in model optimization for deployment in production environments.
  • Experience in setting up and managing recordings using different types of microphone-equipped devices, and understanding their characteristics and how they affect the captured audio quality.

We are interested in both new graduates and those with post-PhD academic or industry experience. Priority will be given to candidates who have demonstrated the ability to develop original research agendas and perform hands-on research, and who work well in a collaborative and dynamic environment.

About Tencent

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life of people around the world.

Founded in 1998 with its headquarters in Shenzhen, China, Tencent's guiding principle is to use technology for good. Our communication and social services connect more than one billion people around the world, helping them to keep in touch with friends and family, access transportation, pay for daily necessities, and even be entertained.

Tencent publishes some of the world's most popular video games and other high-quality digital content, enriching interactive entertainment experiences for people around the globe.

Tencent also offers a range of services such as cloud computing, advertising, FinTech, and other enterprise services to support our clients' digital transformation and business growth.

Location State(s)

Washington

The base pay range for this position in the state(s) above is $129,600 to $219,600 per year. Actual pay is based on market location and may vary depending on job-related knowledge, skills, and experience. A sign-on payment, relocation package, and restricted stock units may be provided as part of the compensation package, as well as medical, financial, and/or other benefits, dependent on the specific position offered.

Employees (and their families) are covered by medical, dental, vision, and basic life insurance. Employees are also eligible to participate in the Company’s 401(k) plan, accrue 15 to 25 days of vacation leave per year, receive up to 10 paid holidays and 2 floating holidays per year, and accrue up to 10 days of paid sick leave per year. Your benefits eligibility requirement will be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may be pro-rated for those who start working during the calendar year.

About The Company

Tencent has been listed on the Stock Exchange of Hong Kong since 2004.
