Research Internship – Reinforcement Learning for Large Foundation Models

Tencent

| Bellevue, WA, USA (On Site) | Full Time | 2 months ago

Apply Now

Job Summary

Tencent AI Lab at Seattle Area is seeking Research Interns for 2026, focusing on Reinforcement Learning for Large Foundation Models. The role involves developing stable and efficient RL algorithms to enhance large foundation models in complex reasoning, agent tasks, autonomous exploration, and continuous learning. Interns will work on core problems in RL algorithms, reward modeling, and world models, conducting large-scale experiments and publishing research papers.

Must Have

Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university
Self-motivated and excited about developing novel techniques
Research experiences in natural language processing or machine learning
Proficient in Python programming
Experienced in developing with deep learning frameworks such as PyTorch
Good publication track records and history of creativity and intellectual flexibility
Excellent communication and teamwork skills
Capable of collaborating with cross-functional teams to drive project success and innovation
Intern duration: 3 months (with the possibility of extension)
Can start any time in the year 2026

Perks & Benefits

1 hour of paid sick leave for every 30 hours worked
Up to 13 paid holidays throughout the calendar year
Eligible to enroll in the Company-sponsored medical plan (for full-time interns)

Job Description

Business Unit

What the Role Entails

About Tencent AI Lab at Seattle Area

Tencent is a leading internet company in China. Tencent AI Lab at Seattle Area was established in May 2017. The lab strives to continuously improve AI's capability in perception, cognition, and creativity. Researchers there aim at solving challenging real-world problems with advanced technologies and publish extensively at top conferences and journals.

Research Internship – Reinforcement Learning for Large Foundation Models

Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The lab's long-term ambition is to drive the development of Artificial General Intelligence (AGI), and ultimately, Artificial Superintelligence (ASI). We are currently seeking research interns for the year of 2026, in the area of reinforcement learning (RL) for large foundation models, with an emphasis on developing stable and efficient RL algorithms. The goal is to empower large foundation models in complex reasoning ang agent tasks and enhance their capabilities in autonomous exploration and continuous learning. Our Seattle area office is located in Bellevue WA.

Every research intern will work with researchers on a research project aimed at attacking one of the core problems on the design and optimization of RL algorithms for large foundation models. Research areas include but are not limited to Reinforcement Learning Algorithms, Reward Modeling, and World Models. We will conduct large-scale experiments of RL algorithms in scenarios such as complex reasoning and autonomous agents, deliver impactful algorithms for real world applications, and publish influential research papers.

Who We Look For

Requirements & Qualifications

The ideal intern candidates are those who

Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university,
are self-motivated and excited about developing novel techniques,
have research experiences in natural language processing or machine learning,
are proficient in Python programming and experienced in developing with deep learning frameworks such as PyTorch.
have good publication track records and history of creativity and intellectual flexibility,
have excellent communication and teamwork skills, capable of collaborating with cross-functional teams to drive project success and innovation.
Intern duration: 3 months (with the possibility of extension). Can start any time in the year 2026.

Location State(s)

US-Washington-Bellevue

The expected base pay range for this position in the location(s) listed above is $27.00 to $57.70 per hour. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Who we are

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world.

Equal Employment Opportunity at Tencent

10 Skills Required For This Role

Team Management Cross Functional Communication Game Texts Pytorch Deep Learning Reinforcement Learning Python Algorithms Machine Learning

Similar Jobs

Research Development

Team Lead - Annotations

GoMotive • Islamabad, Islamabad Capital Territory, Pakistan (Remote)

Research Internship – Reinforcement Learning for Large Foundation Models

Job Summary

Must Have

Perks & Benefits

Job Description

Business Unit

What the Role Entails

Who We Look For

Equal Employment Opportunity at Tencent

Who we are

Equal Employment Opportunity at Tencent

10 Skills Required For This Role

Similar Jobs

Research Development

Software Development & Engineering