Research Internship – Reinforcement Learning for Large Foundation Models

23 Minutes ago • All levels • $56,160 PA - $120,016 PA
Research Development

Job Description

Tencent AI Lab at Seattle Area is seeking Research Interns for 2026, focusing on Reinforcement Learning for Large Foundation Models. The role involves developing stable and efficient RL algorithms to enhance large foundation models in complex reasoning, agent tasks, autonomous exploration, and continuous learning. Interns will work on core problems in RL algorithms, reward modeling, and world models, conducting large-scale experiments and publishing research papers.
Must Have:
  • Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university
  • Self-motivated and excited about developing novel techniques
  • Research experiences in natural language processing or machine learning
  • Proficient in Python programming
  • Experienced in developing with deep learning frameworks such as PyTorch
  • Good publication track records and history of creativity and intellectual flexibility
  • Excellent communication and teamwork skills
  • Capable of collaborating with cross-functional teams to drive project success and innovation
  • Intern duration: 3 months (with the possibility of extension)
  • Can start any time in the year 2026
Perks:
  • 1 hour of paid sick leave for every 30 hours worked
  • Up to 13 paid holidays throughout the calendar year
  • Eligible to enroll in the Company-sponsored medical plan (for full-time interns)

Add these skills to join the top 1% applicants for this job

team-management
cross-functional
communication
game-texts
pytorch
deep-learning
reinforcement-learning
python
algorithms
machine-learning

Business Unit

What the Role Entails

About Tencent AI Lab at Seattle Area

Tencent is a leading internet company in China. Tencent AI Lab at Seattle Area was established in May 2017. The lab strives to continuously improve AI's capability in perception, cognition, and creativity. Researchers there aim at solving challenging real-world problems with advanced technologies and publish extensively at top conferences and journals.

Research Internship – Reinforcement Learning for Large Foundation Models

Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The lab's long-term ambition is to drive the development of Artificial General Intelligence (AGI), and ultimately, Artificial Superintelligence (ASI). We are currently seeking research interns for the year of 2026, in the area of reinforcement learning (RL) for large foundation models, with an emphasis on developing stable and efficient RL algorithms. The goal is to empower large foundation models in complex reasoning ang agent tasks and enhance their capabilities in autonomous exploration and continuous learning. Our Seattle area office is located in Bellevue WA.

Every research intern will work with researchers on a research project aimed at attacking one of the core problems on the design and optimization of RL algorithms for large foundation models. Research areas include but are not limited to Reinforcement Learning Algorithms, Reward Modeling, and World Models. We will conduct large-scale experiments of RL algorithms in scenarios such as complex reasoning and autonomous agents, deliver impactful algorithms for real world applications, and publish influential research papers.

Who We Look For

Requirements & Qualifications

The ideal intern candidates are those who

  • Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or related fields from a top university,
  • are self-motivated and excited about developing novel techniques,
  • have research experiences in natural language processing or machine learning,
  • are proficient in Python programming and experienced in developing with deep learning frameworks such as PyTorch.
  • have good publication track records and history of creativity and intellectual flexibility,
  • have excellent communication and teamwork skills, capable of collaborating with cross-functional teams to drive project success and innovation.
  • Intern duration: 3 months (with the possibility of extension). Can start any time in the year 2026.

Location State(s)

US-Washington-Bellevue

The expected base pay range for this position in the location(s) listed above is $27.00 to $57.70 per hour. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Who we are

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world.

Read More

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Read More

Set alerts for more jobs like Research Internship – Reinforcement Learning for Large Foundation Models
Set alerts for new jobs by Tencent
Set alerts for new Research Development jobs in United States
Set alerts for new jobs in United States
Set alerts for Research Development (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙