Hunyuan Multimodal Algorithm Researcher intern(Omni-Modal)
Tencent
Job Summary
Tencent is seeking a Hunyuan Multimodal Algorithm Researcher intern to conduct research and development on Omni multimodal large models. This includes designing training data, foundational model algorithms, optimization (pre-training/SFT/RL), model capability evaluation, and exploring downstream application scenarios. The role involves scientifically analyzing R&D challenges, identifying performance bottlenecks, and devising solutions to accelerate model development. The intern will also explore diverse paradigms for Omni-modal understanding and generation, researching next-generation model architectures to push the boundaries of multimodal models.
Must Have
- Bachelor’s degree or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields
- Solid foundation in deep learning algorithms and practical experience in large model development
- Proficiency in underlying implementation details of deep learning networks and operators
- Proficiency in model tuning for training/inference, CPU/GPU acceleration, and distributed training/inference optimization
- Strong learning agility, communication skills, teamwork, and curiosity
Good to Have
- Graduate degrees are prioritized
- Hands-on experience in large-scale multimodal data processing and high-quality data generation
- Familiarity with Diffusion Models and Autoregressive Models
- Publication in top-tier conferences or experience in cross-modal (e.g., audio-visual) research
- Practical experience in underlying implementation details of deep learning networks and operators, model tuning for training/inference, CPU/GPU acceleration, and distributed training/inference optimization
- Participation in ACM or NOI competitions
Perks & Benefits
- 1 hour of paid sick leave for every 30 hours worked
- Up to 13 paid holidays throughout the calendar year
- Company-sponsored medical plan (for full-time interns)
Job Description
Business Unit
What the Role Entails
1. Conduct research and development of Omni multimodal large models, including the design and construction of training data, foundational model algorithm design, optimization related to pre-training/SFT/RL, model capability evaluation, and exploration of downstream application scenarios.
2. Scientifically analyze challenges in R&D, identify bottlenecks in model performance, and devise solutions based on first principles to accelerate model development and iteration, ensuring competitiveness and leading-edge performance.
3. Explore diverse paradigms for achieving Omni-modal understanding and generation capabilities, research next-generation model architectures, and push the boundaries of multimodal models.
Who We Look For
1. Bachelor’s degree (full-time preferred) or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields; graduate degrees are prioritized.
2. Hands-on experience in large-scale multimodal data processing and high-quality data generation is highly preferred.
3. Solid foundation in deep learning algorithms and practical experience in large model development; familiarity with Diffusion Models and Autoregressive Models is advantageous. Publication in top-tier conferences or experience in cross-modal (e.g., audio-visual) research is preferred.
4. Proficiency in underlying implementation details of deep learning networks and operators, model tuning for training/inference, CPU/GPU acceleration, and distributed training/inference optimization; practical experience is a plus.
5. Participation in ACM or NOI competitions is highly valued.
6. Strong learning agility, communication skills, teamwork, and curiosity.
Location State(s)
US-California-Palo Alto
The expected base pay range for this position in the location(s) listed above is $80,169.00 to $120,000.14 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. This position will be eligible for 1 hour of paid sick leave for every 30 hours worked and up to 13 paid holidays throughout the calendar year. Subject to the terms and conditions of the applicable plans then in effect, full-time interns are also eligible to enroll in the Company-sponsored medical plan.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
Who we are
Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world.
Equal Employment Opportunity at Tencent
As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.