Hunyuan Multimodal Algorithm Researcher (Omni-Modal)

13 Minutes ago • All levels • $149,000 PA - $279,800 PA
Research Development

Job Description

Tencent is seeking a Hunyuan Multimodal Algorithm Researcher to conduct R&D on Omni multimodal large models. This role involves designing training data, foundational model algorithms, and optimizing pre-training/SFT/RL. The researcher will evaluate model capabilities, explore application scenarios, and identify bottlenecks to accelerate model development. Key responsibilities include researching next-generation model architectures and pushing the boundaries of multimodal models, requiring a strong background in deep learning and practical experience in large model development.
Good To Have:
  • Hands-on experience in large-scale multimodal data processing and high-quality data generation
  • Familiarity with Diffusion Models
  • Familiarity with Autoregressive Models
  • Publication in top-tier conferences
  • Experience in cross-modal (e.g., audio-visual) research
  • Participation in ACM or NOI competitions
Must Have:
  • Conduct research and development of Omni multimodal large models
  • Design and construct training data
  • Design foundational model algorithms
  • Optimize pre-training/SFT/RL
  • Evaluate model capabilities
  • Explore downstream application scenarios
  • Analyze R&D challenges scientifically
  • Identify bottlenecks in model performance
  • Devise solutions based on first principles
  • Accelerate model development and iteration
  • Explore diverse paradigms for Omni-modal understanding and generation capabilities
  • Research next-generation model architectures
  • Bachelor’s degree or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields
Perks:
  • Sign-on payment
  • Relocation package
  • Restricted stock units
  • Medical benefits
  • Dental benefits
  • Vision benefits
  • Life and disability benefits
  • 401(k) plan
  • 15 to 25 days of vacation per year
  • 13 days of holidays
  • 10 days of paid sick leave per year

Add these skills to join the top 1% applicants for this job

team-management
communication
game-texts
deep-learning
algorithms

Business Unit

What the Role Entails

1. Conduct research and development of Omni multimodal large models, including the design and construction of training data, foundational model algorithm design, optimization related to pre-training/SFT/RL, model capability evaluation, and exploration of downstream application scenarios.

2. Scientifically analyze challenges in R&D, identify bottlenecks in model performance, and devise solutions based on first principles to accelerate model development and iteration, ensuring competitiveness and leading-edge performance.

3. Explore diverse paradigms for achieving Omni-modal understanding and generation capabilities, research next-generation model architectures, and push the boundaries of multimodal models.

Who We Look For

1. Bachelor’s degree (full-time preferred) or higher in Computer Science, Artificial Intelligence, Mathematics, or related fields; graduate degrees are prioritized.

2. Hands-on experience in large-scale multimodal data processing and high-quality data generation is highly preferred.

3. Solid foundation in deep learning algorithms and practical experience in large model development; familiarity with Diffusion Models and Autoregressive Models is advantageous. Publication in top-tier conferences or experience in cross-modal (e.g., audio-visual) research is preferred.

4. Proficiency in underlying implementation details of deep learning networks and operators, model tuning for training/inference, CPU/GPU acceleration, and distributed training/inference optimization; practical experience is a plus.

5. Participation in ACM or NOI competitions is highly valued.

6. Strong learning agility, communication skills, teamwork, and curiosity.

Location State(s)

US-California-Palo Alto

The expected base pay range for this position in the location(s) listed above is $149,000.00 to $279,800.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company’s 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee’s tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Who we are

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life for people around the world.

Read More

Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Read More

Set alerts for more jobs like Hunyuan Multimodal Algorithm Researcher (Omni-Modal)
Set alerts for new jobs by Tencent
Set alerts for new Research Development jobs in United States
Set alerts for new jobs in United States
Set alerts for Research Development (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙