Senior Data Scientist (LLM post training)

Grab

| Beijing, China (On Site) | Full Time | 3 months ago

Apply Now

Job Summary

We are seeking Senior Data Scientists specializing in large language model post-training. This role involves research and development in key areas such as reasoning, multi-modality, multi-lingual capabilities, and intelligent agents. Responsibilities include LLM post-training and optimization techniques like fine-tuning, low-resource training, and distillation, alongside domain-specific data collection and augmentation. The position also focuses on the application and deployment of these models in critical user-facing and internal productivity scenarios, including risk control, customer support, and personalization, aiming to deliver innovative solutions and significant business value.

Must Have

Master's degree or higher in computer science, artificial intelligence, or a related field.
Solid theoretical foundation in machine learning and deep learning, with in-depth understanding of large models and related technical domains.
Experience in large model post-training and fine-tuning, proficient in applying techniques such as SFT and DPO.
Experience in cleaning, labeling, and generating large amounts of training data.
Experience in developing, deploying, and launching applications based on large models.
Proficiency in Python and mainstream deep learning frameworks.
Excellent written and verbal communication skills, and good English communication skills.

Good to Have

Experience in participation and contribution to open-source communities.
Publication experience at top conferences.
Experience in end-to-end LLM application deployment.
Good technical strategic sense and strong business acumen.

Perks & Benefits

Term Life Insurance
Comprehensive Medical Insurance
GrabFlex (customizable benefits package)
Parental leave
Birthday leave
Love-all-Serve-all (LASA) volunteering leave
Confidential Grabber Assistance Programme
FlexWork arrangements such as differentiated hours

Job Description

Get to know the team

The data science team develops data mining, machine learning and various algorithms to optimize user experience and the efficiency of the platform.

Get to know the role

We are currently seeking several Senior Data Scientists in the field of large language model post-training to join our team.

The roles involve research and development in key areas such as reasoning, multi-modality, multi-lingual and agents. Application areas include risk & safety, customer support, personalization and internal productivities.

The team will track the latest technological advancements, design innovative solutions, carry out end-to-end system deployment, and generate value for the businesses.

Responsibilities:

LLM post-training and optimization, including fine-tuning, low-resource training, distillation, and performance evaluation.
Domain-specific data collection, generation, cleaning, labeling, and augmentation.
Develop technology and systems to post-train reasoning models, multimodal models, multi-lingual models and agents.
Application deployment in user-facing product experience and internal productivity scenarios, such as risk control, customer support, personalization and others.
Present and demonstrate technical achievements through documents, patents, prototypes, and product launches.
Actively participating in business and product planning.

Requirements:

Master's degree or higher in computer science, artificial intelligence, or a related field.
Solid theoretical foundation in machine learning and deep learning, with in-depth understanding of large models and related technical domains.
Experience in large model post-training and fine-tuning, proficient in applying techniques such as SFT and DPO.
Experience in cleaning, labeling, and generating large amounts of training data.
Experience in developing, deploying, and launching applications based on large models.
Proficiency in Python and mainstream deep learning frameworks.
Excellent written and verbal communication skills, and good English communication skills.

Really nice to haves:

Experience in participation and contribution to open-source communities;
Publication experience at top conferences.
Experience in end-to-end LLM application deployment.
Good technical strategic sense and strong business acumen.

Life at Grab

We care about your well-being at Grab, here are some of the global benefits we offer:

We have your back with Term Life Insurance and comprehensive Medical Insurance.

With GrabFlex, create a benefits package that suits your needs and aspirations.

Celebrate moments that matter in life with loved ones through Parental and Birthday leave, and give back to your communities through Love-all-Serve-all (LASA) volunteering leave

We have a confidential Grabber Assistance Programme to guide and uplift you and your loved ones through life's challenges.

Balancing personal commitments and life's demands are made easier with our FlexWork arrangements such as differentiated hours

What We Stand For At Grab

We are committed to building an inclusive and equitable workplace that provides equal opportunity for Grabbers to grow and perform at their best. We consider all candidates fairly and equally regardless of nationality, ethnicity, race, religion, age, gender, family commitments, physical and mental impairments or disabilities, and other attributes that make them unique.

9 Skills Required For This Role

Communication Game Texts User Experience Ux Prototyping Data Science Deep Learning Python Algorithms Machine Learning

Similar Jobs

Research Development

Software Engineer, BigQuery AI Developer Experience

Google • Kirkland, Washington, United States of America (On Site)