Senior Data Scientist (LLM post training)

4 Minutes ago • All levels
Research Development

Job Description

We are seeking Senior Data Scientists specializing in large language model post-training. This role involves research and development in key areas such as reasoning, multi-modality, multi-lingual capabilities, and intelligent agents. Responsibilities include LLM post-training and optimization techniques like fine-tuning, low-resource training, and distillation, alongside domain-specific data collection and augmentation. The position also focuses on the application and deployment of these models in critical user-facing and internal productivity scenarios, including risk control, customer support, and personalization, aiming to deliver innovative solutions and significant business value.
Perks:
  • Term Life Insurance
  • Comprehensive Medical Insurance
  • GrabFlex (customizable benefits package)
  • Parental leave
  • Birthday leave
  • Love-all-Serve-all (LASA) volunteering leave
  • Confidential Grabber Assistance Programme
  • FlexWork arrangements such as differentiated hours

Get to know the team

The data science team develops data mining, machine learning, and other algorithms to optimize the user experience and the efficiency of the platform.

Get to know the role

We are currently seeking several Senior Data Scientists in the field of large language model post-training to join our team.

The roles involve research and development in key areas such as reasoning, multi-modality, multilingual capabilities, and agents. Application areas include risk & safety, customer support, personalization, and internal productivity.

The team will track the latest technological advancements, design innovative solutions, carry out end-to-end system deployment, and generate value for the business.

Responsibilities:

  • LLM post-training and optimization, including fine-tuning, low-resource training, distillation, and performance evaluation.
  • Domain-specific data collection, generation, cleaning, labeling, and augmentation.
  • Develop technology and systems to post-train reasoning models, multimodal models, multi-lingual models and agents.
  • Application deployment in user-facing product experiences and internal productivity scenarios, such as risk control, customer support, and personalization.
  • Present and demonstrate technical achievements through documents, patents, prototypes, and product launches.
  • Actively participate in business and product planning.

Requirements:

  • Master's degree or higher in computer science, artificial intelligence, or a related field.
  • Solid theoretical foundation in machine learning and deep learning, with in-depth understanding of large models and related technical domains.
  • Experience in large model post-training and fine-tuning, proficient in applying techniques such as SFT and DPO.
  • Experience in cleaning, labeling, and generating large amounts of training data.
  • Experience in developing, deploying, and launching applications based on large models.
  • Proficiency in Python and mainstream deep learning frameworks.
  • Excellent written and verbal communication skills, including good English communication skills.

Really nice to haves:

  • Experience participating in and contributing to open-source communities.
  • Publication experience at top conferences.
  • Experience in end-to-end LLM application deployment.
  • Good technical strategic sense and strong business acumen.

Life at Grab

We care about your well-being at Grab. Here are some of the global benefits we offer:

We have your back with Term Life Insurance and comprehensive Medical Insurance.

With GrabFlex, create a benefits package that suits your needs and aspirations.

Celebrate moments that matter in life with loved ones through Parental and Birthday leave, and give back to your communities through Love-all-Serve-all (LASA) volunteering leave.

We have a confidential Grabber Assistance Programme to guide and uplift you and your loved ones through life's challenges.

Balancing personal commitments and life's demands is made easier with our FlexWork arrangements, such as differentiated hours.

What We Stand For At Grab

We are committed to building an inclusive and equitable workplace that provides equal opportunity for Grabbers to grow and perform at their best. We consider all candidates fairly and equally regardless of nationality, ethnicity, race, religion, age, gender, family commitments, physical and mental impairments or disabilities, and other attributes that make them unique.
