Data Science Lead - AI

1 Month ago • 7 Years + • Data Analysis

Job Summary

Job Description

We are building Sahabat-AI, a multilingual Large Language Model tailored for Bahasa Indonesia and regional languages. We are looking for a passionate Lead Data Scientist to help shape the future of open and inclusive AI for Indonesia, playing a pivotal role in identifying impactful AI use cases. As a Lead Data Scientist working on LLMs, you will design and build high-quality datasets, advanced model pre-training, fine-tuning and alignment techniques, and collaborate closely with product and engineering teams to ship safe, reliable LLM-powered features to millions of users. This role offers the opportunity to drive innovation, solve critical business challenges, and shape the future of AI-driven solutions at GoTo Group. You will work with large-scale multilingual corpora, build datasets for continual pretraining and fine-tuning, contribute to scaling multilingual LLMs, implement state-of-the-art methods, and ensure responsible model behavior.
Must have:
  • 7+ years of experience in deep learning, NLP, and LLM
  • Proficient in data preprocessing, model training, evaluation, and optimisation
  • Practical experience in applying deep learning to solve real business problems
  • Proficient with Python and deep learning frameworks like PyTorch or Tensorflow
  • Experience with cloud platforms like Alibaba Cloud, GCP or AWS
  • Strong communication skills
  • Ability to write clear and concise technical documentation
  • Master’s or PhD in Computer Science, Data Science, AI, or related field
Good to have:
  • Understanding in computer vision and voice will be a plus point

Job Details

About the Role:

We are building Sahabat-AI, a multilingual Large Language Model tailored for Bahasa Indonesia and regional languages. We are looking for a passionate Lead Data Scientist to help shape the future of open and inclusive AI for Indonesia, as well as playing a pivotal role in identifying impactful AI use cases. As a Lead Data Scientist working on LLMs, you will design and build high-quality datasets, advanced model pre-training, fine tuning and and alignment techniques, and collaborate closely with product and engineering teams to ship safe, reliable LLM-powered features to millions of users. This role offers the opportunity to drive innovation, solve critical business challenges, and shape the future of AI-driven solutions at GoTo Group.


What You Will Do
  • Work with large-scale multilingual corpora, including text, audio, and image modalities
  • Build high-quality datasets for both continual pretraining,post-training (SFT, RLHF, DPO), and benchmark evaluation
  • Contribute to the training and scaling of multilingual LLMs – from continual pretraining to supervised fine-tuning and alignment.
  • Implement state-of-the-art methods and research for efficient and scalable operations.
  • Implement and improve safety alignment and guardrail systems to ensure responsible and culturally appropriate model behaviour.
  • Collaborate closely with business/product engineers to deploy production-grade LLM-powered solutions.
  • Stay current with advancements in AI technologies. Frontier models, training methodologies, etc


What You Will Need
  • 7+ years of experience in deep learning, NLP, and LLM
  • Understanding in computer vision and voice will be a plus point
  • Proficient in data preprocessing, model training, evaluation, and optimisation.
  • Practical experience in applying deep learning to solve real business problems, with models successfully deployed and used in production environments.
  • Proficient with Python and deep learning frameworks such as PyTorch or Tensorflow.
  • Experience with cloud platforms like Alibaba Cloud, GCP or AWS.
  • Strong communication skills to understand business needs and effectively convey analytical solutions.
  • Ability to write clear and concise technical documentation.
  • A Master’s or PhD in Computer Science, Data Science, AI, or a related field.


About the Team:


The Sahabat-AI team is on a mission to build the most capable and culturally-aligned multilingual LLMs for Indonesia. At GoTo Group, the Sahabat-AI team is at the forefront of developing state-of-the-art language models. We are building foundational AI models that understand and generate Bahasa Indonesia and regional languages – empowering more inclusive technology. We work on everything from continual pretraining large-scale LLMs to alignment and safety fine-tuning, using both structured and unstructured data from diverse sources. Our projects span core model development, dataset curation, safety systems, and real-world deployment in consumer and enterprise applications. Our team brings together members with diverse technical and cultural backgrounds, bringing expertise in machine learning and local languages. 


About GoTo Group

GoTo Group is the largest digital ecosystem in Indonesia with its mission to “Empower Progress’ by offering technological infrastructure and solutions for everyone to access and thrive in the digital economy. The GoTo ecosystem consists of on-demand transportation services, food and grocery delivery, logistics and fulfillment, as well as financial and payment services through the Gojek and GoTo Financial platforms.It is the first platform in Southeast Asia that hosts these crucial cases in a single ecosystem, capturing the majority of Indonesia’s vast consumer household.


About Gojek 

Gojek is Southeast Asia’s leading on-demand platform and pioneer of the multi-service ecosystem with over 2.5 million driver partners across the regions offering a wide range of services such as transportation, food delivery, logistics and more. With its mission to create impact at scale, Gojek is committed to resolving consumer problems and raising standards of living by connecting consumers to the best providers of goods and services in the market.


About GoTo Financial

GoTo Financial accelerates financial inclusion through its leading financial services and merchants solutions. Its consumer services include GoPay and GoPayLater and serve businesses of all sizes through Midtrans, Moka, GoBiz Plus, GoBiz, and Selly. With its trusted and inclusive ecosystem of products, GoTo Financial is open to new growth opportunities and aims to empower everyone to Make It Happen, Make It Together, Make It Last.


GoTo and its business units, including Gojek and GoToFinancial ("GoTo") only post job opportunities on our official channels on our respective company websites and on LinkedIn. GoTo is not liable for any job postings or job offers that did not originate from us. You should conduct your own due diligence to prevent being victims of any fake job scams, if they did not originate from GoTo's official recruitment channels.


#LI-HYBRID

Similar Jobs

HCL Tech - Technical Lead iOS, Android, Java

HCL Tech

California, United States (On-Site)
1 Month ago
Wargaming - Lead Level Artist

Wargaming

Warsaw, Masovian Voivodeship, Poland (Remote)
1 Month ago
Salesforce - Client SE

Salesforce

Dublin, County Dublin, Ireland (On-Site)
1 Week ago
Jane Street - Linux Engineer

Jane Street

Singapore (On-Site)
2 Months ago
Morning Star - Associate Sales Manager - Audience

Morning Star

London, England, United Kingdom (Hybrid)
1 Day ago
Minecast - Principal Data Engineer

Minecast

London, England, United Kingdom (Hybrid)
2 Days ago
Casumo - Senior Business Analyst

Casumo

Zagreb, Croatia (Hybrid)
4 Months ago
Nintendo - Sr BI Data Analytics Engineer

Nintendo

Redmond, Washington, United States (On-Site)
3 Months ago
Addepar - Sr. Software Engineer - Reference Data

Addepar

United States (Remote)
2 Months ago
luxsoft - Technical Lead / Senior Data Engineer

luxsoft

Ukrainka, Kyiv Oblast, Ukraine (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Penumbrainc - Facilities Tech I (Swing Shift)

Penumbrainc

Alameda, California, United States (On-Site)
5 Months ago
Honor - Director of Data Platform & Analytics

Honor

United States (Remote)
2 Days ago
Shield AI - Staff Engineer, Software (R3175)

Shield AI

Washington, District Of Columbia, United States (On-Site)
2 Days ago
eBay - Billing Analyst

eBay

Austin, Texas, United States (Hybrid)
2 Days ago
extreme network - Sr. Director, Chief of Staff to Chief Information and Customer Officer

extreme network

North Carolina, United States (Remote)
3 Weeks ago
Rackspace Technology - Senior Azure Engineer

Rackspace Technology

Bengaluru, Karnataka, India (Remote)
1 Month ago
entrata - Strategic Account Manager

entrata

Dallas, Texas, United States (Remote)
2 Months ago
USE Insider - Solution Architect - Germany

USE Insider

Berlin, Berlin, Germany (Hybrid)
9 Months ago
Activision - Director, QA

Activision

Shakopee, Minnesota, United States (On-Site)
3 Months ago
Bito - Inside Sales Executive

Bito

Pune, Maharashtra, India (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Singapore

Ubisoft - Senior Technical Artist (MOSAIC)

Ubisoft

Singapore (On-Site)
4 Months ago
bytedance - Product Manager - Edge Computing Platform

bytedance

Singapore (On-Site)
8 Months ago
bytedance - Software Engineer, Video-On-Demand

bytedance

Singapore (On-Site)
8 Months ago
sitetracker - Salesforce Solution Architect

sitetracker

Singapore (On-Site)
1 Month ago
Illumina - Manufacturing Equipment Engineer

Illumina

Singapore, Singapore (On-Site)
1 Year ago
IGG - Senior Backend Engineer

IGG

Singapore (On-Site)
9 Months ago
Ubisoft - Product Marketing Intern

Ubisoft

Singapore (Hybrid)
2 Months ago
Habby fun - Senior 2D Animator

Habby fun

Singapore (On-Site)
3 Weeks ago
Precisly - Business Development Representative

Precisly

Singapore (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

HCL Tech - AWS Senior Data Engineer

HCL Tech

Texas, United States (On-Site)
3 Weeks ago
Mattel Inc - Associate Product Data Analyst

Mattel Inc

Shenzhen, Guangdong Province, China (On-Site)
3 Months ago
IMC - Data Quality Engineer

IMC

Amsterdam, North Holland, Netherlands (On-Site)
3 Weeks ago
Apple - AIML - Senior Data Scientist, Evaluation

Apple

Seattle, Washington, United States (On-Site)
3 Weeks ago
Embark Studios - Generalist Software Engineer - Data Tech

Embark Studios

Stockholm, Stockholm County, Sweden (On-Site)
2 Months ago
Ziff Davis - Data Architect

Ziff Davis

Helsinki, Uusimaa, Finland (Hybrid)
2 Months ago
Netomi - Senior Data Engineer

Netomi

Canada (Remote)
2 Months ago
dun bradstreet - Data Scientist

dun bradstreet

Chennai, Tamil Nadu, India (Hybrid)
3 Months ago
Super.com - Manager, Data Analytics

Super.com

United States (Remote)
4 Months ago
Nagarro - Associate Principal Consultant, Business Analyst

Nagarro

(On-Site)
9 Months ago

Get notifed when new similar jobs are uploaded

About The Company

GoTo is the largest technology group in Indonesia, combining on-demand and financial services through the Gojek and GoTo Financial brands. It is the first platform in Southeast Asia to host these two essential use cases in one ecosystem, capturing a majority of Indonesian consumer household expenditure.


GoTo’s mission is to “Empower Progress” by offering an unparalleled selection of goods and services through a comprehensive merchant and partner network and promoting financial inclusion through its leading payments and financial services business.

Jakarta, Indonesia (On-Site)

Jakarta, Indonesia (On-Site)

Jakarta, Indonesia (On-Site)

Jakarta, Indonesia (On-Site)

Bengaluru, Karnataka, India (Hybrid)

Denpasar, Bali, Indonesia (On-Site)

Jakarta, Indonesia (On-Site)

Jakarta, Indonesia (On-Site)

View All Jobs

Get notified when new jobs are added by GoTo Group

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug