Data Scientist

4 Months ago • All levels • Data Analyst • Artificial Intelligence

Job Summary

Job Description

This remote Data Scientist role focuses on developing and applying advanced Large Language Models (LLMs) to enhance AI-driven healthcare solutions. Must-have skills include expertise in LLMs, NLP, Python, and experience fine-tuning pre-trained LLMs for specific tasks. Ideal candidates will possess strong problem-solving skills, a passion for healthcare data, and a collaborative mindset.
Must have:
  • LLMs expertise
  • NLP proficiency
  • Python experience
  • Fine-tuning LLMs
Good to have:
  • Commercial LLMs
  • Transfer learning
  • RAG experience
  • Cloud platforms
Perks:
  • Remote working
  • Flexible hours

Job Details

About the job

This is a remote position.

We are seeking a highly skilled Data Scientist with expertise in Large Language Models (LLMs) and Natural Language Processing (NLP). In this role, you will lead the development and application of advanced LLMs to enhance our AI-driven solutions, drive innovation in language understanding, and enable meaningful insights from vast datasets. This position requires deep knowledge in data science, machine learning (ML), and the ability to fine-tune and deploy LLMs to solve real-world problems.

You will be working with cross-functional teams of healthcare professionals, data scientists, engineers, and business stakeholders to define and deliver cutting-edge solutions that transform healthcare delivery. If you are a highly motivated and results-oriented individual with a passion for leveraging healthcare data to drive insights and improve patient outcomes, we would love to hear from you.

Essential Duties And Responsibilities:

  • Lead the design, development, and deployment of LLM-based solutions to address complex business challenges.
  • Conduct advanced research and experiment with commercial LLM solutions (OpenAI GPT, Gemma, Llama), and leverage their APIs for rapid prototyping.
  • Fine-tune pre-trained LLMs (e.g., GPT, BERT, T5) to suit specific use cases, ensuring high performance on natural language tasks.
  • Utilize transfer learning techniques to adapt LLMs for domain-specific tasks.
  • Develop and implement strategies for data augmentation and synthesis to improve model generalization.
  • Collaborate with cross-functional teams (engineering, product management, and operations) to integrate LLMs into various applications, such as chatbots, knowledge retrieval systems, and data analytics platforms.
  • Conduct advanced research in the field of LLMs to stay updated with cutting-edge methodologies and apply them to real-world problems.
  • Build pipelines for preprocessing, training, and deploying models, ensuring scalability and reliability.
  • Evaluate and optimize models using metrics such as accuracy, F1 score, perplexity, and computational efficiency.
  • Create data pipelines for large-scale text data ingestion, cleaning, and annotation.
  • Perform A/B testing and rigorous validation of LLM-based models, providing insights on model efficacy.
  • Mentor junior data scientists and engineers on NLP techniques and best practices.
  • Document model development and analysis processes for transparency and reproducibility.

Requirements

Qualifications:

  • Bachelor's degree in Data Science, Computer Science, Machine Learning, or a related field.
  • A Master’s degree is preferred.

Skill Sets:

  • Hands-on experience with popular LLM architectures (GPT, BERT, RoBERTa, T5, etc.) and fine-tuning them for specific tasks.
  • Familiarity with commercial LLM solutions such as GPT, Gemma, Gemini, Llama.
  • Proficiency in programming languages such as Python and frameworks like PyTorch or TensorFlow.
  • Expertise in building and deploying machine learning models, including experience with model serving frameworks like TensorFlow Serving, MLflow, or similar.
  • Experience in fine-tuning pre-trained LLMs and applying transfer learning to adapt models for domain-specific tasks.
  • Familiarity with Retrieval-Augmented Generation (RAG) for improving model responses by integrating external knowledge bases or search engines.
  • Experience using Jupyter Notebooks for experimentation, prototyping, and data analysis.
  • Experience with cloud platforms (AWS, GCP, or Azure) for scalable model deployment.
  • Strong knowledge of data preprocessing, feature engineering, and text vectorization methods (Word2Vec, GloVe, etc.).
  • Proficient in statistical analysis and using tools like Pandas, NumPy, and Scikit-learn.
  • Familiarity with MLOps practices for version control, CI/CD, and containerization (Docker, Kubernetes).
  • Prior experience in developing conversational AI, chatbots, or virtual assistants using LLMs.
  • Experience in the healthcare sector, with a focus on processing domain-specific language or medical text.
  • Contributions to research publications or open-source NLP projects are a plus.
  • Strong problem-solving skills and the ability to work with unstructured data.
  • Excellent communication and collaboration skills.

Attitudinal/Cultural Fit

  • You are a passionate self-learner & love taking on new challenges
  • You love solving problems & digging for creative solutions if you don’t know the answer to a problem
  • You don’t get flustered easily. You can remain level-headed under pressure
  • You are meticulous & pay attention to details. Nothing falls off your radar. You believe in doing things perfectly, every single time you do them

Benefits

  • Remote working.
  • Flexible working hours.
  • Great work culture.
  • Five-day work week.
  • Group medical insurance.
