Staff Engineer - Data Scientist

2 Weeks ago • All levels • Artificial Intelligence • Undisclosed

About the job

Job Description

As a Staff Engineer - Data Scientist, you will apply your expertise in Python, AI, machine learning, and NLP. Responsibilities include training and fine-tuning foundation models, working with LLM APIs (OpenAI, Anthropic) and frameworks (LangChain, LlamaIndex), working with multi-agent systems, handling unstructured data with libraries like unstructured.io, using LlamaIndex for knowledge base creation, text chunking, embedding generation (BERT, GPT), knowledge graph construction (Neo4j, RDF), and implementing RAG systems. You will need proficiency in handling various document formats, extracting structured information, and designing ontologies.
Must have:
  • Proficiency in Python and DS libraries
  • Strong AI/ML/NLP knowledge
  • Experience with Foundation Models
  • LLM APIs & Frameworks expertise
  • Multi-agent systems experience
  • Unstructured data handling
  • LlamaIndex expertise
  • Text chunking techniques
  • Embedding generation
  • Knowledge graph construction
  • RAG system implementation
Good to have:
  • Machine Learning on AWS
  • Generative AI Fundamentals

Company Description

We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (19000+ experts across 33 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!

Job Description

  • Proficiency in Python and all associated DS libraries and frameworks.
  • Strong knowledge in AI, machine learning, and natural language processing.
  • Experience with leveraging, training and fine-tuning Foundation Models, including multimodal inputs and outputs.
  • Strong experience working with key LLM APIs (e.g. OpenAI, Anthropic) and LLM frameworks (e.g. LangChain, LlamaIndex).
  • Experience with multi-agent frameworks/systems and an understanding of multi-agent systems and their applications in complex problem-solving scenarios.
  • Experience with unstructured.io or similar libraries for handling various document formats and extracting structured information from unstructured data.
  • Expertise in using LlamaIndex for building and querying knowledge bases, including its data connectors, indexing strategies, and query engines.
  • Knowledge of effective text chunking techniques for optimal processing and indexing of large documents or datasets.
  • Proficiency in generating and working with text embeddings using models like BERT, GPT, or domain-specific embedding models.
  • Understanding of embedding spaces and their applications in semantic search and information retrieval.
  • Experience in constructing and querying knowledge graphs, including technologies like Neo4j or RDF triplestores.
  • Understanding of ontology design and graph-based reasoning.
  • Experience with RAG concepts and fundamentals (vector databases, semantic search, etc.).
  • Expertise in implementing RAG systems that combine knowledge bases with generative AI models.

Qualifications

Must have Skills: Python for Data Science (Capable).

Good To Have Skills: Machine Learning on AWS (Capable), Generative AI Fundamentals (Capable).
