Delivery Manager – Data Science (AI / LLM / Agentic Systems)

Turing

10+ Years | San Francisco, California, United States (Remote) | Full Time | 1 day ago

Apply Now

Job Summary

This high-impact leadership role involves overseeing multiple cross-functional teams of data scientists, ML engineers, and data professionals engaged in Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), and Agentic AI system development. The Delivery Manager will ensure the delivery of world-class training data, reproducible pipelines, and scalable experimentation frameworks that directly support frontier AI research and enterprise-grade AI deployments.

Must Have

Lead large, distributed teams of data scientists, ML engineers, and Python developers.
Scale and operationalize data science workflows for SFT, RLHF, and RLAIF pipelines.
Define and manage project goals, timelines, and quality benchmarks for multiple concurrent programs.
Oversee end-to-end delivery of complex AI/ML initiatives, ensuring client goals are achieved.
Ensure rigorous standards for data, model, and process quality throughout the lifecycle.
Oversee data collection and benchmarking for LLM alignment and evaluation.
10+ years of experience in data science, AI/ML, or related engineering leadership roles.
Proven success managing large-scale AI delivery programs (100+ members).
Deep expertise in LLM training methodologies (SFT, RLHF, RLAIF) and Agentic AI systems.
Strong proficiency in Python and modern ML frameworks such as PyTorch, TensorFlow, Hugging Face, LangChain, and LlamaIndex.
Advanced understanding of data pipelines, annotation processes, and quality metrics.
Excellent client-facing communication, stakeholder management, and cross-functional collaboration.
Demonstrated ability to balance technical depth with operational leadership and business impact.

Good to Have

Advanced degree (Master’s or PhD) in Computer Science, Data Science, or AI-related field.
Experience delivering LLM and Agentic AI projects in production or research settings.
Knowledge of Responsible AI, fairness, and model interpretability practices.
Familiarity with project gamification, motivation frameworks, or workforce optimization at scale.
Experience working in fast-paced AI research or product organizations.

Perks & Benefits

Amazing work culture (Super collaborative & supportive work environment; 5 days a week)
Awesome colleagues (Surround yourself with top talent from Meta, Google, LinkedIn etc. as well as people with deep startup experience)
Competitive compensation
Flexible working hours

Job Description

About

Based in San Francisco, California, is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises looking to deploy advanced AI systems. accelerates frontier research with high-quality data, specialized talent, and training pipelines that advance thinking, reasoning, coding, multimodality, and STEM. For enterprises, builds proprietary intelligence systems that integrate AI into mission-critical workflows, unlock transformative outcomes, and drive lasting competitive advantage. Recognized by Forbes, The Information, and Fast Company among the world’s top innovators, leadership team includes AI technologists from Meta, Google, Microsoft, Apple, Amazon, McKinsey, Bain, Stanford, Caltech, and MIT. Learn more at www.turing.com

Position Overview

is seeking an accomplished Delivery Manager – Data Science to lead large-scale LLM training and Agentic AI programs, driving the successful delivery of mission-critical datasets, fine-tuning workflows, and model alignment initiatives. In this role, you will oversee multiple cross-functional teams of data scientists, ML engineers, and data professionals engaged in Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), and Agentic AI system development. You’ll ensure the delivery of world-class training data, reproducible pipelines, and scalable experimentation frameworks that directly support frontier AI research and enterprise-grade AI deployments. This is a high-impact leadership role at the intersection of technical depth, delivery excellence, and strategic alignment — ideal for someone who combines expertise in LLMs with operational mastery of large-scale AI execution.

Primary Responsibilities

1. Team and Delivery Leadership

Lead large, distributed teams of data scientists, ML engineers, and Python developers delivering high-quality AI datasets and models.
Scale and operationalize data science workflows for SFT, RLHF, and RLAIF pipelines.
Define and manage project goals, timelines, and quality benchmarks for multiple concurrent programs.
Drive team performance, engagement, and innovation, fostering a culture of technical excellence and accountability.

2. Program and Stakeholder Management

Oversee end-to-end delivery of complex AI/ML initiatives, ensuring client goals are achieved on schedule and within scope.
Partner with internal research and product leaders to translate technical requirements into actionable execution plans.
Manage dependencies across annotation, model training, and evaluation streams, mitigating risks proactively.
Deliver clear and consistent program reporting, highlighting key metrics, challenges, and insights.

3. Technical Oversight and Data Quality

Ensure rigorous standards for data, model, and process quality throughout the lifecycle.
Oversee data collection and benchmarking for LLM alignment and evaluation.
Leverage analytics to identify model performance gaps, biases, and optimization opportunities.
Champion reproducible experimentation, responsible AI practices, and scalable infrastructure design.

4. Research Collaboration and Innovation

Collaborate closely with AI researchers and infrastructure teams to refine model training methodologies.
Drive continuous improvement in data quality pipelines, annotation frameworks, and feedback loops.
Support the development of Agentic AI systems, enabling autonomous agents that learn from human and tool-based feedback.
Document and disseminate best practices to accelerate team learning and institutional knowledge.

Required Skills & Qualifications

10+ years of experience in data science, AI/ML, or related engineering leadership roles.
Proven success managing large-scale AI delivery programs (100+ members) across multiple workstreams.
Deep expertise in LLM training methodologies (SFT, RLHF, RLAIF) and Agentic AI systems.
Strong proficiency in Python and modern ML frameworks such as PyTorch, TensorFlow, Hugging Face, LangChain, and LlamaIndex.
Advanced understanding of data pipelines, annotation processes, and quality metrics.
Excellent client-facing communication, stakeholder management, and cross-functional collaboration.
Demonstrated ability to balance technical depth with operational leadership and business impact.

Preferred Skills & Qualifications

Advanced degree (Master’s or PhD) in Computer Science, Data Science, or AI-related field.
Experience delivering LLM and Agentic AI projects in production or research settings.
Knowledge of Responsible AI, fairness, and model interpretability practices.
Familiarity with project gamification, motivation frameworks, or workforce optimization at scale.
Experience working in fast-paced AI research or product organizations.

Why Join

Work at the intersection of cutting-edge AI research and scalable data delivery.
Lead flagship LLM and Agentic AI programs that define the next generation of intelligence systems.
Mentor high-performing global teams and influence both technical and strategic direction.
Be part of a mission-driven culture shaping the future of AI delivery excellence.

Values:

We are client first: We put our clients at the center of everything we do, because their success is the ultimate measure of our value.
We work at Start-Up Speed: We move fast, stay agile and favor action because momentum is the foundation of perfection
We are Al forward: We help our clients build the future of Al and implement it in our own roles and workflow to amplify productivity.

Advantages of joining:

Amazing work culture (Super collaborative & supportive work environment; 5 days a week)
Awesome colleagues (Surround yourself with top talent from Meta, Google, LinkedIn etc. as well as people with deep startup experience)
Competitive compensation
Flexible working hours

9 Skills Required For This Role

Cross Functional Game Texts Cross Functional Collaboration Agile Development Data Science Pytorch Reinforcement Learning Python Tensorflow