Machine Learning Engineering, Model Training

5 Months ago • 3 Years + • Data Analyst • Artificial Intelligence

Job Summary

Job Description

Captions seeks a Machine Learning Engineer to join its AI Research team and build data infrastructure for training cutting-edge video generation models. Responsibilities include designing and developing data pipelines for video data handling, building systems for video pre-processing, creating data loaders for large-scale datasets, implementing feature engineering techniques, collaborating with research and engineering teams, and managing cluster code for high-performance training. The role involves building foundational, state-of-the-art machine systems. This is an opportunity to be an early team member and have significant impact on the product and company culture.
Must have:
  • Python programming skills
  • Data pre-processing & feature engineering experience
  • Large-scale data processing framework experience
  • Deep learning systems and offline model training
  • Data loaders and cluster infrastructure experience
  • Video or image data experience
Perks:
  • Comprehensive medical, dental, and vision plans
  • 401K with employer match
  • Commuter Benefits
  • Catered lunch
  • Dinner stipend
  • Doordash DashPass subscription
  • Health & Wellness Perks
  • Team offsites and events
  • Generous PTO and flexible WFH days

Job Details

Captions is the leading video AI company, building the future of video creation. Over 10 million creators and businesses have used Captions to create videos for social media, marketing, sales, and more. We're on a mission to serve the next billion.

We are a rapidly growing team of ambitious, experienced, and devoted engineers, researchers, designers, marketers, and operators based in NYC. You'll join an early team and have an outsized impact on the product and the company's culture.

We’re very fortunate to have some the best investors and entrepreneurs backing us, including Index Ventures (Series C lead), Kleiner Perkins (Series B lead), Sequoia Capital (Series A and Seed co-lead), Andreessen Horowitz (Series A and Seed co-lead), Uncommon Projects, Kevin Systrom, Mike Krieger, Lenny Rachitsky, Antoine Martin, Julie Zhuo, Ben Rubin, Jaren Glover, SVAngel, 20VC, Ludlow Ventures, Chapter One, and more.

Check out our latest financing milestone and some other coverage:

The Information: 50 Most Promising Startups

Fast Company: Next Big Things in Tech

The New York Times: When A.I. Bridged a Language Gap, They Fell in Love

Business Insider: 34 most promising AI startups

Time: The Best Inventions of 2024

** Please note that all of our roles will require you to be in-person at our NYC HQ (located in Union Square) **

About the Role:

We’re seeking a skilled Machine Learning Engineer to join our AI Research team and build the data infrastructure that powers the training of cutting-edge video generation models.
In this role, you’ll develop offline jobs dedicated to training large generative models, manage training cluster code, and create data loaders to handle large-scale video datasets. Being an early member of our AI Research team will give you the opportunity to build foundational, state-of-the-art machine systems 0 to 1. 

Key Responsibilities:

  • Design and develop robust data pipelines to support the efficient handling and processing of video data, ensuring high-quality data input for model training.

  • Build and optimize systems for video frame extraction and other pre-processing steps to prepare data for training workflows.

  • Create and manage data loaders for large-scale video datasets, focusing on speed and efficiency to support various model training requirements.

  • Implement feature engineering techniques that enhance data quality and diversity, aiding in model accuracy and performance.

  • Collaborate with research and engineering teams to scale data infrastructure and enable seamless experimentation and model iterations.

  • Write and maintain cluster code to support high-performance training operations, including resource allocation and management.

Requirements:

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, Machine Learning, or a related field.

  • 3+ years of professional experience in software engineering, data engineering, or ML infrastructure development.

  • Strong programming skills, particularly in Python, with proven experience with data pre-processing and feature engineering, ideally within video or image data contexts.

  • Professional experience working with large-scale data processing frameworks, deep-learning systems, offline model training workflows, data loaders, and cluster infrastructure.

Benefits:

  • Comprehensive medical, dental, and vision plans

  • 401K with employer match

  • Commuter Benefits

  • Catered lunch multiple days per week

  • Dinner stipend every night if you're working late and want a bite!

  • Doordash DashPass subscription

  • Health & Wellness Perks (Talkspace, Kindbody, One Medical subscription, HealthAdvocate, Teladoc)

  • Multiple team offsites per year with team events every month

  • Generous PTO policy and flexible WFH days

Captions provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

Please note benefits apply to full time employees only.

Compensation Range: $170K - $250K

Similar Jobs

Keywords Studios (Player Support) - Technical Artist - VFX

Keywords Studios (Player Support)

Victoria, British Columbia, Canada (Hybrid)
8 Months ago
paypal - Sr. Machine learning scientist

paypal

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Nasdaq - Cloud Operations Specialist, FinTech

Nasdaq

Taguig, Metro Manila, Philippines (Hybrid)
6 Months ago
Rockstar Games - Systems Engineer, Automation

Rockstar Games

London, England, United Kingdom (On-Site)
6 Months ago
PwC - SAP - BODS - Senior Associate-Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
The Walt Disney Company - Manager Data Science

The Walt Disney Company

New York, New York, United States (On-Site)
5 Months ago
Xsolla - Data Warehouse Architect

Xsolla

Serbia (Remote)
6 Months ago
Visa - Manager, Data Science - Visa Consulting and Analytics

Visa

Mumbai, Maharashtra, India (On-Site)
6 Months ago
Devoteam - Distributed Cloud l Google Data Project

Devoteam

(Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

MiQ - Manager Data Science

MiQ

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
HP - Graduate - Technical

HP

Tlaquepaque, Jalisco, Mexico (On-Site)
7 Months ago
Silicon Labs - Test Systems Engineer II

Silicon Labs

Austin, Texas, United States (On-Site)
6 Months ago
Flexera Software - Member Technical Staff - Site Reliability Engineer

Flexera Software

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Google - Cloud Security Engineer, Professional Services, Google Cloud

Google

Austin, Texas, United States (On-Site)
5 Months ago
The Walt Disney Company - Associate Manager/Senior Analyst, Consumer Insights

The Walt Disney Company

Hong Kong (On-Site)
5 Months ago
canva - Software Engineer (Python) - Data Platform (Open to remote across ANZ)

canva

Sydney, New South Wales, Australia (Remote)
5 Months ago
Trend Micro - Senior Software Development Engineer

Trend Micro

Manila, Metro Manila, Philippines (Hybrid)
6 Months ago
Egnyte - Site Reliability Engineer

Egnyte

India (Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in New York, New York, United States

Nisum - Senior Java Developer - N4417

Nisum

San Francisco, California, United States (On-Site)
6 Months ago
Hoyoverse - Operations Specialist (Procurement)

Hoyoverse

Santa Monica, California, United States (On-Site)
11 Months ago
MatchGroup - Data Scientist I (Revenue)

MatchGroup

San Francisco, California, United States (Hybrid)
6 Months ago
Joyride Games - VP Marketing

Joyride Games

Austin, Texas, United States (Remote)
1 Year ago
Fabric - Principal Design Verification Engineer, CPU

Fabric

New York, New York, United States (On-Site)
6 Months ago
HP - Print Security & Manageability Product Manager

HP

Boise, Idaho, United States (On-Site)
6 Months ago
Google - Staff Software Engineer, Infrastructure, Google Cloud AI

Google

Kirkland, Washington, United States (On-Site)
5 Months ago
inveniolsi - SAP Finance Delivery Practice Lead (US Citizen or Perm Res req)

inveniolsi

United States (On-Site)
5 Months ago
Rocket - Senior Customer Solutions Engineer (IBM Infoprint Server)

Rocket

United States (Remote)
6 Years ago
Regent Craft - Modeling & Simulation Intern

Regent Craft

North Kingstown, Rhode Island, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

OUTFIT7 - Global Head of Brand & Franchise Management

OUTFIT7

Limassol, Limassol, Cyprus (On-Site)
6 Months ago
meetelise - Data Analyst

meetelise

New York, New York, United States (On-Site)
6 Months ago
Nielsen - Data & Reporting Analyst with German

Nielsen

Warsaw, Masovian Voivodeship, Poland (Remote)
6 Months ago
DraftKings - Lead Data Science Engineer, Personalization and Search

DraftKings

Boston, Massachusetts, United States (On-Site)
7 Months ago
Barbaricum - Data Engineer

Barbaricum

Omaha, Nebraska, United States (Hybrid)
6 Months ago
PwC - Manager-Data Engineer|Pune

PwC

Pune, Maharashtra, India (On-Site)
6 Months ago
Trendyol - Trendyol GO - Growth Analytics Leaders

Trendyol

İstanbul, İstanbul, Türkiye (Hybrid)
6 Months ago
HHA Exchange - Data Architect

HHA Exchange

New York, New York, United States (Remote)
6 Months ago
undefined - Senior Data Engineer

Amsterdam, North Holland, Netherlands (On-Site)
6 Months ago
The Walt Disney Company - Specialist/ Associate Specialist, Sales Analysis and Planning

The Walt Disney Company

Hong Kong (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded