ML Engineer – Data Engineering

4 Months ago • All levels • Data Analysis

Job Summary

Job Description

Krea is seeking a Machine Learning Engineer with a focus on Data Engineering to manage and optimize extensive datasets of images and videos. The role involves ensuring efficient data access, storage, and versioning to support R&D teams. Responsibilities include implementing distributed storage solutions for seamless data access, developing strategies to reduce storage costs while maintaining availability, optimizing I/O operations for faster data retrieval, and building backend systems for dataset version management to ensure experiment reproducibility. This position offers the opportunity to work alongside world-class developers shaping the future of AI tooling and significantly impact Krea's market presence.
Must have:
  • Implement distributed storage solutions
  • Reduce data storage costs
  • Optimize data retrieval
  • Manage dataset versions
  • Python and C++ skills
  • Experience with K8s
Good to have:
  • Experience with distributed storage systems
  • Generalist back-end experience
  • Familiarity with ETL and infrastructure
Perks:
  • Sponsorship for international candidates
  • Work with world-class developers
  • Significant impact on growth
  • Competitive compensation
  • Significant equity upside

Job Details

About Krea

At Krea, we're dedicated to making AI intuitive and controllable for creatives. Our mission is to build tools that empower human creativity, not replace it. We believe AI is a new medium that allows us to express ourselves through various formats—text, images, video, sound, and even 3D. We're building better, smarter, and more controllable tools to harness this medium.

We’re backed by Bain Capital Ventures, A16Z, Abstract Ventures, Pebblebed and many others. If you're passionate about pushing the boundaries of AI and empowering human creativity, we'd love to hear from you.

We are seeking a Machine Learning Engineer with a focus on Data Engineering to manage and optimize our extensive datasets, comprising hundreds of millions of images and videos. This role is crucial in ensuring efficient data access, storage, and versioning to support our research and development teams.​ Your contributions will directly enhance the efficiency of our data handling processes, supporting cutting-edge research and development efforts.​

What you'll do:

  • Implement and maintain distributed storage solutions to provide seamless data access across all training machines.​

  • Develop strategies to reduce data storage costs while ensuring high availability and reliability.​

  • Optimize input/output operations to accelerate data retrieval and processing during training and inference phases.​

  • Build and manage backend systems to track and manage different versions of datasets, ensuring reproducibility and consistency in experiments.​

Our culture:

  • We work full-time and in-person at our waterfront office in San Francisco.

  • We believe that demonstrated interest in the creative space is key: our team includes musicians, designers, visual artists and more.

Example tacit skills we're looking for:

  • Experience with distributed storage systems including deployment, configuration, and optimization.​

  • Strong skills in Python and (ideally) C++ for developing data processing pipelines and integrating storage solutions.​

  • Experience in building and maintaining data pipelines capable of handling large-scale datasets efficiently.​

  • Experience with K8s.

  • Generalist back-end experience with familiarity in ETL and infrastructure.

What we offer:

  • Openness to sponsoring International candidates (e.g STEM OPT, OPT, H1B, O1, E3)

  • Work alongside a world class developing the future of AI tooling

  • Significant impact on Krea’s market presence and growth

  • Competitive compensation (75% percentile of market rates) with significant equity upside

Similar Jobs

Stake logic - Business Intelligence Analyst

Stake logic

Birkirkara, Malta (On-Site)
5 Months ago
Morning Star - Data Product Manager

Morning Star

Chicago, Illinois, United States (Hybrid)
1 Year ago
Univision - Senior Data Engineer

Univision

Bogota, Colombia (On-Site)
2 Months ago
Zinnia - Data Engineering, Manager

Zinnia

Bridgewater, New Jersey, United States (Hybrid)
2 Months ago
Qualcomm - GPU Performance Verification Engineer

Qualcomm

San Diego, California, United States (On-Site)
1 Month ago
ShyftLabs - Data Architect (Data Modernization)

ShyftLabs

Toronto, Ontario, Canada (Hybrid)
1 Month ago
Thumbtack - Senior Data Engineer

Thumbtack

Ontario, Canada (Remote)
2 Months ago
Rippling - Staff Backend Engineer, Data Bridge

Rippling

San Francisco, California, United States (On-Site)
7 Months ago
GoTo Group - Senior Data Scientist  (Singapore)

GoTo Group

Singapore (On-Site)
10 Months ago
OKX - Senior Business Intelligence Analyst, Growth

OKX

Singapore (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

FICO - Sales Development Representative

FICO

Italy (Remote)
2 Months ago
Moloco - Agency Senior Growth Manager

Moloco

London, England, United Kingdom (On-Site)
3 Months ago
supercell - Senior Data Analyst, Live and New Games

supercell

Helsinki, Uusimaa, Finland (On-Site)
3 Weeks ago
ISS Stoxx - Junior Data Analyst - Governance Data (Open to New Graduates)

ISS Stoxx

Makati City, Metro Manila, Philippines (Hybrid)
1 Month ago
The New York Times - Staff Editor, Opinion Audience

The New York Times

New York, New York, United States (Hybrid)
3 Months ago
Gupta Media - Data Analyst

Gupta Media

Boston, Massachusetts, United States (On-Site)
3 Months ago
Aledade - Senior BI Analyst I

Aledade

Bethesda, Maryland, United States (Remote)
2 Weeks ago
PwC - Senior Financial Data Analyst  | Deals (M&A) | Lyon | CDI | H/F

PwC

Lyon, Auvergne-Rhône-Alpes, France (On-Site)
10 Months ago
Capgemini - Data Business Analyst

Capgemini

Pune, Maharashtra, India (On-Site)
2 Months ago
Next Level Business Services - Cassandra Admin

Next Level Business Services

Bentonville, Arkansas, United States (On-Site)
10 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Bazaar Voice - Field Marketing Manager

Bazaar Voice

Austin, New York, United States (Hybrid)
3 Weeks ago
Patreon - Intelligence & Investigation Analyst

Patreon

California, United States (Hybrid)
4 Months ago
Sagecor - Systems Administrator IV

Sagecor

Annapolis Junction, Maryland, United States (On-Site)
2 Years ago
Google - Distinguished Engineer, Demand and Capacity Planning

Google

Seattle, Washington, United States (On-Site)
4 Months ago
hogarth - Sr. Commercial Financial Analyst

hogarth

New York, United States (Hybrid)
2 Weeks ago
Apple - Machine Learning Engineer - Semantics, Apple Ads

Apple

Austin, Texas, United States (On-Site)
1 Month ago
Next Level Business Services - SAP-MII Technology Lead

Next Level Business Services

Toledo, Ohio, United States (On-Site)
9 Months ago
Sabre India - Principal Category Management

Sabre India

Dallas, Texas, United States (Hybrid)
2 Months ago
Univision - Editor & Photographer

Univision

Miami, Florida, United States (On-Site)
1 Month ago
design works gaming - Business Development Associate - B2B Gaming

design works gaming

Scottsdale, Arizona, United States (Hybrid)
4 Weeks ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Apple - Camera Data Engineer

Apple

San Diego, California, United States (On-Site)
1 Month ago
Tencent - Workday Business Analyst - HCM

Tencent

California, United States (On-Site)
5 Months ago
Discord - Senior Data Scientist, Analytics

Discord

California, United States (On-Site)
3 Weeks ago
Pinterest - Senior Data Analyst

Pinterest

Chicago, Illinois, United States (Hybrid)
1 Month ago
Glean - Senior Data Scientist

Glean

Palo Alto, California, United States (Hybrid)
1 Month ago
Publicis Groupe - Publicis Media - Dual Study Program in Business Informatics Data Science & AI (m/f/d) - 2026

Publicis Groupe

Düsseldorf, North Rhine-Westphalia, Germany (Hybrid)
4 Weeks ago
Alphawave Semi - Senior Production and NPI Planner (Data Analytics Focus)

Alphawave Semi

Hsinchu County, Taiwan (Hybrid)
2 Months ago
Apple - Data Engineer - Business Process Re-Engineering

Apple

Austin, Texas, United States (On-Site)
2 Months ago
GoTo Group - Data Science Lead - AI

GoTo Group

Singapore (Hybrid)
1 Month ago
IMC - Data Engineer - Digital Assets

IMC

Zug, Zug, Switzerland (Hybrid)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by krea.ai

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug