Machine Learning Engineer

12 Hours ago • All levels • DevOps

Job Summary

Job Description

Hedra seeks an ML Engineer to manage and optimize their computational infrastructure for training and deploying machine learning models, particularly focusing on their 3DVAE and video diffusion models. Responsibilities include designing scalable computing solutions for large video datasets, managing cloud instances (AWS or Google Cloud), ensuring infrastructure handles resource-intensive tasks, monitoring system performance, collaborating with the team on computational needs, and facilitating seamless model deployment. The ideal candidate will have experience with high-performance computing, cloud platforms, containerization (Docker), orchestration (Kubeflow), distributed training, and scripting languages like Python or Bash. This role is crucial for supporting Hedra's machine learning efforts in video generation.
Must have:
  • Experience with cloud platforms (AWS, GCP)
  • Knowledge of Docker and Kubeflow
  • Distributed training expertise
  • Proficiency in Python/Bash
  • Scalable computing solutions design
Perks:
  • Competitive compensation and equity
  • 401k
  • Healthcare (Silver PPO Medical, Vision, Dental)
  • Lunch and snacks

Job Details

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

Note: At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content and build a generational company together. You should have start-up experience and be a self-starter who is driven to build impactful products that change the status quo. You must be willing to work in person in either NYC or SF.

Overview:

We are looking for an ML Engineer with expertise in high-performance computing systems to manage and optimize our computational infrastructure for training and deploying our machine learning models. The ideal candidate will have experience with cloud computing platforms and tools for managing ML workloads at scale, supporting our 3DVAE and video diffusion models.

Responsibilities:

  • Design and implement scalable computing solutions for training and deploying ML models, ensuring infrastructure can handle large video datasets.

  • Manage and optimize the performance of our computing clusters or cloud instances, such as AWS or Google Cloud, to support distributed training.

  • Ensure that our infrastructure can handle the resource-intensive tasks associated with training large generative models.

  • Monitor system performance and implement improvements to maximize efficiency, using tools like Kubeflow for orchestration.

  • Collaborate with the team to understand their computational needs and provide appropriate solutions, facilitating seamless model deployment.

Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, with a focus on system administration.

  • Experience with cloud computing platforms such as Amazon Web Services, Google Cloud, or Microsoft Azure, essential for managing large-scale ML workloads.

  • Knowledge of containerization tools like Docker and orchestration tools like Kubeflow, crucial for deploying models at scale.

  • Understanding of distributed training techniques and how to scale models across multiple GPUs or machines, aligning with video generation needs.

  • Proficiency in scripting languages like Python or Bash for automation tasks, facilitating infrastructure management.

  • Strong problem-solving and communication skills, given the need to collaborate with diverse teams.

This role is vital for ensuring the computational backbone supports the company’s ML efforts, focusing on deployment and scalability.

Benefits:

  • Competitive compensation and equity

  • 401k (no match)

  • Healthcare (Silver PPO Medical, Vision, Dental)

  • Lunch and snacks at the office

We encourage you to apply even if you don't fully meet all the listed requirements; we value potential and diverse perspectives, and your unique skills could be a great asset to our team.

About The Company

We are a creation lab building foundation models into products that power the next generation of human storytelling.

San Francisco, California, United States (On-Site)
