Machine Learning Engineer

6 Days ago • All levels • DevOps

Job Summary

Job Description

Hedra seeks an ML Engineer to manage and optimize their computational infrastructure for training and deploying machine learning models, particularly focusing on their 3DVAE and video diffusion models. Responsibilities include designing scalable computing solutions for large video datasets, managing cloud instances (AWS or Google Cloud), ensuring infrastructure handles resource-intensive tasks, monitoring system performance, collaborating with the team on computational needs, and facilitating seamless model deployment. The ideal candidate will have experience with high-performance computing, cloud platforms, containerization (Docker), orchestration (Kubeflow), distributed training, and scripting languages like Python or Bash. This role is crucial for supporting Hedra's machine learning efforts in video generation.
Must have:
  • Experience with cloud platforms (AWS, GCP)
  • Knowledge of Docker and Kubeflow
  • Distributed training expertise
  • Proficiency in Python/Bash
  • Scalable computing solutions design
Perks:
  • Competitive compensation and equity
  • 401k
  • Healthcare (Silver PPO Medical, Vision, Dental)
  • Lunch and snacks

Job Details

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

Note: At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content and build a generational company together. You should have start-up experience and be a self-starter that is driven to build impactful products that change the status quo. You must be willing to work in-person in either NYC or SF.

Overview:

We are looking for an ML Engineer with expertise in high-performance computing systems to manage and optimize our computational infrastructure for training and deploying our machine learning models. The ideal candidate will have experience with cloud computing platforms and tools for managing ML workloads at scale, supporting our 3DVAE and video diffusion models.

Responsibilities:

  • Design and implement scalable computing solutions for training and deploying ML models, ensuring infrastructure can handle large video datasets.

  • Manage and optimize the performance of our computing clusters or cloud instances, such as AWS or Google Cloud, to support distributed training.

  • Ensure that our infrastructure can handle the resource-intensive tasks associated with training large generative models.

  • Monitor system performance and implement improvements to maximize efficiency, using tools like Kubeflow for orchestration.

  • Collaborate with the team to understand their computational needs and provide appropriate solutions, facilitating seamless model deployment.

Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field, with a focus on system administration.

  • Experience with cloud computing platforms such as Amazon Web Services, Google Cloud, or Microsoft Azure, essential for managing large-scale ML workloads.

  • Knowledge of containerization tools like Dockerfile and orchestration tools like Kubeflow, crucial for deploying models at scale.

  • Understanding of distributed training techniques and how to scale models across multiple GPUs or machines, aligning with video generation needs.

  • Proficiency in scripting languages like Python or Bash for automation tasks, facilitating infrastructure management.

  • Strong problem-solving and communication skills, given the need to collaborate with diverse teams.

This role is vital for ensuring the computational backbone supports the company’s ML efforts, focusing on deployment and scalability.

Benefits:

  • Competitive compensation and equity

  • 401k (no match)

  • Healthcare (Silver PPO Medical, Vision, Dental)

  • Lunch and snacks at the office

We encourage you to apply even if you don't fully meet all the listed requirements; we value potential and diverse perspectives, and your unique skills could be a great asset to our team.

Similar Jobs

Kwalee - Senior IT Support Specialist

Kwalee

Royal Leamington Spa, England, United Kingdom (On-Site)
6 Days ago
Immutable - Enterprise Technology Engineer

Immutable

Sydney, New South Wales, Australia (Hybrid)
3 Months ago
Corsair - Automation Engineer

Corsair

Vietnam (On-Site)
1 Week ago
Dream Sports - SDE - 1 - DevOps

Dream Sports

Mumbai, Maharashtra, India (On-Site)
5 Months ago
Revolgy - Senior Cloud Operations Engineer

Revolgy

United Kingdom (Remote)
6 Days ago
Tencent - Senior Site Reliability Engineer

Tencent

Shanghai, Shanghai, China (On-Site)
6 Months ago
Omnissa - Staff Engineer (C++,MacOS Internals)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Axinous - Staff Software Development Engineer

Axinous

Netherlands (Remote)
1 Week ago
PwC - IN_Associate_Azure Cloud Data Engineer_OneCloud _Advisory _Bangalore

PwC

Gurugram, Haryana, India (On-Site)
4 Months ago
Rackspace Technology - Data Architect

Rackspace Technology

Vietnam (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

The Walt Disney Company - Sr Streaming Media Engineer

The Walt Disney Company

New York, New York, United States (On-Site)
1 Month ago
ION - IT/Cyber Security Analyst

ION

London, England, United Kingdom (On-Site)
5 Months ago
NVIDIA - Clock Design Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
ION - Backup System Engineer, Italy

ION

Italy (Hybrid)
5 Months ago
Rivos - SOC Static Timing Analysis Engineer - Full Time

Rivos

Hsinchu, Hsinchu City, Taiwan (On-Site)
5 Months ago
Next Level Business Services - JAvA Full Stack Developer

Next Level Business Services

New York, New York, United States (On-Site)
5 Months ago
DOTSOFT SA - Security Engineer

DOTSOFT SA

Greece (On-Site)
1 Week ago
Wizcorp - Game Server Programmer

Wizcorp

Tokyo, Japan (Remote)
2 Weeks ago
PlayStation Global - Site Reliability Engineer

PlayStation Global

Adelaide, South Australia, Australia (On-Site)
1 Month ago
Argus Labs - Site Reliability Engineer

Argus Labs

Calgary, Alberta, Canada (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Actalent - Entry Level Designer

Actalent

North Wilkesboro, North Carolina, United States (On-Site)
9 Months ago
Bitwise Alchemy - Senior Software Engineer

Bitwise Alchemy

Texas, United States (Remote)
8 Months ago
AVER LLC - Senior Latent Print Examiner

AVER LLC

United States (On-Site)
5 Months ago
GoMotive - Director of Product Management, AI

GoMotive

United States (Remote)
1 Month ago
Trek - Seasonal Sales Associate - Part Time

Trek

Newport News, Virginia, United States (On-Site)
1 Month ago
Scientific Games  - Category Manager - Telecom and IT Hardware & Software

Scientific Games

Alpharetta, Georgia, United States (Hybrid)
1 Month ago
NVIDIA - Senior Design for Debug Architect and Methodology Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
The Walt Disney Company - Associate, TWDC Store (Part-Time)

The Walt Disney Company

New York, New York, United States (On-Site)
1 Week ago
Onward Search - Inside Sales Representative (Real Estate)

Onward Search

Greensboro, North Carolina, United States (On-Site)
4 Months ago
Onward Search - Software Engineer

Onward Search

Los Angeles, California, United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Inworld AI - Staff Platform Engineer - USA

Inworld AI

Mountain View, California, United States (On-Site)
4 Months ago
IO Interactive - Senior Build Engineer

IO Interactive

Copenhagen, Denmark (Hybrid)
1 Month ago
Playtech - DevOps Engineer

Playtech

Kyiv, Kyiv City, Ukraine (On-Site)
2 Weeks ago
Rackspace Technology - DEVOP Engineer (AWS Terraform)-PSDE III

Rackspace Technology

India (Remote)
4 Months ago
PwC - Senior Associate_Azure Data Engineer_Data & Analytics_Advisory_PAN  India

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Sandsoft Games - DevOps & Automation Engineer

Sandsoft Games

Riyadh, Riyadh Province, Saudi Arabia (Hybrid)
6 Days ago
Hitachi - Kubernetes Engineer

Hitachi

Pune, Maharashtra, India (On-Site)
5 Months ago
SmileGate - Platform Engineering Lead

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
1 Month ago
NVIDIA - Senior Software Engineer – AI Infrastructure and Tooling

NVIDIA

California, United States (Remote)
4 Days ago

Get notifed when new similar jobs are uploaded

About The Company

We are a creation lab building foundation models into products that power the next generation of human storytelling

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Hedra

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug