Machine Learning Engineer (CUDA)

1 Month ago • All levels • Artificial Intelligence

Job Summary

Job Description

Hedra seeks a CUDA ML Engineer to optimize machine learning models (3DVAE and video diffusion models) for GPU performance. Responsibilities include leveraging CUDA for efficient training and inference, developing efficient algorithms and data structures for GPU computation, collaborating with research and engineering teams, staying updated on GPU technology and optimization techniques, and ensuring model efficiency across various GPU architectures. The ideal candidate will have strong C++ and CUDA programming skills, experience with deep learning frameworks (PyTorch or TensorFlow), and a deep understanding of parallel computing and GPU architecture. They must also possess excellent problem-solving and debugging skills.
Must have:
  • C++ and CUDA programming
  • Experience with PyTorch or TensorFlow
  • Deep learning model optimization
  • Understanding of parallel computing
  • GPU architecture expertise
  • Problem-solving & debugging skills
Perks:
  • Competitive compensation and equity
  • 401k
  • Healthcare (Silver PPO Medical, Vision, Dental)
  • Lunch and snacks at the office

Job Details

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

Note: At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content and build a generational company together. You should have start-up experience and be a self-starter that is driven to build impactful products that change the status quo. You must be willing to work in-person in either NYC or SF.

Overview:

We are seeking a talented CUDA ML Engineer to optimize our machine learning models for high-performance computing on GPU hardware. The ideal candidate will have expertise in CUDA programming and a deep understanding of how to leverage GPU acceleration to maximize the efficiency of our 3DVAE and video diffusion models.

Responsibilities:

  • Optimize machine learning models, specifically 3DVAE and video diffusion models, for GPU performance using CUDA, ensuring efficient training and inference.

  • Develop and implement efficient algorithms and data structures for GPU computation, addressing performance bottlenecks in video generation tasks.

  • Work closely with the research and engineering teams to understand model requirements and performance bottlenecks, facilitating collaboration.

  • Stay current with the latest advancements in GPU technology and machine learning optimization techniques.

  • Ensure that our models run efficiently on various GPU architectures, supporting scalability for large-scale training.

Qualifications:

  • Bachelor’s degree in Computer Science, Electrical Engineering, or a related field, with a focus on high-performance computing.

  • Strong programming skills in C++ and CUDA, essential for GPU optimization.

  • Experience with deep learning frameworks that support GPU acceleration, such as PyTorch or TensorFlow, crucial for model implementation.

  • Understanding of parallel computing concepts and GPU architecture, given the need to optimize for hardware constraints.

  • Familiarity with machine learning models, particularly generative models, to align optimizations with model needs.

  • Excellent problem-solving and debugging skills, necessary for addressing performance issues.

Benefits:

  • Competitive compensation and equity

  • 401k (no match)

  • Healthcare (Silver PPO Medical, Vision, Dental)

  • Lunch and snacks at the office

We encourage you to apply even if you don't fully meet all the listed requirements; we value potential and diverse perspectives, and your unique skills could be a great asset to our team.

Similar Jobs

Axon - Machine Learning Engineer II

Axon

(Remote)
7 Hours ago
ByteDance - Research Scientist, Foundation Model, Speech & Audio

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
PlayStation Global - Machine Learning Engineer for Game Technology

PlayStation Global

London, England, United Kingdom (On-Site)
9 Months ago
Google - Research Scientist, Ads QUEST

Google

Los Angeles, California, United States (On-Site)
2 Days ago
Genies - Lead Machine Learning Engineer, 3D Gen AI & Graphics

Genies

San Mateo, California, United States (On-Site)
1 Month ago
Google - Group Product Manager, Google Cloud Storage

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago
NVIDIA - Solution Architect - Auto

NVIDIA

Beijing, Beijing, China (On-Site)
3 Months ago
ByteDance - Research Scientist, Vision Foundation Model

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Google - Staff Software Engineer, AI/ML

Google

Sunnyvale, California, United States (On-Site)
2 Days ago
Interface AI - Senior Vice President of Engineering

Interface AI

United States (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

AI Dash - Staff AI QA Engineer

AI Dash

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
ByteDance - Research Scientist, Code Generation

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Western Digital - Data Scientist

Western Digital

Prachin Buri, Thailand (On-Site)
4 Weeks ago
Google - Cloud Engineer II, AI/ML, Professional Services

Google

Mexico City, Mexico City, Mexico (On-Site)
1 Week ago
ByteDance - Architect - AML Engine

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Machine Learning System) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
GameJobs - Machine Learning Security Intern

GameJobs

Los Angeles, California, United States (On-Site)
1 Day ago
Every matrix - Experienced CRM Data Scientist

Every matrix

United Kingdom (Hybrid)
6 Months ago
Dolby Laboratories - AIOps Research Scientist

Dolby Laboratories

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Meta - Research Scientist Intern, Machine Perception for Input and Interaction (PhD)

Meta

Sausalito, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Instawork - GTM Strategy & Analytics Manager

Instawork

Chicago, Illinois, United States (Hybrid)
23 Hours ago
Fluence - Systems Engineer, Product Verification & Validation

Fluence

Houston, Texas, United States (Hybrid)
6 Months ago
Google - Strategic Program Delivery Lead, Content and AI

Google

Austin, Texas, United States (On-Site)
1 Week ago
Nium - Sr Software Development Engineer - Backend

Nium

San Francisco, California, United States (Hybrid)
1 Day ago
Hawk Eye Innovations - NBA Technical Operations Senior Coordinator - Officiating

Hawk Eye Innovations

Atlanta, Georgia, United States (Hybrid)
2 Weeks ago
ByteDance - Backend Software Engineer - Global E-Commerce Warehousing

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Daybreak Game Company LLC - Senior Manager, People & Culture

Daybreak Game Company LLC

San Diego, California, United States (Remote)
5 Months ago
The Walt Disney Company - Member Education & Development Advocate II

The Walt Disney Company

Burbank, California, United States (On-Site)
1 Month ago
Snloker AI - Applied AI Engineer (Federal)

Snloker AI

Washington, District Of Columbia, United States (On-Site)
1 Day ago
Meta - Software Engineer, Infrastructure

Meta

Austin, Texas, United States (Remote)
5 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Meta - Software Engineer, Machine Learning

Meta

United States (Remote)
2 Weeks ago
NVIDIA - Senior Software Engineer, AI Resiliency

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Trend Micro - Large Language Models (LLM) Expert (VicOne_Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago
Google - Lead Group Product Manager, Developer AI, Core

Google

San Francisco, California, United States (On-Site)
2 Days ago
NVIDIA - AI Algorithms Software Engineer (RDSS Intern)

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Zoox - Senior/Staff Software Engineer - Simulator

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
Google - Software Engineer III, Knowledge and Information

Google

Zürich, Zurich, Switzerland (On-Site)
2 Weeks ago
Meta - Software Engineer, Machine Learning

Meta

Fremont, California, United States (Remote)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

We are a creation lab building foundation models into products that power the next generation of human storytelling

San Francisco, California, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Hedra

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug