Machine Learning Engineer (CUDA)

2 Months ago • All levels • Artificial Intelligence

Job Summary

Job Description

Hedra seeks a CUDA ML Engineer to optimize machine learning models (3DVAE and video diffusion models) for GPU performance. Responsibilities include leveraging CUDA for efficient training and inference, developing efficient algorithms and data structures for GPU computation, collaborating with research and engineering teams, staying updated on GPU technology and optimization techniques, and ensuring model efficiency across various GPU architectures. The ideal candidate will have strong C++ and CUDA programming skills, experience with deep learning frameworks (PyTorch or TensorFlow), and a deep understanding of parallel computing and GPU architecture. They must also possess excellent problem-solving and debugging skills.
Must have:
  • C++ and CUDA programming
  • Experience with PyTorch or TensorFlow
  • Deep learning model optimization
  • Understanding of parallel computing
  • GPU architecture expertise
  • Problem-solving & debugging skills
Perks:
  • Competitive compensation and equity
  • 401k
  • Healthcare (Silver PPO Medical, Vision, Dental)
  • Lunch and snacks at the office

Job Details

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.

At the core of Hedra Studio is our Character-3 foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

Note: At Hedra, we’re a team of hard-working, passionate individuals seeking to fundamentally change content and build a generational company together. You should have start-up experience and be a self-starter that is driven to build impactful products that change the status quo. You must be willing to work in-person in either NYC or SF.

Overview:

We are seeking a talented CUDA ML Engineer to optimize our machine learning models for high-performance computing on GPU hardware. The ideal candidate will have expertise in CUDA programming and a deep understanding of how to leverage GPU acceleration to maximize the efficiency of our 3DVAE and video diffusion models.

Responsibilities:

  • Optimize machine learning models, specifically 3DVAE and video diffusion models, for GPU performance using CUDA, ensuring efficient training and inference.

  • Develop and implement efficient algorithms and data structures for GPU computation, addressing performance bottlenecks in video generation tasks.

  • Work closely with the research and engineering teams to understand model requirements and performance bottlenecks, facilitating collaboration.

  • Stay current with the latest advancements in GPU technology and machine learning optimization techniques.

  • Ensure that our models run efficiently on various GPU architectures, supporting scalability for large-scale training.

Qualifications:

  • Bachelor’s degree in Computer Science, Electrical Engineering, or a related field, with a focus on high-performance computing.

  • Strong programming skills in C++ and CUDA, essential for GPU optimization.

  • Experience with deep learning frameworks that support GPU acceleration, such as PyTorch or TensorFlow, crucial for model implementation.

  • Understanding of parallel computing concepts and GPU architecture, given the need to optimize for hardware constraints.

  • Familiarity with machine learning models, particularly generative models, to align optimizations with model needs.

  • Excellent problem-solving and debugging skills, necessary for addressing performance issues.

Benefits:

  • Competitive compensation and equity

  • 401k (no match)

  • Healthcare (Silver PPO Medical, Vision, Dental)

  • Lunch and snacks at the office

We encourage you to apply even if you don't fully meet all the listed requirements; we value potential and diverse perspectives, and your unique skills could be a great asset to our team.

Similar Jobs

bytedance - Research Scientist in Machine Learning for Science (AML - AI-for-Science) - 2024 Start (PhD)

bytedance

Seattle, Washington, United States (On-Site)
7 Months ago
Stylumia - Senior Machine Learning Engineer - Time Series & Computer Vision

Stylumia

Bengaluru, Karnataka, India (Hybrid)
9 Months ago
Illuminia - ML/Software Engineer 1 - MLOps

Illuminia

Singapore, Singapore (On-Site)
3 Weeks ago
ManyChat - Lead Machine Learning Scientist

ManyChat

Barcelona, Catalonia, Spain (Hybrid)
4 Days ago
Reddit - Staff Software Engineer, ML Understanding

Reddit

United Kingdom (Remote)
2 Weeks ago
Soul AI - Subject Matter Expert (AI Trainer)

Soul AI

Hyderabad, Telangana, India (On-Site)
8 Months ago
CharacterAI - Research Engineer, Post-Training

CharacterAI

New York, New York, United States (On-Site)
2 Months ago
Bragg - AI/ML Engineer

Bragg

Ljubljana, Ljubljana, Slovenia (Hybrid)
1 Month ago
Ion - AI Engineer - Graduate Development Program

Ion

Pisa, Tuscany, Italy (On-Site)
7 Months ago
bytedance - Student Researcher (Doubao (Seed) Foundation Model - Video Generation) - 2025 Start (PhD)

bytedance

Seattle, Washington, United States (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Zurora - AI/ML Engineer

Zurora

Chennai, Tamil Nadu, India (Hybrid)
3 Weeks ago
Motorola solutions - Senior Software Engineer - AI/Computer Vision (Camera Systems)

Motorola solutions

Toronto, Ontario, Canada (Hybrid)
2 Weeks ago
Cubic corporation - Innovation Intern

Cubic corporation

London, England, United Kingdom (On-Site)
1 Week ago
Trendyol - Data Science Professionals - Trendyol GO

Trendyol

İzmir, İzmir, Türkiye (Hybrid)
6 Months ago
FlawlessAi - Research Scientist Internship - Face Perecption

FlawlessAi

Santa Monica, California, United States (Hybrid)
1 Month ago
WebFX - Entry Level Software Engineer

WebFX

Harrisburg, Pennsylvania, United States (On-Site)
7 Months ago
flip fit - Senior Machine Learning Engineer - Machine Learning Infrastructure

flip fit

New York, United States (Remote)
1 Month ago
Krafton - Lead of Physical AI Agent, Research Scientist

Krafton

Seoul, South Korea (On-Site)
1 Month ago
Whatnot - Machine Learning Engineer

Whatnot

Los Angeles, California, United States (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Snloker AI - Data Annotator - STEM

Snloker AI

United States (Remote)
1 Month ago
UPF Industries  - Quality Control

UPF Industries

White Pigeon, Michigan, United States (On-Site)
3 Weeks ago
Dungarvin - Employment Support Specialist

Dungarvin

Apple Valley, Minnesota, United States (Hybrid)
2 Weeks ago
design works gaming - QA Technician - Game Tester

design works gaming

Scottsdale, Arizona, United States (Hybrid)
1 Month ago
Rockstar Games - Senior Data Engineer

Rockstar Games

New York, United States (On-Site)
2 Weeks ago
Betson Group - Customer Service & Protection Agent (NL/EN)

Betson Group

Malta, New York, United States (Hybrid)
1 Month ago
Palo Alto Networks - GTM Sales Finance Manager

Palo Alto Networks

Santa Clara, California, United States (On-Site)
4 Days ago
Crunchyroll - Senior Risk Analyst

Crunchyroll

Dallas, Texas, United States (Hybrid)
3 Weeks ago
Pattern - Financial Analyst - Pricing

Pattern

Lehi, Utah, United States (Hybrid)
4 Days ago
Notion - UX Writer

Notion

San Francisco, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Keywords Studios - Senior Research Associate - AI

Keywords Studios

California, United States (Remote)
2 Months ago
Microsoft - Member of Technical Staff, AI - Reinforcement Systems

Microsoft

London, England, United Kingdom (On-Site)
2 Months ago
NVIDIA - Senior AI-HPC Storage Engineer

NVIDIA

Austin, Texas, United States (On-Site)
3 Months ago
NVIDIA - AI Algorithms Software Engineer (RDSS Intern)

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
NVIDIA - Senior AI-HPC Cluster Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)
2 Months ago
Resemble AI - Deep Learning Speech Researcher

Resemble AI

Mountain View, California, United States (On-Site)
9 Months ago
Microsoft - Technical Support Engineer (Data and AI Intelligent Platform)

Microsoft

Selangor, Malaysia (Hybrid)
1 Month ago
NetEase Games - Game AI Research Leader

NetEase Games

Singapore (On-Site)
2 Months ago
Google - Senior Imaging and On-Device Machine Learning Software Engineer, Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
NVIDIA - Senior Applied LLM Engineer, AI – Chip Design

NVIDIA

Canada (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Hedra is a pioneering generative media company backed by top investors at Index, A16Z, and Abstract Ventures. We're building Hedra Studio, a multimodal creation platform capable of control, emotion, and creative intelligence.At the core of Hedra Studio is ourCharacter-3foundation model, the first omnimodal model in production. Character-3 jointly reasons across image, text, and audio for more intelligent video generation — it’s the next evolution of AI-driven content creation.

San Francisco, California, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Hedra

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug