AI/ML Inference Engineer

3 Months ago • All levels • Research Development

Job Summary

Job Description

Krea is seeking a Machine Learning Engineer to optimize AI model inference and training. The role involves collaborating with AI research and infrastructure teams for seamless integration of optimizations. Responsibilities include writing custom CUDA Kernels for multi-node inference acceleration on image and video models, implementing caching and dynamic compilation techniques to optimize AI model loading/unloading, and improving the speed and efficiency of training runs across GPU clusters. The ideal candidate will have proficiency in CUDA or parallel programming, Python/C++ experience, and expertise in optimizing diffusion/transformer models.
Must have:
  • Proficiency in CUDA or parallel programming
  • Python/C++ programming experience
  • Experience in optimizing diffusion/transformer models
  • High agency and resourcefulness
Perks:
  • Sponsorship for international candidates (STEM OPT, OPT, H1B, O1, E3)
  • Work with world-class AI developers
  • Significant impact on market presence and growth
  • Competitive compensation (75th percentile of market rates) with significant equity upside

Job Details

About Krea:

At Krea, we're dedicated to making AI intuitive and controllable for creatives. Our mission is to build tools that empower human creativity, not replace it. We believe AI is a new medium that allows us to express ourselves through various formats—text, images, video, sound, and even 3D. We're building better, smarter, and more controllable tools to harness this medium.

We’re backed by Bain Capital Ventures, A16Z, Abstract Ventures, Pebblebed and many others. If you're passionate about pushing the boundaries of AI and empowering human creativity, we'd love to hear from you.

We're looking for a Machine Learning Engineer to help us optimize the inference and training of our AI models.​ You will collaborate closely with our AI research and infrastructure teams to integrate optimizations seamlessly.​

Our culture:

  • We work full-time and in-person at our waterfront office in San Francisco.

  • We believe that demonstrated interest in the creative space is key: our team includes musicians, designers, visual artists and more.

What you'll do:

  • Write custom CUDA Kernels to speed up multi-node inference on image and video models.

  • Work on various caching and dynamic compilation techniques to optimize the loading and unloading of the variety of AI models we serve at Krea.

  • Speed up and efficiency of training runs across our GPU clusters.​

We'd like you to have:

  • Proficiency in CUDA or parallel programming.

  • Python/C++ programming experience.

  • Experience in optimizing diffusion/transformer models for performance and scalability.​

  • High agency and resourcefulness.

What we offer:

  • Openness to sponsoring International candidates (e.g STEM OPT, OPT, H1B, O1, E3)

  • Work alongside a world class developing the future of AI tooling

  • Significant impact on Krea’s market presence and growth

  • Competitive compensation (75% percentile of market rates) with significant equity upside

Similar Jobs

Test Tropic - Quality Assurance Manager

Test Tropic

Barbados (On-Site)
1 Year ago
Electronic Arts - Frostbite Architecture: Software Engineer

Electronic Arts

Stockholm, Stockholm County, Sweden (Hybrid)
2 Months ago
Epic Games - Senior Engine Programmer, Fortnite Tech

Epic Games

Vancouver, British Columbia, Canada (On-Site)
4 Months ago
Ansys - Senior R&D Engineer

Ansys

Austin, Texas, United States (On-Site)
1 Month ago
Windranger - Protocol Engineer

Windranger

Central Sulawesi, Indonesia (Remote)
4 Months ago
PayPal - Senior Staff Machine Learning Engineer, AI

PayPal

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Apple - Machine Learning Manager - Apple Ads

Apple

Cupertino, California, United States (On-Site)
2 Months ago
Riot Games - Senior Researcher, Wild Rift

Riot Games

Shanghai, Shanghai, China (On-Site)
4 Months ago
Ansys - Senior R&D Engineer - Meshing/Geometry

Ansys

Otterfing, Bavaria, Germany (Hybrid)
3 Months ago
attentive - Staff Machine Learning Engineer

attentive

San Francisco, California, United States (Hybrid)
10 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Activision - Rigger

Activision

Shanghai, China (On-Site)
2 Months ago
Rockstar Games - Graphics Programmer

Rockstar Games

Oakville, Ontario, Canada (On-Site)
4 Months ago
Luxoft - Regular Android HMI Architect

Luxoft

Cairo, Cairo Governorate, Egypt (On-Site)
8 Months ago
Epic Games - Senior Engineer, Patching

Epic Games

Cary, North Carolina, United States (On-Site)
7 Months ago
Eqvilent - FPGA Engineer

Eqvilent

(Remote)
3 Months ago
playrix  - Technical Director (Game Project)

playrix

Cyprus (Remote)
9 Months ago
endava - Solution Architect - Payments

endava

Sydney, New South Wales, Australia (On-Site)
1 Month ago
Mozilla - Senior Software Engineer, Mozilla VPN

Mozilla

France (Remote)
3 Months ago
NXP - 2025 Intern - Product/Test Development Engineer

NXP

Tianjin, Tianjin, China (On-Site)
1 Year ago
Toast - Staff Software Engineer, Android OS

Toast

United States (Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Jane Street - Recruiting Coordinator

Jane Street

New York, United States (On-Site)
3 Months ago
Fox Factory - Project Specialist

Fox Factory

Trussville, Alabama, United States (On-Site)
1 Month ago
EMA - Account Executive - Healthcare

EMA

United States (Hybrid)
2 Months ago
Trek - Sales Associate - Part Time

Trek

Fairfield, Connecticut, United States (On-Site)
6 Months ago
Patreon - Business Operations and Strategy Lead

Patreon

New York, United States (Hybrid)
2 Months ago
HappyRobot - Forward Deployed Engineer

HappyRobot

Chicago, Illinois, United States (Hybrid)
2 Months ago
Ion - Senior Associate

Ion

New York, United States (On-Site)
4 Months ago
Shield AI - Prototype & New Product Launch Manager

Shield AI

Dallas, Texas, United States (On-Site)
3 Weeks ago
Open Systems Technologies - Restaurant Manager

Open Systems Technologies

Hurst, Texas, United States (On-Site)
1 Month ago
Internet Brands - Digital Marketing Solutions Consultant

Internet Brands

United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

Canva - Senior Machine Learning Engineer, Content Management & Distribution, Teams & Education

Canva

Auckland, Auckland, New Zealand (Remote)
1 Month ago
BioFire - Disease State Scientist

BioFire

Durham, North Carolina, United States (On-Site)
2 Months ago
Match Group - Senior Machine Learning Engineer, Dating Outcomes

Match Group

New York, New York, United States (Hybrid)
3 Months ago
DevRev - Applied AI Engineer

DevRev

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
ElevenLabs - GTM AI Automations Engineer

ElevenLabs

United Kingdom (Remote)
3 Months ago
cirrus logic - Mixed-Signal CAD/Design Engineer – AI-Driven EDA CAD Development

cirrus logic

Austin, Texas, United States (Hybrid)
2 Months ago
Ansys - Senior AI R&D Engineer

Ansys

Montigny-le-Bretonneux, Île-de-France, France (Remote)
1 Month ago
Apple - Machine Learning Engineer - Matching

Apple

Cupertino, California, United States (On-Site)
1 Month ago
Apple - Senior AI Application Engineer

Apple

Cupertino, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by krea.ai

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug