Software Engineer, Multimedia

1 Month ago • 3 Years + • Software Development & Engineering • $170,000 PA - $240,000 PA

Job Summary

Job Description

Fireworks AI is building the future of generative AI infrastructure, offering the fastest and most scalable inference platform. Funded by top investors, the team comprises veterans from PyTorch and Google Vertex AI. This role seeks a Backend Infrastructure Engineer to accelerate multimedia AI capabilities, focusing on building and optimizing infrastructure for state-of-the-art multimodal AI, including VLMs and speech AI models. The goal is to achieve industry-leading latency and throughput, develop features like VLM fine-tuning, and enable models on the latest hardware, driving ARR growth in the multimedia AI space.

Job Details

About Us:

Here at Fireworks, we’re building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We’ve been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we’re an ambitious, fun team composed primarily of veterans from PyTorch and Google Vertex AI.

The Role:

We're looking for a strong Backend Infrastructure Engineer to help accelerate our multimedia AI capabilities. You'll build and optimize the infrastructure powering state-of-the-art multimodal AI, including vision-language models (VLMs) and speech AI models. You'll focus on achieving industry-leading latency and throughput across diverse multimedia workloads. You'll develop infrastructure for features like VLM fine-tuning, real-time voice processing pipelines, and model enablement on the latest hardware. You'll be instrumental in helping us capture significant ARR growth in the multimedia AI space while ensuring we deliver the fastest, most reliable multimodal platform in the market.

Key Responsibilities:

  • Collaborate with ML engineers and researchers to productionize models and support evolving multimedia capabilities.
  • Identify, profile, and address performance bottlenecks across the stack, from media preprocessing to vision/audio encoders to the core inference engine.
  • Ensure high reliability, observability, and security across backend systems.
  • Own the enablement and optimization of new model releases, ensuring we consistently deliver the fastest implementations in the market.
  • Build and maintain performant APIs and services.
  • Collaborate closely with customers and sales teams to implement custom features and optimizations that drive ARR growth.
  • Propose new roadmap items based on customer needs.

Minimum Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 3+ years of experience as a backend or infrastructure engineer, ideally supporting ML/AI systems or data-intensive workloads.
  • Experience with PyTorch and deep learning frameworks for inference and training.
  • Strong programming skills in Python and/or Go, with a track record of building reliable distributed backend systems.
  • Experience with cloud platforms (e.g., AWS, GCP), infrastructure-as-code tools (e.g., Terraform), and containerization/orchestration tools (e.g., Docker, Kubernetes).

Preferred Qualifications:

  • Experience supporting ML workloads in production (model fine-tuning, distributed training, inference optimization).
  • Experience working directly with LLMs, vision-language models, audio models (ASR, TTS), or other multimodal AI systems in production environments.
  • Experience with performance optimization and profiling for high-throughput systems.
  • Knowledge of model quantization, speculative decoding, or other ML optimization techniques.

Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.

Base Pay Range (Plus Equity)

$170,000 - $240,000 USD

Why Fireworks AI?

  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.

