Member of Technical Staff, Multimedia

10 Minutes ago • 3-5 Years
Technical Art

Job Description

Fireworks AI is seeking highly motivated engineers and researchers for the Multimedia team as Members of Technical Staff. This role involves advancing Fireworks AI’s capabilities in speech, vision, and multimodal systems, from model deployment and training to building real-time, production-ready AI infrastructure. Candidates will work at the intersection of research and engineering, transforming cutting-edge model innovation into high-performance, scalable systems for next-generation products.
Good To Have:
  • Master’s or PhD in a relevant technical field with research experience in speech, vision, or multimodal modeling.
  • Experience deploying and optimizing ML models in production, including distributed training, fine-tuning, or inference optimization.
  • Familiarity with model optimization techniques such as quantization, speculative decoding, or parameter-efficient fine-tuning.
  • Background in multimodal AI infrastructure or experience at a hyperscaler, AI infrastructure startup, or LLM platform.
  • Strong understanding of GPU performance, networking, and scaling for multimedia workloads.
  • Contributions to open-source projects or published first-author research papers in top-tier conferences.
  • Ability to thrive in a fast-paced, low-process environment and drive high-impact work.
Must Have:
  • Design, train, and implement machine learning models for speech, vision, or multimodal applications.
  • Bring new model capabilities from research to production, ensuring high performance and reliability.
  • Build and optimize infrastructure supporting distributed training, fine-tuning, and real-time inference.
  • Profile and address performance bottlenecks across the stack.
  • Write high-quality, maintainable code for experimentation and production systems.
  • Evaluate and integrate the latest research to improve model performance, scalability, and efficiency.
  • Work directly with internal and external stakeholders to define use cases and inform the multimedia roadmap.
  • Contribute to open-source efforts and help shape the future of multimodal AI.
  • Bachelor’s degree in Computer Science, Electrical Engineering, or a related technical field.
  • 3–5+ years of experience in machine learning, backend infrastructure, or data-intensive systems.
  • Strong proficiency in Python and familiarity with deep learning frameworks such as PyTorch or TensorFlow.
  • Demonstrated experience in speech/audio modeling, vision/vision-language modeling, or backend/infrastructure engineering for ML workloads.
  • Experience building production-quality systems and collaborating across research and engineering teams.
  • Familiarity with cloud platforms (AWS, GCP, or Azure) and containerization/orchestration tools (Docker, Kubernetes).
Perks:
  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure.
  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI.
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

About Us:

At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI.

The Role:

We are looking for highly motivated engineers and researchers to join our Multimedia team as Members of Technical Staff. In this role, you will help advance Fireworks AI’s capabilities across speech, vision, and multimodal systems, from deploying and training models to building the infrastructure that powers real-time, production-ready AI experiences.

We welcome both generalists with broad multimedia expertise and specialists with deeper focus in speech/audio or vision-language modeling. You will work at the intersection of research and engineering, turning cutting-edge model innovation into high-performance, scalable systems that power Fireworks AI’s next-generation products.

Key Responsibilities:

  • Design, train, and implement machine learning models for speech, vision, or multimodal applications, including ASR, TTS, image understanding, captioning, retrieval, and speech-to-speech systems
  • Bring new model capabilities from research to production, ensuring high performance and reliability
  • Build and optimize the infrastructure that supports distributed training, fine-tuning, and real-time inference across multimedia workloads
  • Profile and address performance bottlenecks across the stack, from preprocessing and model training to deployment and serving
  • Write high-quality, maintainable code for both experimentation and production systems
  • Evaluate and integrate the latest research to improve model performance, scalability, and efficiency
  • Work directly with internal and external stakeholders to define use cases, deliver custom optimizations, and inform the multimedia roadmap
  • Contribute to open-source efforts and help shape the future of multimodal AI at Fireworks

Minimum Qualifications:

  • Bachelor’s degree in Computer Science, Electrical Engineering, or a related technical field
  • 3–5+ years of experience in machine learning, backend infrastructure, or data-intensive systems
  • Strong proficiency in Python and familiarity with deep learning frameworks such as PyTorch or TensorFlow
  • Demonstrated experience in one or more of the following areas:
      • Speech or audio modeling (ASR, TTS, speech-to-speech)
      • Vision or vision-language modeling (captioning, VQA, retrieval, multimodal reasoning)
      • Backend or infrastructure engineering for ML workloads (training, inference, optimization, APIs)
  • Experience building production-quality systems and collaborating across research and engineering teams
  • Familiarity with cloud platforms (AWS, GCP, or Azure) and containerization/orchestration tools (Docker, Kubernetes)

Preferred Qualifications:

  • Master’s or PhD in a relevant technical field with research experience in speech, vision, or multimodal modeling
  • Experience deploying and optimizing ML models in production, including distributed training, fine-tuning, or inference optimization
  • Familiarity with model optimization techniques such as quantization, speculative decoding, or parameter-efficient fine-tuning (LoRA or QLoRA)
  • Background in multimodal AI infrastructure or experience at a hyperscaler, AI infrastructure startup, or LLM platform
  • Strong understanding of GPU performance, networking, and scaling for multimedia workloads
  • Contributions to open-source projects or published first-author research papers in top-tier conferences such as NeurIPS, ICML, CVPR, ACL, or Interspeech
  • Ability to thrive in a fast-paced, low-process environment and drive high-impact, company-defining work

Why Fireworks AI?

  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.
