Here at Fireworks, we’re building the future of generative AI infrastructure. Fireworks offers the generative AI platform with the highest-quality models and the fastest, most scalable inference. We’ve been independently benchmarked to have the fastest LLM inference and have been getting great traction with innovative research projects, like our own function calling and multi-modal models. Fireworks is funded by top investors, like Benchmark and Sequoia, and we’re an ambitious, fun team composed primarily of veterans from Pytorch and Google Vertex AI.
As a Research Scientist focused on Reinforcement Learning (RL), you’ll apply your deep expertise in the field to push the boundaries of how large language models are trained, aligned, and deployed. We’re looking for someone with a strong foundation in RL - not just familiarity, but hands-on experience designing algorithms, building training pipelines, and running experiments.
You’ll work on everything from scalable RLHF alternatives (e.g., GRPO, DPO) to reward modeling and agent-based training. Your contributions will directly impact Fireworks’ model quality, training workflows, and customer-facing APIs. You’ll also collaborate with researchers, engineers, and product teams to translate state-of-the-art RL into practical systems used by companies deploying LLMs at scale.
Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.
Base Pay Range (Plus Equity)
$250,000 - $290,000 USD
Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.