AI Infrastructure Product Manager

Fireworks AI

Job Summary

Fireworks is seeking a deeply technical, systems-minded Product Manager to lead product strategy for its core AI infrastructure. This role involves building the most performant, reliable, and scalable GPU inference platform, spanning multi-region deployments, autoscaling, new inference verticals, and user-facing packaging. The PM will partner with customers to understand workload patterns, reliability needs, and performance requirements, translating these insights into an infrastructure roadmap with engineering and field teams. The ideal candidate will have a strong technical background and familiarity with LLMs.

Must Have

  • Build product strategy for cloud infrastructure teams based on customer and market needs
  • Prioritize between initiatives like improved autoscaling, onboarding new GPUs, region availability, and platform-wide reliability
  • Drive new verticals for Fireworks end to end, such as multi-modal inference and embeddings inference
  • Drive end-to-end execution of infrastructure initiatives with Fireworks engineering teams
  • Determine the end-to-end user experience and billing model for how customers consume Fireworks’ infrastructure
  • Engage deeply with customers to segment user groups and uncover new needs and improvement opportunities
  • Serve as a trusted technical advisor to teams evaluating different AI deployment strategies
  • Strong technical background (CS/EECE background and/or production level development experience) or technical product management experience
  • Intimate familiarity with LLMs, fine-tuning, model inference, agents, etc.
  • 1 - 5+ years of experience in products for software engineers
  • Strong customer interaction skills
  • Ability to communicate highly technical concepts to all kinds of audiences
  • Focus on driving outcomes
  • Ability to navigate complex customer/technical scenarios and come up with creative solutions

Good to Have

  • Prior experience in technical product management roles with a high degree of customer interaction
  • Experience with B2B infrastructure and infrastructure automation frameworks like Kubernetes and Terraform
  • Interest in generative AI or developer tools
  • Early startup or founding experience
  • Prior technical leadership / technical PM experience

Perks & Benefits

  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package
  • Opportunity to solve hard problems at the forefront of AI infrastructure
  • Work with bleeding-edge technology
  • High ownership and impact
  • Learn from world-class engineers and AI researchers

Job Description

About Us:

At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI.

The Role:

Fireworks is hiring a deeply technical, systems-minded PM who is obsessed with building the most performant, reliable, and scalable GPU inference platform in the world. You’ll work on product strategy for Fireworks' core infrastructure—spanning multi-region deployments, autoscaling experiences, new inference verticals, and user-facing packaging.

You will partner directly with customers to understand their workload patterns, reliability needs, and performance requirements in depth. You will convert these insights into an infrastructure roadmap with Fireworks’ core engineering and field teams.

Key Responsibilities:

Fireworks Product Roadmap

  • Build product strategy for cloud infrastructure teams based on customer and market needs. Prioritize between initiatives like improved autoscaling, onboarding new GPUs, region availability, and platform-wide reliability.
  • Drive new verticals for Fireworks end to end, such as multi-modal inference and embeddings inference, from infrastructure requirements to UI interactions
  • Drive end-to-end execution of infrastructure initiatives from requirements to rollout, working closely with Fireworks engineering teams
  • Determine the end-to-end user experience and billing model for how customers consume Fireworks’ infrastructure, through to the UI and product packaging

Customer Engagement

  • Engage deeply with customers to segment user groups and uncover new needs and improvement opportunities
  • Serve as a trusted technical advisor to teams evaluating different AI deployment strategies

Minimum Requirements:

  • Strong technical background (CS/EECE background and/or production level development experience) or technical product management experience
  • Intimate familiarity with LLMs, fine-tuning, model inference, agents, etc.
  • 1 - 5+ years of experience in products for software engineers (hiring at multiple levels for this role)
  • Strong customer interaction skills, deep customer success focus
  • Ability to communicate highly technical concepts, including the latest AI developments, to all kinds of audiences
  • Focus on driving outcomes, not just technical milestones
  • Ability to navigate complex customer/technical scenarios and come up with creative solutions

Preferred Qualifications:

  • Prior experience in technical product management roles with a high degree of customer interaction
  • Experience with B2B infrastructure and infrastructure automation frameworks like Kubernetes and Terraform
  • Interest in generative AI or developer tools
  • Early startup or founding experience
  • Prior technical leadership / technical PM experience

Total compensation for this role also includes meaningful equity in a fast-growing startup, along with a competitive salary and comprehensive benefits package. Base salary is determined by a range of factors including individual qualifications, experience, skills, interview performance, market data, and work location. The listed salary range is intended as a guideline and may be adjusted.

Base Pay Range (Plus Equity)

$170,000 - $240,000 USD

Why Fireworks AI?

  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.

Fireworks AI is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all innovators.
