About Us:
At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function calling and multimodal models. Fireworks is a Series C company valued at $4 billion and backed by top investors including Benchmark, Sequoia, Lightspeed, Index, and Evantic. We’re an ambitious, collaborative team of builders, founded by veterans of Meta PyTorch and Google Vertex AI.
The Role:
As an Enterprise Solutions Architect at Fireworks.ai, you will be the technical and strategic cornerstone for our most ambitious enterprise customers. This is a highly technical, customer-facing leadership position where you will architect and guide the implementation of groundbreaking GenAI applications. You will translate complex business problems into scalable technical solutions, leveraging the full power of the Fireworks.ai platform to deliver transformative value. Your expertise will directly influence our product roadmap and cement our leadership in the AI infrastructure landscape.
Key Responsibilities:
- Strategic Customer Leadership: Serve as the primary technical trusted advisor for CTOs, VPs of Engineering and AI leads, guiding their AI strategy and architecting production-grade solutions for their most critical GenAI use cases (e.g., advanced RAG, AI agents, complex fine-tuning).
- Accelerate Time-to-Value: Lead and own the technical sales cycle, from scoping and executing high-impact Proof-of-Concepts to delivering compelling demonstrations that showcase our platform's superior performance and unique capabilities, like our function-calling and multi-modal models.
- Champion Development: Build and nurture relationships with technical champions and executive stakeholders, creating long-term partnerships and embedding Fireworks.ai as a critical platform for innovation.
- Voice of the Customer & Product Influence: Synthesize customer feedback and market trends into high-signal, actionable insights for our Product and Engineering teams, directly influencing the Fireworks.ai roadmap.
- Scale Expertise: Develop authoritative technical content (e.g., reference architectures, blogs, open-source tools) to educate the market and elevate the practice of high-performance AI deployment.
- Evangelism & Enablement: Represent Fireworks.ai as a technical expert at industry conferences, developer meetups, and strategic customer workshops. Produce technical content like workshops, webinars, and documentation to enable both customers and internal teams on advanced Fireworks capabilities and AI best practices.
Minimum Qualifications:
- Bachelor's degree in Computer Science or a related field, or equivalent practical experience.
- 7+ years in a customer-facing technical architecture role (Solutions Architect, Sales Engineer) with at least 3 years in a pre-sales or consulting capacity.
- Deep, hands-on expertise with the LLM stack: a strong understanding of inference optimization, fine-tuning methodologies (e.g., LoRA, QLoRA), and the engineering trade-offs in deploying large-scale models.
- Expertise in Python with experience building and debugging applications on API-based platforms.
- Exceptional communication and presentation skills, with the ability to articulate complex technical concepts to both engineers and C-level executives.
Preferred Qualifications:
- Master's or PhD in Computer Science, Machine Learning, or a related field.
- Proven experience designing and implementing AI/ML solutions in enterprise environments.
- Strong familiarity with cloud infrastructure (AWS, GCP, Azure), containerization (Docker, Kubernetes), and infrastructure-as-code (Terraform).
- A proven track record of influencing technical strategy and closing complex, multi-million dollar deals.
- A strong sense of ownership, a bias for action, and an unwavering commitment to technical excellence.
Why Fireworks AI?
- Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure, from low-latency inference to scalable model serving.
- Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally.
- Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results.
- Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation.