AI Systems Engineer

4 Months ago • All levels • $190,000 PA - $250,000 PA
System Design

Job Description

This AI Systems Engineer role at Perplexity involves developing robust APIs for AI inference, designing and maintaining scalable infrastructure for machine learning model deployment, and improving system performance. Responsibilities include benchmarking, diagnosing bottlenecks, and integrating monitoring tools. The engineer will also respond to system outages. The company has experienced significant growth since its launch in 2022. They offer Perplexity Enterprise Pro and have raised significant funding. The role provides a chance to work on large-scale deployment of machine learning models for real-time inference.
Must Have:
  • Develop APIs for AI inference for internal and external use.
  • Strong understanding of Kubernetes and container orchestration.
  • Experience deploying reliable, distributed, real-time systems.
  • Familiarity with LLM architecture and key components.
Perks:
  • Comprehensive health, dental, and vision insurance.
  • 401(k) plan.
  • Equity may be part of the total compensation package.

Add these skills to join the top 1% applicants for this job

cpp
cuda
rust
pytorch
kubernetes
python
machine-learning

We are looking for an AI Systems Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

Responsibilities

  • Develop robust APIs for AI inference used by both internal and external customers
  • Design, deploy, and maintain scalable, reliable infrastructure for deploying machine learning models
  • Benchmark system performance, diagnose bottlenecks, and implement improvements across the inference stack
  • Enhance system reliability and observability by integrating modern monitoring and alerting tools
  • Respond swiftly to system outages and collaborate across teams to maintain high uptime and performance 

Qualifications

  • Experience in developing APIs and managing distributed systems
  • Strong understanding of Kubernetes and container orchestration
  • Experience with deploying reliable, distributed, real-time systems at scale
  • High level familiarity with LLM architecture, and the key pieces (Multi-Head, Multi/Grouped-Query, as well as common Layers)

The cash compensation range for this role is $190,000 - $250,000.

At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine in 2022. We've grown from answering 2.5 million questions per day at the start of 2024 to around 20 million daily queries in December 2024. We also offer Perplexity Enterprise Pro, which counts leading companies like Nvidia, the Cleveland Cavaliers, Bridgewater, and Zoom as customers.

To support our rapid expansion, we've raised significant funding from some of the most respected technology investors. Our investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Daniel Gross, Naval Ravikant, Tobi Lutke, and many other visionary individuals. In 2024, our employee base grew nearly 300%, and we're just getting started.

Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from the amounts listed above.

Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.

Set alerts for more jobs like AI Systems Engineer
Set alerts for new jobs by Perplexity
Set alerts for new System Design jobs in United States
Set alerts for new jobs in United States
Set alerts for System Design (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙