AI Systems Engineer

8 Hours ago • All levels

Job Summary

Job Description

The AI Systems Engineer will join a growing team and work on deploying machine learning models for real-time inference. Responsibilities include developing robust APIs for AI inference, designing and maintaining scalable infrastructure for machine learning model deployment, benchmarking system performance, improving system reliability, and responding to system outages. The role requires collaboration across teams to maintain high uptime and performance. The company has experienced rapid growth with a large user base and has secured significant funding from prominent investors. Benefits include health, dental, and vision insurance, along with a 401(k) plan and equity may be part of the total compensation package.
Must have:
  • Develop APIs and manage distributed systems
  • Understand Kubernetes and container orchestration
  • Deploy reliable, distributed, real-time systems at scale
  • Familiarity with LLM architecture
Perks:
  • Comprehensive health, dental, and vision insurance
  • 401(k) plan
  • Equity may be part of the total compensation package

Job Details

We are looking for an AI Systems Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

Responsibilities

  • Develop robust APIs for AI inference used by both internal and external customers
  • Design, deploy, and maintain scalable, reliable infrastructure for deploying machine learning models
  • Benchmark system performance, diagnose bottlenecks, and implement improvements across the inference stack
  • Enhance system reliability and observability by integrating modern monitoring and alerting tools
  • Respond swiftly to system outages and collaborate across teams to maintain high uptime and performance 

Qualifications

  • Experience in developing APIs and managing distributed systems
  • Strong understanding of Kubernetes and container orchestration
  • Experience with deploying reliable, distributed, real-time systems at scale
  • High level familiarity with LLM architecture, and the key pieces (Multi-Head, Multi/Grouped-Query, as well as common Layers)
At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine just over a year ago. Our AI-powered search assistant has amassed 10 million monthly active users as of early 2024, with our mobile apps installed over 1 million times across iOS and Android devices. In 2023 alone, we served over 500 million queries from users around the globe.

To support our rapid expansion, we've raised significant funding from some of the most respected investors in technology. In January 2024, we raised $73.6 million in a Series B round led by IVP, with participation from NVIDIA, Jeff Bezos' investment fund, NEA, Databricks, and other prominent firms. We followed that up with a $62.7 million Series B1 round in April 2024 led by Daniel Gross, valuing Perplexity at over $1 billion.
Our prominent investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Naval Ravikant, Tobi Lutke, and many other visionary individuals.
 
Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from the amounts listed above.
 
Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in London, England, United Kingdom

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

San Francisco, California, United States (On-Site)

London, England, United Kingdom (On-Site)

San Francisco, California, United States (On-Site)

London, England, United Kingdom (Hybrid)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Perplexity AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug