AI Systems Engineer

3 Hours ago • All levels • $190,000 PA - $250,000 PA

Job Summary

Job Description

This AI Systems Engineer role at Perplexity involves developing robust APIs for AI inference, designing and maintaining scalable infrastructure for machine learning model deployment, and improving system performance. Responsibilities include benchmarking, diagnosing bottlenecks, and integrating monitoring tools. The engineer will also respond to system outages. The company has experienced significant growth since its launch in 2022. They offer Perplexity Enterprise Pro and have raised significant funding. The role provides a chance to work on large-scale deployment of machine learning models for real-time inference.
Must have:
  • Develop APIs for AI inference for internal and external use.
  • Strong understanding of Kubernetes and container orchestration.
  • Experience deploying reliable, distributed, real-time systems.
  • Familiarity with LLM architecture and key components.
Perks:
  • Comprehensive health, dental, and vision insurance.
  • 401(k) plan.
  • Equity may be part of the total compensation package.

Job Details

We are looking for an AI Systems Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

Responsibilities

  • Develop robust APIs for AI inference used by both internal and external customers
  • Design, deploy, and maintain scalable, reliable infrastructure for deploying machine learning models
  • Benchmark system performance, diagnose bottlenecks, and implement improvements across the inference stack
  • Enhance system reliability and observability by integrating modern monitoring and alerting tools
  • Respond swiftly to system outages and collaborate across teams to maintain high uptime and performance 

Qualifications

  • Experience in developing APIs and managing distributed systems
  • Strong understanding of Kubernetes and container orchestration
  • Experience with deploying reliable, distributed, real-time systems at scale
  • High level familiarity with LLM architecture, and the key pieces (Multi-Head, Multi/Grouped-Query, as well as common Layers)

The cash compensation range for this role is $190,000 - $250,000.

At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine in 2022. We've grown from answering 2.5 million questions per day at the start of 2024 to around 20 million daily queries in December 2024. We also offer Perplexity Enterprise Pro, which counts leading companies like Nvidia, the Cleveland Cavaliers, Bridgewater, and Zoom as customers.

To support our rapid expansion, we've raised significant funding from some of the most respected technology investors. Our investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Daniel Gross, Naval Ravikant, Tobi Lutke, and many other visionary individuals. In 2024, our employee base grew nearly 300%, and we're just getting started.

Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from the amounts listed above.

Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.

Similar Jobs

Ubisoft - Gen AI Programmer

Ubisoft

Pune, Maharashtra, India (On-Site)
1 Month ago
NVIDIA - Principal Engineer

NVIDIA

(Remote)
2 Months ago
Scale AI - Head of Frontier Data Operations

Scale AI

San Francisco, California, United States (On-Site)
2 Weeks ago
Grab - Lead Data Scientist

Grab

Beijing, China (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Senior Research Scientist, Foundation Model, Speech Understanding

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Balbix - Staff AI Engineer

Balbix

Bengaluru, Karnataka, India (On-Site)
6 Months ago
NVIDIA - Solutions Architect, Financial Services

NVIDIA

New Jersey, United States (Remote)
1 Month ago
ByteDance - Machine Learning Engineer Graduate (AML Algorithm) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (MS)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Oportun - Senior Software ML Engineer

Oportun

(Remote)
2 Weeks ago
Ness Digital - AI/ML Engineer

Ness Digital

(Remote)
3 Months ago
Outbrain - Data Science Summer School

Outbrain

Paris, Île-de-France, France (On-Site)
2 Weeks ago
ByteDance - Software Engineer Intern (Machine Learning Platform) - 2024 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
ByteDance - Software Engineer, ML System Scheduling

ByteDance

San Jose, California, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Patreon - Staff Product Designer, Design Foundations

Patreon

New York, New York, United States (Hybrid)
1 Month ago
Aspyr - Associate 2 Game Producer

Aspyr

Austin, Texas, United States (On-Site)
2 Weeks ago
Whoop - Senior Mechanical Engineer (Apparel & Accessories)

Whoop

Boston, Massachusetts, United States (On-Site)
3 Days ago
SimpliSafe - Security Monitoring Specialist

SimpliSafe

Richmond, Virginia, United States (On-Site)
2 Weeks ago
Alphasense - Account Executive, Financial Services

Alphasense

New York, United States (On-Site)
2 Days ago
ByteDance - Software Engineer, Architecture and Infrastructure

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Pentair - Group HR Director, Resi Flow

Pentair

Golden Valley, Minnesota, United States (On-Site)
2 Days ago
Treck - Seasonal Sales Associate

Treck

Algonquin, Illinois, United States (On-Site)
2 Days ago
Yodo1 - Business Development Manager, Game Publishing

Yodo1

United States (Remote)
5 Months ago
Postman - Backend Software Engineer, Test Infrastructure

Postman

San Francisco, California, United States (Hybrid)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

London, England, United Kingdom (On-Site)

London, England, United Kingdom (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Perplexity AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug