AI Systems Engineer

1 Month ago • All levels • System Design • $190,000 PA - $250,000 PA

Job Summary

Job Description

This AI Systems Engineer role at Perplexity involves developing robust APIs for AI inference, designing and maintaining scalable infrastructure for machine learning model deployment, and improving system performance. Responsibilities include benchmarking, diagnosing bottlenecks, integrating monitoring tools, and responding to system outages. Perplexity has grown significantly since its public launch in 2022, offers Perplexity Enterprise Pro, and has raised significant funding. The role offers the chance to work on large-scale deployment of machine learning models for real-time inference.
Must have:
  • Experience developing APIs for AI inference for internal and external use.
  • Strong understanding of Kubernetes and container orchestration.
  • Experience deploying reliable, distributed, real-time systems.
  • Familiarity with LLM architecture and key components.
Perks:
  • Comprehensive health, dental, and vision insurance.
  • 401(k) plan.
  • Equity may be part of the total compensation package.

Job Details

We are looking for an AI Systems Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, and Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

Responsibilities

  • Develop robust APIs for AI inference used by both internal and external customers
  • Design, deploy, and maintain scalable, reliable infrastructure for deploying machine learning models
  • Benchmark system performance, diagnose bottlenecks, and implement improvements across the inference stack
  • Enhance system reliability and observability by integrating modern monitoring and alerting tools
  • Respond swiftly to system outages and collaborate across teams to maintain high uptime and performance 

Qualifications

  • Experience in developing APIs and managing distributed systems
  • Strong understanding of Kubernetes and container orchestration
  • Experience with deploying reliable, distributed, real-time systems at scale
  • High-level familiarity with LLM architecture and its key components (multi-head, multi-query, and grouped-query attention, as well as common layers)

The cash compensation range for this role is $190,000 - $250,000.

At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine in 2022. We've grown from answering 2.5 million questions per day at the start of 2024 to around 20 million daily queries in December 2024. We also offer Perplexity Enterprise Pro, which counts leading companies like Nvidia, the Cleveland Cavaliers, Bridgewater, and Zoom as customers.

To support our rapid expansion, we've raised significant funding from some of the most respected technology investors. Our investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Daniel Gross, Naval Ravikant, Tobi Lutke, and many other visionary individuals. In 2024, our employee base grew nearly 300%, and we're just getting started.

Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above.

Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents, plus a 401(k) plan.
