AI Systems Engineer

3 Weeks ago • All levels

Job Summary

Job Description

The AI Systems Engineer will join a growing team to work on large-scale deployment of machine learning models for real-time inference. The responsibilities include developing APIs for AI inference, designing and maintaining scalable infrastructure, benchmarking system performance, and improving system reliability. The engineer will also respond to system outages. The job description also mentions the company's growth and recent funding rounds, including the investors.
Must have:
  • Develop APIs and manage distributed systems
  • Strong understanding of Kubernetes and container orchestration
  • Experience with deploying reliable, distributed, real-time systems
  • Familiarity with LLM architecture and layers
Perks:
  • Comprehensive health, dental, and vision insurance
  • 401(k) plan
  • Equity as part of the total compensation package

Job Details

We are looking for an AI Systems Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.

Responsibilities

  • Develop robust APIs for AI inference used by both internal and external customers
  • Design, deploy, and maintain scalable, reliable infrastructure for deploying machine learning models
  • Benchmark system performance, diagnose bottlenecks, and implement improvements across the inference stack
  • Enhance system reliability and observability by integrating modern monitoring and alerting tools
  • Respond swiftly to system outages and collaborate across teams to maintain high uptime and performance 

Qualifications

  • Experience in developing APIs and managing distributed systems
  • Strong understanding of Kubernetes and container orchestration
  • Experience with deploying reliable, distributed, real-time systems at scale
  • High level familiarity with LLM architecture, and the key pieces (Multi-Head, Multi/Grouped-Query, as well as common Layers)
At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine just over a year ago. Our AI-powered search assistant has amassed 10 million monthly active users as of early 2024, with our mobile apps installed over 1 million times across iOS and Android devices. In 2023 alone, we served over 500 million queries from users around the globe.

To support our rapid expansion, we've raised significant funding from some of the most respected investors in technology. In January 2024, we raised $73.6 million in a Series B round led by IVP, with participation from NVIDIA, Jeff Bezos' investment fund, NEA, Databricks, and other prominent firms. We followed that up with a $62.7 million Series B1 round in April 2024 led by Daniel Gross, valuing Perplexity at over $1 billion.
Our prominent investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Naval Ravikant, Tobi Lutke, and many other visionary individuals.
 
Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from the amounts listed above.
 
Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.

Similar Jobs

SingleStore - SDET

SingleStore

Pune, Maharashtra, India (Remote)
3 Weeks ago
PhonePe - Firmware Engineer

PhonePe

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Electronic Arts - Machine Learning Co-op (PhD Student)

Electronic Arts

Vancouver, British Columbia, Canada (Hybrid)
2 Weeks ago
Demonware - 2025 Canada Fall Co-ops - Project Management - Demonware

Demonware

Vancouver, British Columbia, Canada (On-Site)
3 Weeks ago
miniclip - Senior Software Developer

miniclip

Lisbon, Lisbon, Portugal (Hybrid)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Sony Interactive Entertainment - Software Engineer (Development of PlayStation Development Environment)

Sony Interactive Entertainment

Tokyo, Japan (On-Site)
4 Weeks ago
Paradox Interactive - Engine Graphics Programmer

Paradox Interactive

Stockholm, Stockholm County, Sweden (On-Site)
2 Days ago
Silicon Labs - Engineer I - Infra SW

Silicon Labs

Hyderabad, Telangana, India (On-Site)
2 Weeks ago
Qualcomm - Sensor Subsystem Design Verification Engineer

Qualcomm

Cork, County Cork, Ireland (On-Site)
2 Weeks ago
Ansys - R&D Engineer II

Ansys

Hsinchu County, Taiwan (On-Site)
4 Days ago
Warhorse Studios - C++ Programmer

Warhorse Studios

Prague, Prague, Czechia (Hybrid)
1 Week ago
HCL Tech - Senior Technical Architect .NET, C#, C++

HCL Tech

Stockholm, Stockholm County, Sweden (On-Site)
1 Week ago
Unseen Inc - Senior Gameplay Engineer

Unseen Inc

Tokyo, Japan (Hybrid)
4 Days ago
London stock Exchange - Software Engineer

London stock Exchange

St. Louis, Missouri, United States (On-Site)
3 Weeks ago
broadcom - Applications Developer

broadcom

Irvine, California, United States (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

Monzo - Senior Legal Counsel - Employment, Incentives & Pensions

Monzo

London, England, United Kingdom (Remote)
4 Weeks ago
Thousand Eyes - Principal Software Engineer, Endpoint

Thousand Eyes

London, England, United Kingdom (Hybrid)
1 Day ago
Unbroken Studios - Industrial Water Treatment Scientist

Unbroken Studios

Manchester, England, United Kingdom (On-Site)
4 Weeks ago
Disobey - Influencer Campaigns Lead

Disobey

England, United Kingdom (Remote)
1 Month ago
MRI Software - Benefits Business Consultant

MRI Software

London, England, United Kingdom (Hybrid)
4 Weeks ago
ClearPoint Recruitment - Web Developer

ClearPoint Recruitment

Darlington, England, United Kingdom (On-Site)
5 Years ago
Assystems - Principal Waste Engineer

Assystems

Blackburn, England, United Kingdom (On-Site)
7 Months ago
Tech Holding - Salesforce Business Analyst

Tech Holding

London, England, United Kingdom (Hybrid)
3 Weeks ago
WebTech Corporation - Production/Continuous Improvement Engineer

WebTech Corporation

Burton Upon Trent, England, United Kingdom (On-Site)
1 Week ago
playground - Senior VFX Artist

playground

Royal Leamington Spa, England, United Kingdom (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine just over a year ago. Our AI-powered search assistant has amassed 10 million monthly active users as of early 2024, with our mobile apps installed over 1 million times across iOS and Android devices. In 2023 alone, we served over 500 million queries from users around the globe.

New York, New York, United States (On-Site)

Palo Alto, California, United States (Hybrid)

San Francisco, California, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

San Francisco, California, United States (On-Site)

New York, United States (Hybrid)

California, United States (On-Site)

San Francisco, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Perplexity AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug