AI Training Infrastructure Engineer - Post Training

3 Months ago • 6 Years + • Devops • $220,000 PA - $290,000 PA

Job Summary

Job Description

Perplexity is seeking experienced AI Research Engineers and Scientists to improve their in-house Online LLMs, the Sonar models. The role involves creating a robust and effective training framework, especially for post-training LLMs. Responsibilities include building a post-training framework, implementing necessary infrastructure for the latest models and algorithms, owning the data, training, and evaluation pipelines, and collaborating with engineering teams. The job requires experience with large-scale LLM frameworks, proficiency in Python/Pytorch, and a passion for tackling challenging problems.
Must have:
  • Experience with large-scale LLMs frameworks.
  • Strong in Python/Pytorch; C++/CUDA is a plus.
  • Self-starter with willingness to take ownership.
  • Passion for tackling challenging problems.
Good to have:
  • PhD in AI/ML/Systems or related areas.
  • Experience building LLM training frameworks.
Perks:
  • Comprehensive health, dental, and vision insurance.
  • 401(k) plan.
  • Equity may be part of the total compensation package.

Job Details

Perplexity is seeking experienced AI Research Engineers and Scientists to continue to improve our in house Online LLMs, the Sonar models. Your job is to work with team and create a robust and effective training framework (on top of Megatron/PyTorch), especially for post training LLMs.

Responsibilities

  • Build a post training framework that can run cutting-edge model training jobs in scale
  • Implement the necessary infra and components to support latest models and algorithms like SFT, RL (DPO/GRPO) and more
  • Own the full stack data, training, and eval pipelines required to post-train LLM models
  • Work closing with engineering teams to integrate Sonar models into our product.

Qualifications

  • Proven experience with large-scale LLMs frameworks building
  • Strong in Python/Pytorch; C++/CUDA is a plus
  • Self-starter with a willingness to take ownership of tasks
  • Passion for tackling challenging problems
  • Minimum of 6 years of working on relevant projects.

Bonus

  • PhD in AI/ML/Systems or related areas
  • Experience building LLM training frameworks, especially post training

The cash compensation range for this role is $220,000 - $290,000.

At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine in 2022. We've grown from answering 2.5 million questions per day at the start of 2024 to around 20 million daily queries in December 2024. We also offer Perplexity Enterprise Pro, which counts leading companies like Nvidia, the Cleveland Cavaliers, Bridgewater, and Zoom as customers.

To support our rapid expansion, we've raised significant funding from some of the most respected technology investors. Our investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Daniel Gross, Naval Ravikant, Tobi Lutke, and many other visionary individuals. In 2024, our employee base grew nearly 300%, and we're just getting started.

Final offer amounts are determined by multiple factors, including, experience and expertise, and may vary from the amounts listed above.

Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.

 

Similar Jobs

Power Integrations - Test Development Engineer

Power Integrations

Penang, Malaysia (On-Site)
7 Months ago
Qualcomm - Intern - Software Architecture Scripting Support Intern - 6 months

Qualcomm

Timișoara, Timiș, Romania (On-Site)
1 Month ago
Tencent - Lead Tools Engineer

Tencent

Irvine, California, United States (On-Site)
1 Month ago
Mapbox - Software Development Engineer III, EV Routing (Rust)

Mapbox

Germany (Remote)
5 Months ago
Glean - Solutions Engineer - East

Glean

United States (Remote)
1 Month ago
bytedance - Cloud Technical Support Engineer

bytedance

Singapore (On-Site)
4 Months ago
Salesforce - Forward Deployed Engineer - Deployment Strategist

Salesforce

Amsterdam, North Holland, Netherlands (On-Site)
3 Weeks ago
Salesforce - Principal, AgentForce Solution Engineer - Consumer Business Service

Salesforce

San Francisco, California, United States (On-Site)
3 Weeks ago
TensorWave - Cloud Engineer

TensorWave

Las Vegas, Nevada, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Tagwiz - Lead Game Developer

Tagwiz

(On-Site)
1 Month ago
bytedance - Research Scientist, Responsible AI

bytedance

San Jose, California, United States (On-Site)
4 Months ago
extreme network - Senior/Staff Systems Software Engineer – Python, Go, C++, Networking

extreme network

Ontario, Canada (Hybrid)
4 Months ago
Ansys - R&D Engineer II (Solid Mechanics/ HPC)

Ansys

Canonsburg, Pennsylvania, United States (On-Site)
2 Months ago
SEGA - Audio Programmer

SEGA

Horsham, England, United Kingdom (Hybrid)
3 Months ago
N-ix - Senior Qt Engineer

N-ix

Ukraine (Remote)
1 Month ago
bytedance - Research Engineer - Multimodal Model

bytedance

Singapore (On-Site)
9 Months ago
Flexra Software - Manager Site Reliability Engineering

Flexra Software

Bengaluru, Karnataka, India (On-Site)
1 Year ago
bytedance - Tech Lead Software Engineer, OpenXR

bytedance

San Jose, California, United States (On-Site)
3 Weeks ago
Crowd Strick - SDET III, Windows Sensor

Crowd Strick

United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Whatnot - Senior Business Systems Analyst (Workday)

Whatnot

Los Angeles, California, United States (On-Site)
2 Months ago
Naughty Dog - Game Developer

Naughty Dog

Santa Monica, California, United States (On-Site)
2 Months ago
Dream world  - Internship: Game Design / QA Testing (Fall-Winter)

Dream world

Redwood City, California, United States (On-Site)
3 Weeks ago
Kavalirio - Test Engineer III

Kavalirio

Fort Meade, Maryland, United States (On-Site)
1 Month ago
TransUnion - Lead Financial Services Consultant

TransUnion

White Plains, New York, United States (On-Site)
3 Months ago
Drive mode - Product Operations Manager

Drive mode

Mountain View, California, United States (Hybrid)
1 Month ago
Notion - Software Engineer, Android

Notion

San Francisco, California, United States (On-Site)
10 Months ago
Activision - Staff Software Engineer

Activision

San Francisco, California, United States (On-Site)
2 Months ago
Apple - Cellular RF Firmware Engineer

Apple

San Diego, California, United States (On-Site)
2 Months ago
Thatgamecompany - Engine Programmer

Thatgamecompany

United States (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Autodesk - Solutions Engineer

Autodesk

Tokyo, Japan (On-Site)
2 Months ago
HappyRobot - Site Reliability Engineer

HappyRobot

San Francisco, California, United States (On-Site)
2 Months ago
playrix  - Senior Release Automation Engineer (Gardenscapes)

playrix

Ireland (Remote)
6 Months ago
Interactive Brokers - Platform Operations Engineer - Linux

Interactive Brokers

Zug, Zug, Switzerland (On-Site)
3 Months ago
ARHS - Azure Cloud Architect (m/f)

ARHS

Luxembourg (On-Site)
4 Months ago
Wargaming - IT Systems Reliability Engineer

Wargaming

Vilnius, Vilnius County, Lithuania (Hybrid)
3 Weeks ago
bytedance - Infrastructure Software Engineer in Edge Cloud

bytedance

San Jose, California, United States (On-Site)
4 Months ago
Everi - Solutions Architect II

Everi

Las Vegas, Nevada, United States (Hybrid)
1 Month ago
Aerovect - Senior Process Automation Engineer

Aerovect

Atlanta, Georgia, United States (On-Site)
2 Months ago
oportun - Senior Software Engineer, Cloud Infrastructure

oportun

Mexico (Remote)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Belgrade, Serbia (Hybrid)

Palo Alto, California, United States (On-Site)

New York, New York, United States (Hybrid)

New York, New York, United States (On-Site)

San Francisco, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

London, England, United Kingdom (On-Site)

New York, United States (On-Site)

New York, New York, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Perplexity

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug