AI Training Infrastructure Engineer - Post Training

10 Hours ago • 6 Years + • $220,000 PA - $290,000 PA

Job Summary

Job Description

Perplexity is seeking experienced AI Research Engineers and Scientists to improve its in-house Online LLMs, the Sonar models. You will collaborate with the team to create a robust and effective training framework, particularly for post-training LLMs. Responsibilities include building a post-training framework; implementing infrastructure to support the latest models and algorithms such as SFT and RL (DPO/GRPO); owning the full-stack data, training, and evaluation pipelines required to post-train LLMs; and working closely with engineering teams to integrate Sonar models into the product. The company has grown significantly since 2022 and offers comprehensive benefits, including health, dental, and vision insurance and a 401(k) plan.
Must have:
  • Proven experience building large-scale LLM frameworks
  • Strong in Python/PyTorch; C++/CUDA is a plus
  • Self-starter with a willingness to take ownership of tasks
  • Passion for tackling challenging problems
Good to have:
  • PhD in AI/ML/Systems or related areas
  • Experience building LLM training frameworks, especially for post-training
Perks:
  • Comprehensive health, dental, and vision insurance for you and your dependents
  • 401(k) plan
  • Equity may be part of the total compensation package

Job Details

Perplexity is seeking experienced AI Research Engineers and Scientists to continue improving our in-house Online LLMs, the Sonar models. Your job is to work with the team to create a robust and effective training framework (on top of Megatron/PyTorch), especially for post-training LLMs.

Responsibilities

  • Build a post-training framework that can run cutting-edge model training jobs at scale
  • Implement the necessary infrastructure and components to support the latest models and algorithms, such as SFT, RL (DPO/GRPO), and more
  • Own the full-stack data, training, and eval pipelines required to post-train LLMs
  • Work closely with engineering teams to integrate Sonar models into our product

Qualifications

  • Proven experience building large-scale LLM frameworks
  • Strong in Python/PyTorch; C++/CUDA is a plus
  • Self-starter with a willingness to take ownership of tasks
  • Passion for tackling challenging problems
  • Minimum of 6 years of experience working on relevant projects

Bonus

  • PhD in AI/ML/Systems or related areas
  • Experience building LLM training frameworks, especially for post-training

The cash compensation range for this role is $220,000 - $290,000.

At Perplexity, we've experienced tremendous growth and adoption since publicly launching the world's first fully functional conversational answer engine in 2022. We've grown from answering 2.5 million questions per day at the start of 2024 to around 20 million daily queries in December 2024. We also offer Perplexity Enterprise Pro, which counts leading companies like Nvidia, the Cleveland Cavaliers, Bridgewater, and Zoom as customers.

To support our rapid expansion, we've raised significant funding from some of the most respected technology investors. Our investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Daniel Gross, Naval Ravikant, Tobi Lutke, and many other visionary individuals. In 2024, our employee base grew nearly 300%, and we're just getting started.

Final offer amounts are determined by multiple factors, including experience and expertise, and may vary from the amounts listed above.

Equity: In addition to the base salary, equity may be part of the total compensation package.
Benefits: Comprehensive health, dental, and vision insurance for you and your dependents. Includes a 401(k) plan.

 





