Research Engineer (Foundation Model) New Grad

2 Weeks ago • Upto 1 Years • Artificial Intelligence

About the job

Job Description

This role is for a Research Engineer specializing in foundation models, focusing on Generative AI. Responsibilities include leading research in multimodal foundation models, designing and developing algorithms and architectures to enhance model performance and scalability, optimizing models for production environments, managing large-scale data clusters, and collaborating with cross-functional teams. The ideal candidate possesses strong engineering skills in Python and PyTorch, experience building machine learning models from scratch, familiarity with generative multimodal models (Diffusion Models and GANs), and a solid understanding of deep learning concepts including Transformers. The position is open to recent graduates and offers a competitive salary and equity package.
Must have:
  • Strong Python & PyTorch skills
  • Experience building ML models
  • Generative model familiarity (Diffusion, GANs)
  • Deep learning understanding (Transformers)
  • Large-scale data cluster management
Good to have:
  • 1+ year experience
  • 100+ GPU system experience
  • Linux cluster proficiency
Perks:
  • Competitive equity package
  • Comprehensive benefits plan

We are seeking a highly skilled Research Engineer with extensive experience in training Generative AI models. As part of our research team, you will play a key role in building state-of-the-art multimodal foundation models and managing large-scale training runs on thousands of GPUs. Your expertise will directly impact the performance, scalability, and efficiency of our next-generation AI technologies.

Key Responsibilities

  • Lead and contribute to groundbreaking research in multimodal foundation models.
  • Design, develop, and experiment with innovative algorithms, architectures, and techniques to enhance model performance and scalability.
  • Optimize models for production environments, focusing on computational efficiency, throughput, and latency while maintaining accuracy and robustness.
  • Analyze and manage large-scale data clusters, identifying inefficiencies and bottlenecks in training pipelines and data loading processes.
  • Collaborate with cross-functional teams, including data, applied research, and infrastructure teams, to drive impactful projects.

Qualifications

  • Technical Expertise:
    • Demonstrated strong engineering skills in Python and PyTorch.
    • Hands-on experience building machine learning models from scratch using PyTorch.
    • Familiarity with generative multimodal models such as Diffusion Models and GANs.
    • Solid understanding of foundational deep learning concepts, including Transformers.
  • Preferred Experience:
    • 1 year+ industrial or academic lab experience.
    • Experience working with large distributed systems involving 100+ GPUs.
    • Proficiency with Linux clusters, systems, and scripting.

Note: This role is open to recent graduates.

Compensation

The salary range for this position in California is $160,000–$200,000 per year. The final offer will be based on job-related expertise, skills, candidate location, and experience. Additionally, we provide competitive equity packages in the form of stock options and a comprehensive benefits plan.

View Full Job Description
$160.0K - $200.0K/yr (Outscal est.)
$180.0K/yr avg.
Palo Alto, California, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

An idea-to-video platform that brings your creativity to motion.

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

Palo Alto, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Pika

Similar Jobs

Wargaming - Director of AI Engineering

Wargaming, Poland (On-Site)

Worley - Data Scientist II

Worley, India (Hybrid)

Fluence - Sr.Controls Software Engineer

Fluence, India (Hybrid)

Oblivious - Senior Algorithms Engineer

Oblivious, India (Hybrid)

ByteDance - Research Scientist, Vision Foundation Model

ByteDance, United States (On-Site)

Inkittt - Director of AI

Inkittt, United States (On-Site)

C3 AI - Solution Engineer

C3 AI, India (On-Site)

Barbaricum - Senior Program Protection Specialist

Barbaricum, United States (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

G5 Games - C++ Gameplay Programmer

G5 Games, Cyprus (Remote)

Matic Robots - iOS Engineer, Graphics and Rendering

Matic Robots, United States (On-Site)

Paypal - Senior Data Scientist, MMM

Paypal, United States (Hybrid)

CloudHire - React + Blockchain Developer

CloudHire, India (Remote)

Larian Studios - Graphics Programmer

Larian Studios, Ireland (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Palo Alto, California, United States

Blizzard Entertainment - Associate Program Manager | Irvine, CA or Austin, TX

Blizzard Entertainment, United States (Hybrid)

Inkittt - Senior Product Manager, Monetization

Inkittt, United States (On-Site)

Patreon - Staff Data Scientist

Patreon, United States (Hybrid)

Warner Bros Discovery - Diversity in Entertainment Legal Fellowship: LA- Summer 2025

Warner Bros Discovery, United States (Hybrid)

Nielsen Holdings - Field Sales Representative

Nielsen Holdings, United States (On-Site)

Next Level Business Services - Sr. Java Developer

Next Level Business Services, United States (On-Site)

The Walt Disney Company - Director, Licensing (Scripted)

The Walt Disney Company, United States (On-Site)

Gala - Nodes Business Leader

Gala, United States (On-Site)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Get notifed when new similar jobs are uploaded