Research Scientist Graduate (High-Performance Computing (Inference Optimization) - Vision AI Platform - Seattle) - 2025 Start (PhD)

All levels • Research Development • $184,300 PA - $337,250 PA

Job Description

The Doubao (Seed) Vision AI Platform team at ByteDance is seeking PhD graduates for 2025 to develop end-to-end infrastructure and optimize efficiency for vision-based large models like VLM, VGFM, and T2I. This role involves designing and developing next-generation large model inference engines, optimizing GPU cluster performance for low-latency and high-throughput deployment, and leading inference optimization techniques. Candidates will also build GPU inference acceleration stacks and collaborate on performance analysis and AI infrastructure development.
Must have:
  • Design and develop next-generation large model inference engines.
  • Optimize GPU cluster performance for image/video generation and multimodal models.
  • Lead inference optimization including CUDA/Triton kernel development.
  • Lead TensorRT/TRT-LLM graph optimization.
  • Lead distributed inference strategies and quantization techniques.
  • Lead PyTorch-based compilation (torch.compile).
  • Build a GPU inference acceleration stack with multi-GPU collaboration.
  • Build PCIe optimization and high-concurrency service architecture.
  • Collaborate with algorithm teams on performance bottleneck analysis.
  • Collaborate on software-hardware co-design for vision model deployment.
  • Collaborate on AI infrastructure ecosystem development.
Good to have:
  • Experience in large-scale inference systems
  • Experience in vLLM/TGI customization
  • Experience in advanced quantization/sparsity
Perks:
  • Medical, dental, and vision insurance
  • 401(k) savings plan with company match
  • Paid parental leave
  • Short-term and long-term disability coverage
  • Life insurance
  • Wellbeing benefits
  • 10 paid holidays per year
  • 10 paid sick days per year
  • 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)

Job Details

The Doubao (Seed) Vision AI Platform team focuses on end-to-end infrastructure development and efficiency improvement for Seed vision-based large model development, including data pipeline construction and training, evaluation data delivery, and full-lifecycle efficiency enhancement for visual large models such as VLM, VGFM, and T2I. This also encompasses large-scale training stability and optimization for acceleration, as well as large model inference and multi-machine multi-card deployment.

We are looking for talented individuals to join our team in 2025. As a graduate, you will get unparalleled opportunities to kickstart your career, pursue bold ideas, and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.

Successful candidates must be able to commit to an onboarding date by the end of 2025. We will prioritize candidates who are able to commit to the company start dates. Please state your availability and graduation date clearly in your resume.

Applications will be reviewed on a rolling basis, so we encourage you to apply early. Candidates can apply for a maximum of TWO positions and will be considered for jobs in the order in which they applied. The application limit applies to jobs at ByteDance and its affiliates globally.

Responsibilities

1. Design and develop next-generation large model inference engines, optimizing GPU cluster performance for image/video generation and multimodal models to achieve industrial-grade low-latency & high-throughput deployment.

2. Lead inference optimization including CUDA/Triton kernel development, TensorRT/TRT-LLM graph optimization, distributed inference strategies, quantization techniques, and PyTorch-based compilation (torch.compile).

3. Build a GPU inference acceleration stack with multi-GPU collaboration, PCIe optimization, and high-concurrency service architecture design.

4. Collaborate with algorithm teams on performance bottleneck analysis, software-hardware co-design for vision model deployment, and AI infrastructure ecosystem development.

Qualifications

Minimum Qualifications:

1. Bachelor's/Master's or above in Computer Science/EE/related fields.

2. Proficient in C++/Python and high-performance coding.

3. Expertise in at least one of the following domains: GPU programming (CUDA/Triton/TensorRT), model quantization (PTQ/QAT), parallel computing (multi-GPU/multi-node inference), or compiler optimization (TVM/MLIR/XLA/torch.compile).

4. Deep understanding of Transformer architectures and LLM/VLM/Diffusion model optimization.

Preferred Qualifications:

1. Experience in large-scale inference systems, vLLM/TGI customization, advanced quantization/sparsity.

About The Company

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.