Research Scientist Graduate (High-Performance Computing (Inference Optimization) - Vision AI Platform-Seattle) - 2025 Start (PhD)

1 Month ago • All levels • Research Development • $184,300 PA - $337,250 PA

Job Summary

Job Description

The Doubao (Seed) Vision AI Platform team at ByteDance is seeking 2025 PhD graduates to develop end-to-end infrastructure and improve efficiency for vision-based large models such as VLM, VGFM, and T2I. The role involves designing and developing next-generation large model inference engines, optimizing GPU cluster performance for low-latency, high-throughput deployment, and leading inference optimization efforts. Candidates will also build GPU inference acceleration stacks and collaborate on performance analysis and AI infrastructure development.
Must have:
  • Design and develop next-generation large model inference engines.
  • Optimize GPU cluster performance for image/video generation and multimodal models.
  • Lead inference optimization including CUDA/Triton kernel development.
  • Lead TensorRT/TRT-LLM graph optimization.
  • Lead distributed inference strategies and quantization techniques.
  • Lead PyTorch-based compilation (torch.compile).
  • Build GPU inference acceleration stack with multi-GPU collaboration.
  • Build PCIe optimization and high-concurrency service architecture.
  • Collaborate with algorithm teams on performance bottleneck analysis.
  • Collaborate on software-hardware co-design for vision model deployment.
  • Collaborate on AI infrastructure ecosystem development.
Good to have:
  • Experience in large-scale inference systems
  • Experience in vLLM/TGI customization
  • Experience in advanced quantization/sparsity
Perks:
  • Medical, dental, and vision insurance
  • 401(k) savings plan with company match
  • Paid parental leave
  • Short-term and long-term disability coverage
  • Life insurance
  • Wellbeing benefits
  • 10 paid holidays per year
  • 10 paid sick days per year
  • 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure)

Job Details

The Doubao (Seed) Vision AI Platform team focuses on end-to-end infrastructure development and efficiency improvement for Seed vision-based large model development, including data pipeline construction and training, evaluation data delivery, and full-lifecycle efficiency enhancement for visual large models such as VLM, VGFM, and T2I. This also encompasses large-scale training stability and acceleration optimization, as well as large model inference and multi-machine multi-card deployment.

We are looking for talented individuals to join our team in 2025. As a graduate, you will get unparalleled opportunities to kickstart your career, pursue bold ideas, and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.

Successful candidates must be able to commit to an onboarding date by the end of 2025. We will prioritize candidates who are able to commit to the company start dates. Please state your availability and graduation date clearly in your resume.

Applications will be reviewed on a rolling basis, so we encourage you to apply early. Candidates can apply for a maximum of TWO positions and will be considered for them in the order in which they applied. The application limit applies to jobs at ByteDance and its affiliates globally.

Responsibilities

1. Design and develop next-generation large model inference engines, optimizing GPU cluster performance for image/video generation and multimodal models to achieve industrial-grade low-latency, high-throughput deployment.

2. Lead inference optimization including CUDA/Triton kernel development, TensorRT/TRT-LLM graph optimization, distributed inference strategies, quantization techniques, and PyTorch-based compilation (torch.compile).

3. Build GPU inference acceleration stack with multi-GPU collaboration, PCIe optimization, and high-concurrency service architecture design.

4. Collaborate with algorithm teams on performance bottleneck analysis, software-hardware co-design for vision model deployment, and AI infrastructure ecosystem development.

Qualifications

Minimum Qualifications:

1. Bachelor's/Master's or above in Computer Science/EE/related fields.

2. Proficient in C++/Python and high-performance coding.

3. Expertise in at least one of the following domains: GPU programming (CUDA/Triton/TensorRT), model quantization (PTQ/QAT), parallel computing (multi-GPU/multi-node inference), or compiler optimization (TVM/MLIR/XLA/torch.compile).

4. Deep understanding of Transformer architectures and LLM/VLM/Diffusion model optimization.

Preferred Qualifications:

1. Experience in large-scale inference systems, vLLM/TGI customization, or advanced quantization/sparsity.

About The Company

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.