About the Team

The ByteDance Doubao Large Model Team was established in 2023. It is dedicated to developing the industry's most advanced AI large-model technology, becoming a world-class research team, and contributing to the development of technology and society. The team has a long-term vision and commitment in the field of AI, with research directions covering NLP, CV, speech, and more, and it maintains laboratories and research positions in China, Singapore, the US, and elsewhere. Backed by the platform's abundant data and computing resources, the team invests continuously in these fields and has launched self-developed general-purpose large models with multimodal machine-learning capabilities. Downstream, it supports 50+ businesses such as Doubao, Coze, and Dreamina, and serves enterprise customers through Volcengine. The Doubao APP is currently the largest AIGC application in the Chinese market.

Responsibilities

1. Design and develop the storage-related components of machine-learning systems for diverse large-model inference scenarios (LLM/S2S/VLM/multimodal, etc.), including model distribution and loading, KVCache optimization, data IO performance, and improving TTFT and TBT in LLM serving.
2. Design and implement a multi-level storage system for large-model inference. Combine multiple storage media, including HBM, host memory, distributed disk, and remote large-capacity storage systems (HDFS/object storage), for data storage and migration management, realizing an integrated hierarchy of "near-compute cache + remote large-capacity storage".
3. Optimize the hit rate of the large-model KV Cache, formulating customized optimization strategies across multiple system dimensions, such as the inference framework, traffic scheduling, and the multi-level cache.
Optimize data IO performance by fully leveraging NVLink, RDMA high-speed networking, and GPU Direct technologies on the near-compute side for efficient data transfer, and optimize the replica placement strategy so that load traffic and stored data are reasonably distributed.
4. Design and implement efficient, user-friendly data-access interfaces that integrate seamlessly with the inference framework and manage the KV Cache lifecycle.
5. Handle access, management, operations and maintenance, and monitoring of the multi-level storage system in Kubernetes scenarios to ensure stability.
6. Set up the system and its disaster recovery in multi-datacenter, multi-region, and multi-cloud scenarios, and optimize data placement across clusters.
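The "near-compute cache + remote large-capacity storage" hierarchy described in item 2 can be illustrated with a minimal sketch. All names here (`TieredKVCache`, `near`, `remote`) are hypothetical, and the tiers are simulated with in-memory dicts rather than real HBM or HDFS; the point is only the lookup/promotion/eviction flow between a small fast tier and a large durable one.

```python
from collections import OrderedDict

class TieredKVCache:
    """Toy two-tier cache: a small 'near-compute' LRU tier backed by a
    large-capacity remote store. Both tiers are simulated with dicts."""

    def __init__(self, near_capacity: int):
        self.near = OrderedDict()   # stands in for the HBM/host-memory tier
        self.remote = {}            # stands in for HDFS/object storage
        self.near_capacity = near_capacity
        self.hits = self.misses = 0

    def put(self, key, value):
        self.remote[key] = value    # durable copy always lands in the remote tier
        self._promote(key, value)

    def get(self, key):
        if key in self.near:        # near-compute hit: no remote IO needed
            self.near.move_to_end(key)
            self.hits += 1
            return self.near[key]
        self.misses += 1
        value = self.remote.get(key)
        if value is not None:
            self._promote(key, value)  # warm the near tier after a miss
        return value

    def _promote(self, key, value):
        self.near[key] = value
        self.near.move_to_end(key)
        while len(self.near) > self.near_capacity:
            self.near.popitem(last=False)  # evict LRU entry; remote copy remains
```

A real implementation would move KV blocks asynchronously and track per-tier bandwidth, but the hit/miss accounting above is the quantity the hit-rate work in item 3 would optimize.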
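One common technique behind the KV Cache hit-rate work in item 3 is prefix reuse: requests whose prompts share a token prefix can reuse the cached KV blocks for that prefix. The function below is a hypothetical sketch of the matching step only (the name `longest_cached_prefix` and the dict-of-tuples cache layout are assumptions, not any specific framework's API).

```python
def longest_cached_prefix(prompt_tokens, cached_prefixes):
    """Return the longest cached token prefix of `prompt_tokens`.

    `cached_prefixes` maps a token tuple to its cached KV data
    (represented here by an opaque value). Only the tokens beyond
    the returned prefix would need fresh prefill computation.
    """
    best = ()
    for prefix in cached_prefixes:
        n = len(prefix)
        if n > len(best) and tuple(prompt_tokens[:n]) == prefix:
            best = prefix
    return best
```

Traffic scheduling ties in here: routing requests with a shared prefix to the replica that already holds that prefix's KV blocks is what turns this matching step into an actual hit-rate improvement.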