Senior/Tech Lead AI/LLM Network Software Development Engineer

4 Months ago • 4-8 Years • Network Engineering • $194,000 PA - $410,000 PA

Job Summary

Job Description

ByteDance seeks a Senior/Tech Lead AI/LLM Network Software Development Engineer to design, implement, and deploy high-speed network technologies for AI/LLM applications. Responsibilities include developing platforms for monitoring and analyzing large-scale AI/LLM networks, researching high-performance AI communication frameworks, and building next-generation AI network infrastructure. The ideal candidate possesses expertise in computer networks, network programming (C/C++, Python, Go), high-speed network systems (RDMA, congestion control), and experience with high-performance communication frameworks (NCCL, MPI, RPC). The role involves optimizing AI/LLM network scalability, reliability, and performance, working with large-scale heterogeneous network hardware.
Must have:
  • High-speed network tech design & deployment
  • AI/LLM network monitoring & analysis platform development
  • High-performance AI communication framework research
  • Proficiency in C/C++, Python, Go
  • Experience with RDMA, congestion control
Good to have:
  • Experience with NCCL, MPI, RPC libraries
  • AI network diagnosis and performance optimization experience
Perks:
  • Medical, dental, and vision insurance
  • 401(k) savings plan with company match
  • Paid parental leave
  • Paid time off
  • Wellbeing benefits

Job Details

Responsibilities
About ByteDance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok and Helo as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.

About the Team
ByteDance Networking brings together innovative ideas and technologies from network architecture, software-defined networking (SDN), network virtualization, switch software and hardware co-design, and high-speed networking to create hyperscale data-center networking solutions. These solutions power several of the world's most popular apps, such as Douyin and TikTok, which serve hundreds of millions of users around the globe. ByteDance Networking is responsible for designing, building, and operating the global, intelligent network infrastructure to meet the requirements of high availability, scalability, and high performance. By joining this team, you will gain marketable software development and/or network operation experience in data center networking at massive scale.

Responsibilities:
  • Design, implement, and deploy high-speed network technologies to support AI/LLM applications.
  • Design and develop platforms and systems for monitoring, analyzing, and diagnosing large-scale AI/LLM networks.
  • Research and develop high-performance AI communication frameworks and network protocol stacks, and co-design host-network-application optimizations to improve the scalability, reliability, and performance of AI/LLM networks.
  • Build next-generation AI network infrastructure that supports large-scale heterogeneous network hardware with innovative and deployable solutions.
Qualifications
Minimum Qualifications
  • Bachelor's or higher degree in computer science, electronic engineering, network engineering, or a related field.
  • Proficiency in computer networking and network programming.
  • Proficiency in one or more mainstream programming languages, such as C/C++, Python, or Go.
  • Familiarity with the latest advances in high-speed network systems, including RDMA, congestion control, and AI network optimization.

Preferred Qualifications
  • Experience developing high-performance communication frameworks (including NCCL, MPI, and RPC libraries).
  • Experience developing software systems for AI network diagnosis and performance optimization.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/cdpT2
Job Information
【For Pay Transparency】Compensation Description (Annually)

The base salary range for this position in the selected city is $194,000 - $410,000 annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.

Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates:

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:

1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;

2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and

3. Exercising sound judgment.
