Senior/Tech Lead AI/LLM Network Software Development Engineer

1 Month ago • 4-8 Years • Network Engineering • $194,000 PA - $410,000 PA

Job Summary

Job Description

ByteDance seeks a Senior/Tech Lead AI/LLM Network Software Development Engineer in San Jose to design, implement, and deploy high-speed network technologies for AI/LLM applications. Responsibilities include developing platforms for monitoring and analyzing large-scale AI/LLM networks; researching and developing high-performance AI communication frameworks, network protocol stacks, and co-design optimization; and building next-generation AI network infrastructure. The ideal candidate will have proficiency in computer networks, network programming, and C/C++, Python, or Go. Experience with RDMA, congestion control, AI network optimization, and high-performance communication frameworks (NCCL, MPI, RPC) is highly beneficial.
Must have:
  • Design & implement high-speed network technologies for AI/LLM apps
  • Develop monitoring & analysis platforms for large-scale AI/LLM networks
  • Proficiency in computer networks and network programming
  • Proficiency in C/C++, Python, or Go
Good to have:
  • Experience with RDMA, congestion control, AI network optimization
  • Experience with NCCL, MPI, and RPC libraries
  • Experience in AI network diagnosis and performance optimization
Perks:
  • Medical, dental, and vision insurance
  • 401(k) savings plan with company match
  • Paid parental leave
  • Paid holidays, sick days, and personal time

Job Details

Responsibilities
About ByteDance Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok and Helo as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Why Join Us Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us. About the Team ByteDance Networking brings together innovative ideas and technologies from network architecture, software defined networking (SDN), network virtualization, switch software and hardware co-design, and high-speed networking, to create hyperscale data-center networking solutions that power several of the most popular apps of the world such as Douyin and TikTok which serve hundreds of millions of users around the globe. ByteDance Networking is responsible for designing, building, and operating the global, intelligent network infrastructure to meet the requirements of high availability, scalability, and high-performance. By joining this team, you will gain marketable software development and/or network operation experiences in data center networking at massive scale. Responsibilities: - Design, implementation and deployment of high-speed network technologies to support AI/LLM applications. - Design and development of platforms/systems for monitoring, analysis and diagnosis of large scale AI/LLM network. - Research and development of high-performance AI communication framework, network protocol stacks, and codesign optimization of host-network-application to improve the scalability, reliability and performance of AI/LLM network. - Building next generation AI network infrastructure supporting large scale heterogeneous network hardware with innovative and deployable solutions.
Qualifications
Minimum Qualifications - Bachelor or higher degree in computer science, electronic engineering, network engineering or related fields. - Proficiency in computer network and network programming. - Proficiency in one or several mainstream programming languages, including C/C++, Python, Go and so on. - Be familiar with the latest advances in the area of high-speed network systems, including RDMA, congestion control, AI network optimization and so on. - Experience in developing high performance communication frameworks(including NCCL, MPI and RPC libraries) is a plus. - Experience in developing software systems for AI network diagnosis and performance optimization is a plus. Preferred Qualifications - Experience in developing high performance communication frameworks(including NCCL, MPI and RPC libraries) is a plus. - Experience in developing software systems for AI network diagnosis and performance optimization is a plus. ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too. ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/cdpT2
Job Information
【For Pay Transparency】Compensation Description (Annually)

The base salary range for this position in the selected city is $194000 - $410000 annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.

Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates:

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:

1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;

2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and

3. Exercising sound judgment.

Similar Jobs

ION - Senior Consultant - Risk Advisory, Italy

ION

Turin, Piedmont, Italy (On-Site)
6 Months ago
Scale AI - Software Engineer (Product), International Public Sector

Scale AI

Doha, Doha Municipality, Qatar (On-Site)
1 Day ago
Zynga - Senior Data Scientist (Full Stack)

Zynga

Austin, Texas, United States (On-Site)
23 Hours ago
Single Store - Technical Account Manager

Single Store

(Remote)
23 Hours ago
WildBrain - Head of Lighting, CG

WildBrain

Vancouver, British Columbia, Canada (Hybrid)
1 Day ago
ByteDance - Research Scientist, Cloud & AI Computing - DPU/GPU/CPU

ByteDance

Seattle, Washington, United States (On-Site)
2 Weeks ago
ByteDance - Senior/Tech Lead AI/LLM Network Software Development Engineer

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Meta - Technical Program Manager, Net Infra (Backbone)

Meta

Bellevue, Washington, United States (On-Site)
5 Months ago
ByteDance - Network Operations Engineer, EDGE Networking

ByteDance

Singapore (On-Site)
6 Months ago
ByteDance - Research Scientist Intern (Traffic Infrastructure Global Engineering)

ByteDance

San Jose, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Rivos - Silicon Verification - Intern

Rivos

Santa Clara, California, United States (On-Site)
6 Months ago
Social Discovery Group - Senior DevOps

Social Discovery Group

(Remote)
1 Day ago
Coupa - Senior Technical Architect

Coupa

Tokyo, Japan (Hybrid)
6 Hours ago
The Walt Disney Company - Sr Security Specialist - Governance

The Walt Disney Company

Orlando, Florida, United States (On-Site)
2 Weeks ago
Google - Data Science Research, AI Benchmark

Google

Mexico City, Mexico City, Mexico (On-Site)
2 Days ago
Boomi - Software Engineer - Quality

Boomi

India (On-Site)
1 Day ago
Epic Games - Data Analyst

Epic Games

(On-Site)
2 Months ago
Meta - ASIC Engineer, Design Verification

Meta

Sunnyvale, California, United States (Remote)
5 Months ago
Canonical - Linux Cryptography and Security Engineer

Canonical

(Remote)
8 Hours ago

Get notifed when new similar jobs are uploaded

Jobs in San Jose, California, United States

AVER LLC - Senior Latent Print Examiner

AVER LLC

United States (On-Site)
6 Months ago
ByteDance - GPU/AI Application Platform Engineer Intern (Server Platform)

ByteDance

San Jose, California, United States (On-Site)
2 Weeks ago
Next Level Business Services - oracle adf developer

Next Level Business Services

Miami, Florida, United States (On-Site)
6 Months ago
Meta - ASIC Engineer, Design

Meta

Sunnyvale, California, United States (On-Site)
5 Months ago
The Walt Disney Company - Crowds Artist

The Walt Disney Company

Burbank, California, United States (On-Site)
1 Month ago
CharacterAI - Engineering Manager, Safety

CharacterAI

Menlo Park, California, United States (On-Site)
1 Month ago
Google - Technical Program Manager II, Security, CISO

Google

Kirkland, Washington, United States (On-Site)
2 Days ago
ByteDance - Software Engineer Intern (AIGC Platform - Monetization GenAI)

ByteDance

San Jose, California, United States (On-Site)
2 Weeks ago
Meta - Manager, Performance & Capacity Engineering - Capacity Planning Optimization

Meta

Menlo Park, California, United States (On-Site)
5 Months ago
Passive Logic - Senior Embedded Systems Engineer

Passive Logic

Salt Lake City, Utah, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Network Engineering Jobs

NVIDIA - Senior Network Test Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Week ago
ByteDance - Senior Software Engineer, Multi Cloud CDN

ByteDance

Seattle, Washington, United States (On-Site)
3 Days ago
ByteDance - Software Development Engineer (SDN Traffic Intelligence & Control)

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
ByteDance - Senior Software Engineer, Traffic Platform

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
ByteDance - Site Reliability Engineer, Traffic Platform

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Meta - Technical Program Manager, Net Infra (Backbone)

Meta

Menlo Park, California, United States (On-Site)
5 Months ago
PlayStation Global - Staff Linux Network Software Engineer

PlayStation Global

London, England, United Kingdom (On-Site)
1 Month ago
ByteDance - Senior Software Engineer, Multi Cloud CDN

ByteDance

San Jose, California, United States (On-Site)
3 Days ago
ByteDance - Network Engineer, Optical Long-Haul and Submarine

ByteDance

Ashburn, Virginia, United States (On-Site)
2 Months ago
ByteDance - Senior Software Development Engineer - Distributed KV System

ByteDance

San Jose, California, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Where imagination meets innovation, delivering limitless gaming experiences.

San Diego, California, United States (On-Site)

San Jose, California, United States (On-Site)

Dubai, Dubai, United Arab Emirates (On-Site)

New York, New York, United States (On-Site)

San Jose, California, United States (On-Site)

San Jose, California, United States (On-Site)

Seattle, Washington, United States (On-Site)

View All Jobs

Get notified when new jobs are added by ByteDance

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug