Senior/Tech Lead AI/LLM Network Software Development Engineer

3 Months ago • 4-8 Years • Network Engineering • $194,000 PA - $410,000 PA

Job Summary

Job Description

ByteDance seeks a Senior/Tech Lead AI/LLM Network Software Development Engineer to design, implement, and deploy high-speed network technologies for AI/LLM applications. Responsibilities include developing platforms for monitoring and analyzing large-scale AI/LLM networks, researching high-performance AI communication frameworks, and building next-generation AI network infrastructure. The ideal candidate possesses expertise in computer networks, network programming (C/C++, Python, Go), high-speed network systems (RDMA, congestion control), and experience with high-performance communication frameworks (NCCL, MPI, RPC). The role involves optimizing AI/LLM network scalability, reliability, and performance, working with large-scale heterogeneous network hardware.
Must have:
  • High-speed network tech design & deployment
  • AI/LLM network monitoring & analysis platform development
  • High-performance AI communication framework research
  • Proficiency in C/C++, Python, Go
  • Experience with RDMA, congestion control
Good to have:
  • Experience with NCCL, MPI, RPC libraries
  • AI network diagnosis and performance optimization experience
Perks:
  • Medical, dental, and vision insurance
  • 401(k) savings plan with company match
  • Paid parental leave
  • Paid time off
  • Wellbeing benefits

Job Details

Responsibilities
About ByteDance

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok and Helo as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.

About the Team

ByteDance Networking brings together innovative ideas and technologies from network architecture, software-defined networking (SDN), network virtualization, switch software and hardware co-design, and high-speed networking to create hyperscale data-center networking solutions that power several of the world's most popular apps, such as Douyin and TikTok, which serve hundreds of millions of users around the globe. ByteDance Networking is responsible for designing, building, and operating the global, intelligent network infrastructure to meet the requirements of high availability, scalability, and high performance. By joining this team, you will gain marketable software development and/or network operation experience in data center networking at massive scale.

Responsibilities:
- Design, implementation, and deployment of high-speed network technologies to support AI/LLM applications.
- Design and development of platforms/systems for monitoring, analysis, and diagnosis of large-scale AI/LLM networks.
- Research and development of high-performance AI communication frameworks, network protocol stacks, and host-network-application co-design optimization to improve the scalability, reliability, and performance of AI/LLM networks.
- Building next-generation AI network infrastructure supporting large-scale heterogeneous network hardware with innovative and deployable solutions.
Qualifications
Minimum Qualifications
- Bachelor's degree or higher in computer science, electronic engineering, network engineering, or related fields.
- Proficiency in computer networks and network programming.
- Proficiency in one or more mainstream programming languages, including C/C++, Python, and Go.
- Familiarity with the latest advances in high-speed network systems, including RDMA, congestion control, and AI network optimization.

Preferred Qualifications
- Experience in developing high-performance communication frameworks (including NCCL, MPI, and RPC libraries) is a plus.
- Experience in developing software systems for AI network diagnosis and performance optimization is a plus.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

ByteDance Inc. is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/cdpT2
Job Information
【For Pay Transparency】Compensation Description (Annually)

The base salary range for this position in the selected city is $194,000 - $410,000 annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.

Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates:

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:

1. Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;

2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and

3. Exercising sound judgment.
