Software Engineer, AI Infrastructure

1 month ago • All levels • DevOps

Job Description

Our team builds a highly available, scalable, general-purpose serverless (FaaS) platform at ByteDance that handles 100M+ QPS. We enable one-click function creation and deployment, abstracting away infrastructure complexity to reduce developer burden. The platform scales functions dynamically to optimize resource utilization and cost, relying on lightweight execution and rapid startup. We are seeking innovative engineers to develop AI agent ecosystems, design secure sandbox infrastructure for large model inference, improve the FaaS platform's usability and scalability, architect global high availability with NoOps capabilities, and optimize cold start performance for demanding serverless functions.
Good to have:
  • containerization
  • networking
  • distributed tracing
Perks:
  • Inspiring creativity
  • Global, diverse teams
  • Creating value for our communities
  • Enriching life
  • Inclusive space
  • Being valued for skills, experiences, and unique perspectives

Job Details

Team Introduction

Our team is dedicated to building a highly available and scalable general-purpose serverless platform that embodies the philosophy of Function-as-a-Service (FaaS). By enabling one-click function creation and deployment while abstracting away infrastructure and operational complexity, we significantly reduce developers' burden in both development and maintenance. Relying on lightweight function execution and rapid startup, our platform scales functions dynamically to optimize resource utilization and cost. The platform currently handles 100M+ QPS, and our architecture and product scale are industry-leading. We seek innovative, passionate engineers with experience in high-availability systems to join us in pioneering the future of serverless computing.

Responsibilities

  • Develop AI Agent Ecosystems: Contribute to designing and building AI agent frameworks, tool integration systems, and multi-agent collaboration platforms.
  • Design Secure Sandbox Infrastructure: Lead the development of sandbox technologies to support secure and efficient large model inference and training workloads.
  • Enhance Serverless Platform: Drive the design and evolution of our FaaS platform, focusing on usability, scalability, and cost optimization for enterprise users.
  • Build Global High-Availability Architecture: Architect automated disaster recovery and fault tolerance mechanisms across multi-cluster and multi-region environments to achieve NoOps capabilities.
  • Optimize Cold Start Performance: Innovate solutions for large-scale cold start scenarios, delivering multi-layered optimization to meet the demanding requirements of serverless functions.

Qualifications

Minimum Qualifications:

  • Strong Programming Fundamentals: Proficiency in algorithms, data structures, and at least one programming language (Go, Python, Java, Node.js, Rust, C).
  • Distributed Systems Expertise: Hands-on experience with large-scale distributed systems, including system modeling and problem-solving in production environments.
  • Cloud Native Experience: Familiarity with Kubernetes, Knative, Firecracker, or similar open-source projects.
  • Serverless Product Knowledge: Experience with AWS Lambda, Google Cloud Functions, or equivalent platforms.

Preferred Qualifications:

  • Familiarity with containerization, networking, and distributed tracing.

If you're passionate about pushing the boundaries of serverless computing and thrive in fast-paced, innovative environments, we want to hear from you!

About The Company

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.