Site Reliability Engineer, Traffic Platform - 2025 Start

3 Months ago • All levels • DevOps • Network Engineering

Job Summary

Job Description

ByteDance's Traffic Infrastructure Global Engineering (TIGE) team is seeking experienced software engineers to join its Kubernetes-based Cloud Native Traffic Platform. As a Site Reliability Engineer (SRE), you will be responsible for building, expanding and operating ByteDance's global traffic platform, ensuring the traffic services are reliable, fault-tolerant, efficiently scalable and cost-effective. Your role will involve building tools, automations, visualizations and monitors for the platform, working in a fast-paced environment, and participating in technical operations and rotations in response to performance and reliability issues. You will manage complex systems at scale, including traffic systems that serve hyperscale datacenters and public cloud, and global load balancers that handle Tbps of traffic.
Must have:
  • Master's or Bachelor's degree in Computer Engineering, Electrical Engineering, Computer Science or related major
  • Experience with Linux systems
  • Experience in one or more programming languages such as Go, Python and Shell script
  • Familiar with cloud and CI/CD frameworks and tools, such as Git, Docker, Kubernetes, etc.
  • Self-driven and capable of coping with ambiguity
  • Strong analytical skills
Good to have:
  • Experience in designing, analyzing and building automation and tools for large scale systems
  • Experience in building solutions with AWS, Google, Azure and other cloud services
  • Experience in networking technologies such as TCP/IP, HTTP, DNS, etc. in a carrier-grade environment
  • Experience in developing and operating Kubernetes, Nginx, ipvs, ELK stack, etc.

Job Details

Responsibilities
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.

We are looking for talented individuals to join us in 2025. As a graduate, you will get unparalleled opportunities to kickstart your career, pursue bold ideas and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.

Candidates can apply to a maximum of two positions and will be considered for jobs in the order they apply. The application limit is applicable to TikTok and its affiliates' jobs globally. Applications will be reviewed on a rolling basis - we encourage you to apply early.

Team Introduction
The Traffic Infrastructure Global Engineering (TIGE) team at ByteDance operates a large network of POPs around the world that we use to accelerate site traffic and cache CDN content, and we own all layer 4 and layer 7 traffic management for TikTok Edge. By joining us, you can learn how to build content delivery networks and an Edge Computing Platform within TikTok's Edge. To better support TikTok, the TIGE team is seeking experienced software engineers who can help improve our Kubernetes-based Cloud Native Traffic Platform.

The traffic platform balances, manages and processes TikTok application traffic across all of TikTok's edge clusters. The traffic platform also contains varied network services that orchestrate the delivery of bits from our servers to your phone. Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked with ensuring the traffic services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale datacenters and public cloud, and global load balancers that handle Tbps of traffic.

Responsibilities:
  • Build, expand and operate ByteDance's global traffic platform, including large-scale systems in public and private clouds and edge data centers.
  • Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global traffic platform.
  • Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
  • Help improve the whole lifecycle of infrastructure services, from inception and design, through development, to deployment, user support and refinement.
Qualifications
Minimum Qualifications:
  • Master's or Bachelor's degree in Computer Engineering, Electrical Engineering, Computer Science or related major.
  • Experience with Linux systems from kernel to shell and beyond, including experience working with system libraries, file systems, and client-server protocols.
  • Past experience in one or more programming languages such as Go, Python and Shell script.
  • Familiar with cloud and CI/CD frameworks and tools, such as Git, Docker, Kubernetes, etc.
  • Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
  • Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.
Preferred Qualifications:
  • Experience in designing, analyzing and building automation and tools for large scale systems.
  • Experience in building solutions with AWS, Google, Azure and other cloud services.
  • Experience in networking technologies such as TCP/IP, HTTP, DNS, etc. in a carrier-grade environment.
  • Experience in developing and operating one or more of the following systems: Kubernetes, Nginx, ipvs, ELK stack, etc.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

By submitting an application for this role, you accept and agree to our global applicant privacy policy, which may be accessed here: https://jobs.bytedance.com/en/legal/privacy.

If you have any questions, please reach out to us at apac-earlycareers@bytedance.com
