Site Reliability Engineer, Traffic Platform

4 Months ago • 3 Years + • DevOps

Job Summary

Job Description

ByteDance seeks a Site Reliability Engineer (SRE) to build, expand, and operate its global traffic platform. Responsibilities include managing large-scale systems across public and private clouds and edge data centers, and building automation tools. The SRE will work in a fast-paced environment, responding to performance and reliability issues and optimizing the global traffic platform. This role requires experience with Linux systems, programming languages (Go, Python, Shell), cloud platforms (AWS, Google Cloud, Azure), and CI/CD frameworks. Experience with Kubernetes, Nginx, and networking technologies is preferred.
Must have:
  • Master's degree, or Bachelor's degree with 3+ years of experience
  • Linux systems expertise
  • 3+ years in Go, Python, or Shell
  • Cloud & CI/CD experience (Git, Docker, Kubernetes)
  • Problem-solving skills
  • Build and operate global traffic platform
Good to have:
  • Experience with AWS, Google Cloud, Azure
  • Networking knowledge (TCP/IP, HTTP, DNS)
  • Experience with Kubernetes, Nginx, ipvs, ELK

Job Details

Responsibilities
About ByteDance

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we work toward every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact for ourselves, our company, and the users we serve. Join us.

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked with ensuring that the traffic services are reliable, fault-tolerant, efficiently scalable, and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale datacenters and public cloud, and a global load balancer that handles Tbps of traffic.

Job Description

  • Build, expand, and operate ByteDance's global traffic platform, including large-scale systems in public and private clouds and edge data centers.
  • Build tools, automations, visualizations, and monitors to facilitate the operation and optimization of the global traffic platform.
  • Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
  • Help improve the whole lifecycle of infrastructure services, from inception and design through development, deployment, user support, and refinement.
Qualifications
Minimum qualifications

  • Master's degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Electrical Engineering, Computer Science, or a related major.
  • Proven experience working with Linux systems from kernel to shell and beyond, including system libraries, file systems, and client-server protocols.
  • 3+ years of experience in one or more programming languages such as Go, Python, and Shell script.
  • Familiar with cloud and CI/CD frameworks and tools, such as Git, Docker, Kubernetes, etc.
  • Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
  • Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.

Preferred qualifications

  • Experience in designing, analyzing, and building automation and tools for large-scale systems.
  • Experience in building solutions with AWS, Google Cloud, Azure, and other cloud services.
  • Experience in networking technologies such as TCP/IP, HTTP, DNS, etc. in a carrier-grade environment.
  • Experience in developing and operating one or more of the following systems: Kubernetes, Nginx, ipvs, ELK stack, etc.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

