Site Reliability Engineer

2 Months ago • 3-6 Years • DevOps • Undisclosed

About the job

Job Description

ByteDance is seeking a talented Site Reliability Engineer to join its team. This role involves building, expanding, and operating the company's global infrastructure, including large-scale systems in public and private clouds, data centers, and content delivery networks. You'll be responsible for building tools, automations, and visualizations to facilitate the operation and optimization of the global infrastructure, and working in a fast-paced environment to respond to performance and reliability issues. This is a great opportunity to work with complex systems at scale and contribute to the development of a global tech giant.
Must have:
  • Master's degree or Bachelor's with 3+ years experience
  • 3+ years experience with Unix/Linux systems
  • 3+ years experience in programming languages like Java, C++, Go, or scripting (Shell, Python)
Good to have:
  • Self-driven and capable of handling ambiguity
  • Strong analytical skills and problem-solving abilities
  • Experience with automation and tools for large-scale systems
  • Experience with cloud services like AWS, Google Cloud, Azure
  • Experience with networking technologies like TCP/IP, BGP, DNS
  • Experience with OpenStack, Kubernetes, Nginx, ipvs, ELK stack, Hadoop
Responsibilities
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. ByteDance's platforms aim to help users explore and discover the world's creativity, knowledge and moments that matter in everyday life, while empowering everyone to be a creator directly from their smartphones. We are committed to building a safe, healthy and positive online environment for all our users. We embrace a culture of diversity, intellectual curiosity, openness and problem solving.

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked with ensuring that infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including systems that administer hyperscale data centers and public clouds, global content distribution networks (CDNs), and load balancers that handle Tbps of traffic.

Responsibilities:
  • Build, expand and operate ByteDance's global infrastructure, including large-scale systems in public and private clouds, data centers and content delivery networks.
  • Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global infrastructure.
  • Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
  • Help improve the whole lifecycle of infrastructure services, from inception and design through development to deployment, user support and refinement.
Qualifications
Minimum qualifications:
  • Master's degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Electrical Engineering, Computer Science or a related major.
  • 3+ years of experience working with Unix/Linux systems, from kernel to shell and beyond, including system libraries, file systems, and client-server protocols.
  • 3+ years of experience in one or more programming languages such as Java, C++, or Go, or scripting experience in Shell and Python.
Preferred qualifications:
  • Self-driven, capable of coping with ambiguity and moving projects from concept to delivery.
  • Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.
  • Experience in designing, analyzing and building automation and tools for large-scale systems.
  • Experience in building solutions with AWS, Google Cloud, Azure and other cloud services.
  • Experience in networking technologies such as TCP/IP, BGP, DNS, etc. in a carrier-grade environment.
  • Experience in developing and operating one or more of the following systems: OpenStack, Kubernetes, Nginx, ipvs, ELK stack, Hadoop, etc.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We believe individuals shouldn't be disadvantaged because of their background or identity, but instead should be considered based on their strengths and experience. We are passionate about this and hope you are too.