Site Reliability Engineer (Cloud) - Infrastructure Engineering

2 Hours ago • 5-5 Years

About the job

SummaryBy Outscal

ByteDance seeks a skilled Site Reliability Engineer (Cloud) with 5+ years of experience in Unix/Linux systems, cloud technologies (AWS, Google, OCI), and automation tools. Strong analytical and problem-solving skills are essential for this fast-paced role.
Responsibilities
About ByteDance Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Why Join Us Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us. About the Team The Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end to end lifecycle of server fleet, providing cloud solutions and various infrastructure services ensuring that they are scalable and are reliable. Responsibilities • Build, expand, and operate Bytedance’s global infrastructures, including large-scale systems in public and private clouds, data centers, and content delivery networks. • Build tools, automation, visualizations, and monitors to facilitate the operation and optimization of the global infrastructure. • Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues. • Help improve the whole lifecycle of infrastructure services from inception and design throughout development to deployment, user support, and refinement. • Deploy and configure solutions in the cloud. • Automate cloud operations, develop infrastructure automation scripts and participates in the continuous improvement of cloud solutions. • Participate in the specification, setup and run Proof of Concepts and demonstrations of cloud solutions. • Administer and maintain servers across virtual platforms.
Qualifications
• At least a Bachelor’s degree in any of these faculties: Computer Science, Information Technology, Programming & Systems Analysis, Science (Computer Studies) • 5+ years of experience working with Unix/Linux systems from kernel to shell and beyond, with experience working with system libraries, file systems, and client-server protocols. • 5+ years experience with essential system-level apps, like DNS, APT, LDAP, Nginx, CI/CD, Ansible, Packer etc. • 3+ years experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python. • Self-driven and capable of coping with ambiguity and moving projects from concept to delivery. • Strong analytical skills and the ability to solve real-world problems in a fast-moving environment. • Experience in designing, analyzing, and building automation and tools for large-scale systems • Experience in building solutions with AWS, Google, OCI, and other cloud services. • Strong communication and collaboration skills Preferred Qualifications • Master’s degree (or Bachelor's degree with 3+) years of experience in Computer Engineering, Electrical Engineering, Computer Science, or related major. • Familiarity with Kubernetes techniques. • Familiarity with Microservices and FaaS techniques. • Experience in Web App or UI design and implementation. • Experience in DB design, usage, and DBA. • Experience with Unit Tests, integration tests, and performance tests. • Experience in system and data security. ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

About The Company

Where imagination meets innovation, delivering limitless gaming experiences.

View All Jobs

Similar Jobs

PlayStation Global - Site Reliability Engineer II

England, United Kingdom (On-Site)

Visa - Site Reliability Engineer - Docker, Kubernetes, AWS

Masovian Voivodeship, Poland (Hybrid)

Visa - Sr. Site Reliability Engineer

Colorado, United States (Hybrid)

Visa - Sr. Site Reliability Engineer - PRE

Colorado, United States (Hybrid)

Visa - Site Reliability Engineer Intern

Colorado, United States (On-Site)

Visa - Site Reliability Engineer - New College Grad 2025

Colorado, United States (On-Site)

Visa - Sr. Site Reliability Engineer - PRE

Karnataka, India (Hybrid)

Visa - Site Reliability Engineer Intern

Texas, United States (On-Site)

Similar Skill Jobs

Aristocrat Gaming - Data Analyst

Uttar Pradesh, India (Hybrid)

Go Fund Me - Manager, Data Science

California, United States (Hybrid)

Truecaller - Senior Android Engineer

Stockholm County, Sweden (On-Site)

paypay - Android Engineer

Worldwide (Remote)

Social Discovery Group - Senior NLP Engineer

Serbia (Remote)

Social Discovery Group - Senior NLP Engineer

Georgia (Remote)

Social Discovery Group - Senior NLP Engineer

Poland (Remote)

Starkflow - AWS Architect

Dubai, United Arab Emirates (On-Site)

Software Engineering Jobs

Aristocrat Gaming - Affiliate Program Backoffice

Sliema, Malta (Hybrid)

Truecaller - Senior Android Engineer

Stockholm County, Sweden (On-Site)

paypay - Android Engineer

Worldwide (Remote)

Axinous - Senior Manager, Global CXO Experiences

California, United States (Hybrid)

Starkflow - AWS Architect

Dubai, United Arab Emirates (On-Site)

Netflix - Senior Technical Artist, Games Studio

Los Angeles, Ca, Usa Los Gatos, Ca, Usa (On-Site)

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug