Production System Engineer, Infrastructure Engineering Intern

1 Month ago • Up to 1 Year • DevOps

Job Summary

Job Description

The Infrastructure Engineering team at ByteDance supports the company's growth by building and operating hyperscale datacenters. This internship focuses on production systems, encompassing the entire server lifecycle: deployment, OS installation, service management, troubleshooting, decommissioning, and recycling. Responsibilities include server operations, lifecycle management, automation tool development, system monitoring and optimization, troubleshooting, collaboration with other engineering teams, and optional contribution to internal dashboard development. The role involves working with cutting-edge hardware and large-scale server fleets across global datacenters.
Must have:
  • Server operations & support
  • Lifecycle management
  • Automation & tool development
  • System monitoring & optimization
  • Troubleshooting & issue resolution
  • Linux system administration
  • Basic scripting (Python, Bash, or Golang)
Good to have:
  • Server maintenance experience
  • Knowledge of monitoring tools (Prometheus, Grafana, Nagios)
  • Low-code/front-end development experience
  • DevOps practices, CI/CD, Ansible
  • Data analysis
Perks:
  • Work on global infrastructure
  • Mentorship from industry experts
  • High-impact project contribution
  • Exposure to state-of-the-art technologies

Job Details

Responsibilities
About the Team
The Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end-to-end lifecycle of the server fleet, providing cloud solutions and various infrastructure services and ensuring that they are scalable and reliable.

Embark on an exciting expedition to explore the rapidly expanding ByteDance/TikTok domains in the United States, Europe, and Asia. Here, the Infrastructure Engineering team is crafting monumental data citadels that encircle the planet, sheltering legions of hundreds of thousands of servers. As the maestro of our production systems, you will embark on a captivating odyssey, taming the life cycles of these servers. Your adventure will begin with the orchestration of their initial deployment, navigating the intricate terrain of OS installation, summoning services like a digital magician, and maintaining vigilant watch over our inventory. But, like any epic tale, there will be times of challenge when you become a troubleshooter extraordinaire, mending and restoring with unwavering dedication. Eventually, you'll guide them into the sunset, orchestrating their decommissioning and ensuring their rebirth through recycling, all while contributing to the pulsating rhythm of an international company's technological evolution.

Why Join Us?
- Work on real-world global infrastructure in hyperscale environments with cutting-edge hardware.
- Gain mentorship and technical guidance from industry experts.
- Contribute to high-impact projects in server operations, automation, new product introduction, provisioning, and more.
- Gain exposure to state-of-the-art IT and datacenter technologies and a variety of large-scale fleets.

Key Responsibilities
- Server Operations & Infrastructure Support: Assist in the deployment, monitoring, and maintenance of large-scale server fleets across our global datacenters.
- Lifecycle Management: Support the full lifecycle of servers, from system design and deployment through operation, troubleshooting, and decommissioning.
- Automation & Tool Development: Develop and optimize scripts or tools to enhance automation, monitoring, and operational efficiency.
- System Monitoring & Performance Optimization: Implement and refine monitoring solutions to improve the availability, latency, stability, and overall performance of infrastructure services.
- Troubleshooting & Issue Resolution: Work with engineers to diagnose and resolve system issues, contributing to root cause analysis and preventive measures.
- Cross-team Collaboration: Work closely with infrastructure architects, developers, and data center engineers to support ongoing projects and enhancements.
- Platform Development (Optional): If you have experience in front-end development, contribute to building internal dashboards or visualization tools for monitoring infrastructure health and operations.
Qualifications
Minimum Qualifications:
- Currently pursuing a Bachelor’s or Master’s degree in Computer Science, Electronic Engineering, or a related technical field.
- Familiarity with Linux system administration and command-line operations.
- Basic scripting skills in Python, Bash Shell, or Golang.
- Knowledge of server hardware and systems, data center infrastructure, or networking.

Preferred Qualifications (Nice to Have):
- Hands-on experience with server maintenance, hardware diagnostics, or system troubleshooting.
- Knowledge of monitoring tools such as Prometheus, Grafana, or Nagios.
- Experience with low-code platforms or front-end development (JavaScript, Node.js, Django, or similar frameworks) for building dashboards or other systems.
- Understanding of DevOps practices, CI/CD pipelines, or infrastructure automation tools (Ansible).
- Data analysis.

By submitting an application for this role, you accept and agree to our global applicant privacy policy, which may be accessed here: https://jobs.bytedance.com/en/legal/privacy. If you have any questions, please reach out to us at apac-earlycareers@bytedance.com
Job Information
About Us

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join ByteDance

Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.

As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.

Diversity & Inclusion

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
