Responsibilities
About ByteDance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.
About the Team
The Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end to end lifecycle of server fleet, providing cloud solutions and various infrastructure services ensuring that they are scalable and are reliable.
Embark on an exciting expedition to explore the rapidly expanding ByteDance domain in the United States, Europe, and Asia. Here, the Infrastructure Engineering team is crafting monumental data citadels that encircle the planet, sheltering legions of hundreds of thousands of servers. As the maestro of our production systems, you will embark on a captivating odyssey, taming the life cycles of these servers. Your adventure will begin with the orchestration of their initial deployment, navigating the intricate terrain of OS installation, summoning services like a digital magician, and maintaining vigilant watch over our inventory. But, like any epic tale, there will be times of challenge when you become a troubleshooter extraordinaire, mending and restoring with unwavering dedication. Eventually, you'll guide them into the sunset, orchestrating their decommissioning and ensuring their rebirth through recycling, all while contributing to the pulsating rhythm of ByteDance's technological evolution.
Key Responsibilities:
1. Lead and manage infrastructure automation through Ansible and implement DevOps best practices.
2. Develop, test, and maintain scripts in Bash, PowerShell, and Python to automate manual processes, enhance system functionalities, and troubleshoot issues.
3. Maintain, update, and troubleshoot Windows Server instances, ensuring secure and stable operation within the virtualized environment.
4. Deploy, configure, and oversee VMware environments, ensuring optimization and high availability.
5. Implement and manage physical security technologies including but not limited to Avigilon, Genetec, Lenel, and Traka.
6. Drive automation initiatives to improve operational efficiency and system reliability.
7. Design and implement IaC (Infrastructure as Code) workflows, guiding the transition and educating the team on best practices.
8. Demonstrate in-depth networking expertise, from designing scalable network architectures to troubleshooting complex network issues.
9. Plan and implement SSO (Single Sign-On) for centralized authentication and enhanced security.
10. Provide break-fix troubleshooting for systems, ensuring minimal downtime and swift resolution.
11. Collaborate cross-functionally with various teams to optimize system performance and security.
Qualifications
Minimum Qualifications:
1. Proven experience with Ansible or similar automation platforms (Chef, Puppet, Terraform) and a clear understanding of DevOps methodologies with strong scripting skills in Bash, PowerShell, and Python.
2. Hands-on experience with VMware ESXi deployment, configuration, and management through vCenter.
3. In-depth knowledge of VMware Cloud Foundation(VCF) for designing, deploying and managing infrastructure cloud solutions.
4. Familiarity with leading physical security technologies like Avigilon, Genetec, Lenel, and Traka.
5. Proficiency in IaC workflows and related tools/platforms.
6. Advanced networking skills, including a deep understanding of protocols, routing, switching, and troubleshooting.
7. Experience with SSO and AD/LDAP implementations and integrations.
Preferred Qualifications:
1. Demonstrable experience in break-fix troubleshooting across a variety of systems and platforms.
2. Relevant certifications in Ansible, VMware, or networking will be an added advantage.
3. Strong analytical, problem-solving, and decision-making skills.
4. Excellent communication skills with the ability to convey complex technical concepts to non-technical stakeholders.
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.