Software Engineer II - Platform Engineering

1 Hour ago • 2-3 Years • DevOps

About the job

Summary by Outscal

Must have:
  • 1-2 years Kubernetes experience
  • 1-2 years AWS experience
  • 2-3 years software development/operations experience
  • Python, Bash, or Golang coding skills
  • Experience with infrastructure CI/CD pipelines
  • Experience with IaC (Terraform)
  • Helm templating for k8s manifests
  • Understanding of GitOps
Good to have:
  • Experience with ArgoCD or Flux
  • Experience with PCI compliance
  • Knowledge of Karpenter
  • Experience with service mesh
  • Experience with observability stack (metrics, tracing, logging)

Welcome to Warner Bros. Discovery… the stuff dreams are made of.

Who We Are…

When we say, “the stuff dreams are made of,” we’re not just referring to the world of wizards, dragons and superheroes, or even to the wonders of Planet Earth. Behind WBD’s vast portfolio of iconic content and beloved brands are the storytellers bringing our characters to life, the creators bringing them to your living rooms and the dreamers creating what’s next…

From brilliant creatives to technology trailblazers across the globe, WBD offers career-defining opportunities, thoughtfully curated benefits, and the tools to explore and grow into your best selves. Here you are supported, here you are celebrated, here you can thrive.

Your New Role...

The Direct to Consumer (DTC) Group is a technology company within Warner Bros. Discovery. We are building a global streaming video platform (OTT) and a suite of applications to support all of our networks’ brands globally. We are building modern container-based micro-services operated on AWS. Our platform covers everything from search, catalogue, video transcoding and personalization to global subscriptions, and much more. We build user experiences ranging from classic lean-back viewing to interactive learning applications. We build for connected TVs, web, mobile phones, tablets, and consoles for a large footprint of WBD-owned networks including Max, Eurosport, discovery+ and many more. This is a growing, global engineering group crucial to WBD’s future.

We are hiring a Software Engineer II within the DTC Operational Engineering group with a focus on Cluster Engineering. The team is dedicated to providing a secure, efficient and easy-to-use container runtime platform. We build automation for expanding it to new markets, tenants and geographical regions, bootstrapping a fully featured container runtime ready for back-end development teams to use.

We team up with other platform teams, manage the lifecycle of critical platform components and work closely with SRE to improve reliability and efficiency of the platform. We provide strategies and processes for lifecycle management of the container runtime environment. We own critical components, assess when we should introduce new ones and set the bar for quality. We focus on reducing security incidents while ensuring cost-effectiveness and supporting the business needs. 

Your Role Accountabilities...

Responsibilities include, but are not limited to, operating Kubernetes clusters (version upgrades and managing critical Kubernetes components like Karpenter), security vulnerability mitigations (rolling out security patches to Kubernetes nodes) and cost optimizations (scaling of our thousands of servers running hundreds of in-house built services, supporting millions of customers across the globe). 

  • As a Software Engineer II on the team, you will work with Kubernetes cluster operational tasks like upgrading the Kubernetes version. This includes performing an analysis of what is changing in the new version and how it affects the workload running on each cluster, e.g. deprecated k8s APIs and controller compatibility.

  • You’ll use tools to scan clusters for issues that need remediation, define the remediation actions, and execute them.

  • As part of rolling out upgrades to hundreds of clusters, some ranging up to a thousand nodes, you will identify automation opportunities to reduce the amount of toil needed. 

  • To keep the runtime infrastructure secure you will roll out security patches to Kubernetes nodes (EC2 AMI updates) on a regular basis (PCI compliance demands new patches to be installed at least every 30 days) and help improve the automation tooling to eventually run this on an automated schedule. 

  • To keep our runtime cost low, you will take part in cost optimization efforts, like tweaking the Karpenter node scaling strategies, increasing the bin-packing efficiency of pods on nodes, ensuring the right node family type is used (m, c, r, spot and Graviton/ARM) and identifying over-scaled infrastructure.

  • To enable faster time to new markets and onboarding of new tenants to our streaming platform, you’ll work on automating the creation of new Kubernetes clusters together with bootstrapping of critical platform capabilities like service mesh, deployment systems and the observability stack (metrics, tracing, logging). 
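
To give a feel for the upgrade analysis mentioned above (checking workloads for deprecated k8s APIs before a version upgrade), here is a minimal Python sketch. It is an illustration only, not WBD's tooling: the function name `find_removed_apis`, the sample manifests and the small `REMOVED_IN` table are assumptions made for the example, and the table covers only a handful of well-known API removals.

```python
# Hypothetical, deliberately incomplete table for illustration:
# (apiVersion, kind) -> first Kubernetes version that no longer serves it.
REMOVED_IN = {
    ("extensions/v1beta1", "Ingress"): (1, 22),
    ("batch/v1beta1", "CronJob"): (1, 25),
    ("policy/v1beta1", "PodDisruptionBudget"): (1, 25),
    ("autoscaling/v2beta2", "HorizontalPodAutoscaler"): (1, 26),
}


def find_removed_apis(manifests, target_version):
    """Return (kind, name, apiVersion) for manifests not served in target_version.

    manifests: iterable of dicts with "apiVersion", "kind" and "metadata" keys.
    target_version: (major, minor) tuple of the version being upgraded to.
    """
    findings = []
    for manifest in manifests:
        key = (manifest.get("apiVersion"), manifest.get("kind"))
        removed_in = REMOVED_IN.get(key)
        # Flag the manifest if the target version is at or past the removal.
        if removed_in is not None and target_version >= removed_in:
            name = manifest.get("metadata", {}).get("name", "<unnamed>")
            findings.append((key[1], name, key[0]))
    return findings


# Sample input (made-up workloads) standing in for manifests read from a cluster.
manifests = [
    {"apiVersion": "batch/v1beta1", "kind": "CronJob",
     "metadata": {"name": "nightly-report"}},
    {"apiVersion": "apps/v1", "kind": "Deployment",
     "metadata": {"name": "catalogue"}},
]

for kind, name, api_version in find_removed_apis(manifests, (1, 25)):
    print(f"{kind}/{name} uses {api_version}, which the target version does not serve")
```

In practice the manifests would come from the clusters themselves or from rendered Helm charts, and the removal table would be taken from the upstream Kubernetes deprecation guide rather than hard-coded.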

Qualifications and Experience...

  • At least 1-2 years of Kubernetes experience. 

  • At least 1-2 years of AWS experience. 

  • At least 2-3 years of software development, infrastructure management or operations experience. 

  • Ability to write code for automation in Python, Bash or Golang.

  • Experience with designing infrastructure CI/CD pipelines, e.g. Jenkins or GitHub Actions. 

  • Experience with IaC, preferably Terraform. 

  • Familiarity with Helm templating for k8s manifests.

  • Understanding of how GitOps tooling like ArgoCD or Flux works. 

  • Experience in rolling out infrastructure changes to production by following a change management workflow. 

  • Knowing what metrics to monitor during a change rollout to identify problems. 

  • Strong ownership mentality during rollouts: stop or roll back and fix if problems occur, or escalate to management/on-call if you get stuck.

  • Strong sense of security, always using least-privilege access and firewall configurations when needed for maintenance.

  • Understanding of how running workloads on the Kubernetes clusters may be affected by cluster changes or node rotations. 

  • Willingness to talk to service development teams and understand their challenges when they report problems during maintenance windows. 

  • Ability to define and measure KPIs and honor SLAs for infrastructure maintenance. 

  • Experience with Git and GitHub PR workflows. 

  • Experience in working with Agile – Sprints, Epics/Stories, Jira. 

How We Get Things Done…

This last bit is probably the most important! Here at WBD, our guiding principles are the core values by which we operate and are central to how we get things done. You can find them at   along with some insights from the team on what they mean and how they show up in their day to day. We hope they resonate with you and look forward to discussing them during your interview.

Championing Inclusion at WBD

Warner Bros. Discovery embraces the opportunity to build a workforce that reflects the diversity of our society and the world around us. Being an equal opportunity employer means that we take seriously our responsibility to consider qualified candidates on the basis of merit, regardless of sex, gender identity, ethnicity, age, sexual orientation, religion or belief, marital status, pregnancy, parenthood, disability or any other category protected by law.

If you’re a qualified candidate with a disability and you require adjustments or accommodations during the job application and/or recruitment process, please visit our for instructions to submit your request.

About The Company

Warner Bros. Discovery, a premier global media and entertainment company, offers audiences the world’s most differentiated and complete portfolio of content, brands and franchises across television, film, streaming and gaming. The new company combines WarnerMedia’s premium entertainment, sports and news assets with Discovery’s leading non-fiction and international entertainment and sports businesses.
