Director of Engineering, Performance & Chaos

1 Month ago • 8 Years + • DevOps

Job Summary

Job Description

As Director of Engineering, Performance & Chaos, you'll lead the implementation of chaos engineering practices to enhance system resilience and performance. Responsibilities include managing an engineering team, leveraging fault injection and observability tools, ensuring high availability and scalability, collaborating with SRE, Architecture, and Incident Management teams, and establishing key performance metrics. You'll need 8+ years of software engineering experience with 5+ years in leadership roles, expertise in chaos engineering, and strong communication skills. The role involves managing infrastructure using AWS and GCP, implementing resilience engineering principles, and driving performance testing best practice adoption.
Must have:
  • 8+ years software engineering experience
  • 5+ years engineering leadership
  • Chaos engineering expertise
  • Strong communication & collaboration
  • AWS/GCP infrastructure management
  • Resilience engineering knowledge
Good to have:
  • Chaos Mesh and Gremlin experience
  • Microservices architecture knowledge
  • CI/CD and DevOps experience

Job Details

We’re defining what it means to build and deliver the most extraordinary sports and entertainment experiences. Our global team is trailblazing new markets, developing cutting-edge products, and shaping the future of responsible gaming.

Here, “impossible” isn’t part of our vocabulary. You’ll face some of the toughest but most rewarding challenges of your career. They’re worth it. Channeling your inner grit will accelerate your growth, help us win as a team, and create unforgettable moments for our customers.

The Crown Is Yours

As a Director of Engineering within the Engineering Excellence’s Performance & Chaos Engineering team, you will play a pivotal leadership role focused on enhancing system resilience and performance across the organization. You will spearhead the strategic implementation of chaos engineering practices, helping to proactively identify and mitigate system vulnerabilities. By leveraging fault injection, observability tools, and resilience testing, you will ensure the infrastructure remains highly available, scalable, and aligned with service-level agreements. You will collaborate with SRE, Architecture, and Incident Management to embed performance testing best practices and resilience strategies into the development lifecycle.

What you’ll do as a Director of Engineering

  • Lead the adoption and execution of chaos engineering practices to identify weaknesses, validate reliability, and improve systems proactively.

  • Foster a culture of innovation, collaboration, accountability, and inclusion within the engineering organization.

  • Lead, mentor, and manage an engineering team, providing strategic direction and career development.

  • Manage infrastructure using AWS, GCP, and modern cloud computing practices.

  • Using a combination of testing, fault injection, and observability tools, ensure systems are resilient and meet SLAs.

  • Implement resilience engineering principles to reduce downtime and maintain high availability across services.

  • Partner with Site Reliability, Architecture, Problem & Incident Management, and other stakeholders to drive performance testing best practice adoption.

  • Establish and track key performance metrics to measure team output and system performance.

What skills you’ll bring

  • 8+ years of experience in software engineering, with 5+ years in engineering leadership roles.

  • Proven experience in managing and scaling engineering teams in fast-paced, high-growth environments.

  • Proven background in chaos engineering, testing strategies, and operational excellence.

  • Strong communication skills and proven ability to collaborate effectively with partners across various departments.

  • Expertise with chaos engineering platforms, Chaos Mesh and Gremlin is highly desired.

  • Familiarity with microservices architecture, CI/CD pipelines, and DevOps practices.

#LI-MF1

Join Our Team

We’re a publicly traded (NASDAQ: DKNG) technology company headquartered in Boston. As a regulated gaming company, you may be required to obtain a gaming license issued by the appropriate state agency as a condition of employment. Don’t worry, we’ll guide you through the process if this is relevant to your role.

Similar Jobs

Ajmera Infotech - Site Reliability Engineer - Kubernetes

Ajmera Infotech

San Jose, California, United States (On-Site)
3 Months ago
Omnissa - Staff Engineer (C++,MacOS Internals)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
7 Months ago
Rackspace Technology - Cloud Practice Engineer III

Rackspace Technology

Jalisco, Mexico (Remote)
1 Month ago
Omnissa - C++ Engineering Manager

Omnissa

Bengaluru, Karnataka, India (Hybrid)
8 Months ago
Playtika - Senior DATA/AI SRE Engineer

Playtika

Poland (On-Site)
6 Months ago
Luxoft - Senior .Net developer with AWS

Luxoft

Poland, Ohio, United States (Remote)
6 Months ago
Rackspace Technology - AI/ML Architect

Rackspace Technology

Vietnam (Remote)
2 Months ago
GoTo Group - Senior Software Engineer - Event Platform

GoTo Group

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Brillio - DB Migration Engineer - R01531207

Brillio

Bengaluru, Karnataka, India (Hybrid)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Bally's Interactive - Senior Data Developer

Bally's Interactive

Toronto, Ontario, Canada (Hybrid)
1 Month ago
Crunchyroll - Senior Data Engineer - Platform Engineering

Crunchyroll

San Francisco, California, United States (Remote)
5 Months ago
The Walt Disney Company - Senior Software Engineer, Ad Platforms

The Walt Disney Company

Washington, United States (On-Site)
1 Month ago
CI Games  - Automation Engineer

CI Games

Île-de-France, France (Remote)
2 Months ago
Genies - Machine Learning Infrastructure Engineer, 3D Model Inference & Deployment

Genies

San Mateo, California, United States (On-Site)
2 Months ago
ByteDance - Software Engineer, SRE - Platform Services

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
IGN - Senior Full Stack Software Engineer

IGN

New York, New York, United States (Hybrid)
6 Months ago
Argus Labs - Site Reliability Engineer

Argus Labs

Calgary, Alberta, Canada (Remote)
2 Months ago
Tencent - Tencent Cloud - Senior Cloud Architect (R&D & Solution Design)

Tencent

Singapore (On-Site)
6 Months ago
ByteDance - Senior Software Engineer, Cloud Infrastructure

ByteDance

San Jose, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Canada

PwC - PowerBI Lead

PwC

Toronto, Ontario, Canada (On-Site)
7 Months ago
Evolution - Card Shuffler

Evolution

Burnaby, British Columbia, Canada (On-Site)
1 Month ago
Evolution - Customer Service - Japanese Speaking Game Presenter

Evolution

Burnaby, British Columbia, Canada (On-Site)
1 Month ago
Artists Animation Studio - Technical Lead / Support Technician

Artists Animation Studio

Kelowna, British Columbia, Canada (Hybrid)
10 Months ago
Blazesoft - Online Casino Program Manager

Blazesoft

Canada (On-Site)
10 Months ago
Evolution - Customer Service - Korean Speaking Online Game Presenter - $24.75/hour + bonus (Live Casino Dealer)

Evolution

Burnaby, British Columbia, Canada (On-Site)
8 Months ago
Behaviour Interactive - Associate Art Director - Dead by Daylight

Behaviour Interactive

Montreal, Quebec, Canada (Hybrid)
2 Months ago
NVIDIA - Senior ASIC Verification Engineer

NVIDIA

Canada (Hybrid)
2 Months ago
Global Step - Croatian Localization Video Game Tester (LQA)

Global Step

Montreal, Quebec, Canada (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Google - Systems Development Engineer, Google Distributed Cloud

Google

Kirkland, Washington, United States (On-Site)
1 Month ago
Google - Customer Engineer (English, Japanese)

Google

Tokyo, Japan (On-Site)
1 Month ago
Modio - Cloud Systems Engineer

Modio

Prahran, Victoria, Australia (On-Site)
2 Months ago
Luxoft - Lead Integration and Release Engineer

Luxoft

Bucharest, Bucharest, Romania (On-Site)
6 Months ago
Google - Software Engineer III, Google Cloud

Google

Dublin, County Dublin, Ireland (On-Site)
1 Month ago
Impronics Technologies - AWS Cloud Engineer

Impronics Technologies

Gurugram, Haryana, India (On-Site)
1 Year ago
PwC - IN- Senior Associate_ DevOps_Advisory Corporate_Advisory _Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Argus Labs - Site Reliability Engineer (South East Asia)

Argus Labs

(Remote)
1 Month ago
Revenera - Senior Site Reliability Engineer

Revenera

Bengaluru, Karnataka, India (Hybrid)
7 Months ago
Google - Software Engineering Manager II, Google Cloud

Google

Hyderabad, Telangana, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Boston, Massachusetts, United States (On-Site)

United States (Remote)

Boston, Massachusetts, United States (On-Site)

Plovdiv, Plovdiv Province, Bulgaria (Remote)

Boston, Massachusetts, United States (On-Site)

London, England, United Kingdom (On-Site)

Sofia, Sofia City Province, Bulgaria (Remote)

View All Jobs

Get notified when new jobs are added by DraftKings

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug