Lead Site Reliability Engineer

1 Week ago • 6 Years + • DevOps • $148,000 PA - $185,000 PA

Job Summary

Job Description

Lead SRE initiatives across multiple projects and products, collaborating with cross-functional teams to shape platform and infrastructure engineering efforts. Drive technical excellence by mentoring engineers and fostering a culture of continuous learning and innovation. Architect and automate self-healing infrastructure with declarative configurations, GitOps, and event-driven automation. Design, develop, and maintain software-driven infrastructure automation. Own product deployment, performance tuning, monitoring, and alerting to ensure high availability and system efficiency. Create robust observability strategies with service level agreements.
Must have:
  • 6+ years managing distributed cloud environments
  • Expert in networking and web concepts
  • Deep expertise in Kubernetes and container runtimes
  • Experience with IaC and configuration management tools
  • Strong software development skills (Go, Python)
  • Leading engineering teams and guiding technology roadmaps
  • Strong understanding of Linux-based operating systems
Good to have:
  • Understanding of applications written in object-oriented languages (C#/.NET, Java)
Perks:
  • Bonus
  • Equity
  • Benefits

Job Details

We’re defining what it means to build and deliver the most extraordinary sports and entertainment experiences. Our global team is trailblazing new markets, developing cutting-edge products, and shaping the future of responsible gaming.

Here, “impossible” isn’t part of our vocabulary. You’ll face some of the toughest but most rewarding challenges of your career. They’re worth it. Channeling your inner grit will accelerate your growth, help us win as a team, and create unforgettable moments for our customers.

The Crown Is Yours

As a Lead Site Reliability Engineer, you will drive key initiatives to enhance the reliability, scalability, and efficiency of our infrastructure. You’ll collaborate across teams to architect infrastructure automation while mentoring other Engineers to foster a culture of continuous learning and innovation. In this role, you will shape deployment strategies, performance tuning, and monitoring frameworks to support our rapid growth.

What you’ll do as a Lead Site Reliability Engineer

  • Lead SRE initiatives across multiple projects and products, collaborating with cross-functional teams to shape platform and infrastructure engineering efforts across the organization.

  • Drive technical excellence by mentoring and guiding engineers, fostering a culture of continuous learning and innovation.

  • Architect and automate self-healing, fault-tolerant infrastructure with declarative configurations, GitOps, and event-driven automation for scalable deployments across public clouds and on-premise.

  • Design, develop, and maintain software-driven infrastructure automation to build internal tools and eliminate repetitive operational tasks.

  • Own and drive decisions on product deployment, performance tuning, monitoring, and alerting to ensure high availability and system efficiency in production.

  • Create robust observability strategies with service level agreements to support our rapid traffic growth.

What you’ll bring   

  • 6+ years of experience managing distributed cloud environments (GCP, AWS, vSphere, Nutanix) and platform automation at scale.

  • Expert-level understanding of networking and web concepts, with the ability to debug issues down to the packet level.

  • Deep expertise in container orchestration (Kubernetes) and container runtimes (Docker, containerd), with the ability to design, scale, and troubleshoot complex workloads.

  • Experience with Infrastructure as Code (IaC) and configuration management tools (Terraform, Ansible, Chef, etc.), ensuring scalable and repeatable infrastructure provisioning.

  • Strong experience developing software for automation and infrastructure tooling (Go, Python).

  • Experience leading engineering teams and guiding technology roadmaps in large-scale, distributed environments.

  • Strong understanding of Linux-based operating systems, including performance tuning, kernel debugging, and low-level system optimizations.

  • Understanding of applications written in object-oriented languages (C#/.NET, Java).

Join Our Team

We’re a publicly traded (NASDAQ: DKNG) technology company headquartered in Boston. As a regulated gaming company, you may be required to obtain a gaming license issued by the appropriate state agency as a condition of employment. Don’t worry, we’ll guide you through the process if this is relevant to your role.

The US base salary range for this full-time position is 148,000.00 USD - 185,000.00 USD, plus bonus, equity, and benefits as applicable. Our ranges are determined by role, level, and location. The compensation information displayed on each job posting reflects the range for new hire pay rates for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific pay range and how that was determined during the hiring process. It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Similar Jobs

Nielsen Holdings - Senior Software Engineer (Java/Scala, Spark, Kubernetes, AWS)

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Luxoft - Senior Java Developer

Luxoft

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
3 Months ago
Appier - Software Engineer, Backend Development

Appier

Taipei City, Taiwan (On-Site)
3 Months ago
Next Level Business Services - Java Full Stack Developer

Next Level Business Services

Reston, Virginia, United States (On-Site)
5 Months ago
Ello - Senior Unity Engineer (Contract)

Ello

São Paulo, State Of São Paulo, Brazil (Hybrid)
1 Week ago
NVIDIA - Senior Site Reliability Engineer - AI Research Clusters

NVIDIA

Austin, Texas, United States (Hybrid)
1 Month ago
ION - Cloud Engineer Kubernetes

ION

Rome, Lazio, Italy (Hybrid)
5 Months ago
Rackspace Technology - DevOps Engineer (AWS Terraform)

Rackspace Technology

India (Remote)
1 Month ago
Ajmera Infotech - Senior ASP.NET Developer with Azure Expertise

Ajmera Infotech

Hyderabad, Telangana, India (On-Site)
3 Months ago
Toppan Merrill - Site Reliability Engineer

Toppan Merrill

Chennai, Tamil Nadu, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Overwolf - Frontend Developer

Overwolf

Nottingham, England, United Kingdom (On-Site)
5 Days ago
PwC - IN_Senior Associate_ JAVA_Utility Transformation _Advisory_Jaipur

PwC

Jaipur, Rajasthan, India (On-Site)
3 Months ago
ByteDance - Site Reliability Engineer (Systems), Bytedance Engineering

ByteDance

Singapore (On-Site)
5 Months ago
Netflix - Software Engineer (L4/L5) - Enablement Apps

Netflix

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Everyday Health Group - Senior Software Engineer, Backend

Everyday Health Group

(Remote)
4 Days ago
Anavation - Full Stack Developer

Anavation

Maryland, United States (Hybrid)
5 Days ago
JustPlay - Backend Engineer

JustPlay

Berlin, Berlin, Germany (Hybrid)
1 Week ago
Nagarro - Associate Staff Engineer, QA Automation

Nagarro

(On-Site)
5 Months ago
ARHS - Fullstack Developer

ARHS

Liège, Wallonia, Belgium (On-Site)
5 Months ago
The Walt Disney Company - Sr. Principal Software Engineer - Identity

The Walt Disney Company

New York, New York, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Boston, Massachusetts, United States

Epic Games - Senior Technical Product Manager, UE Rendering

Epic Games

Cary, North Carolina, United States (On-Site)
1 Week ago
seeking alpha - Senior Product Manager - Investing group

seeking alpha

United States (Remote)
4 Months ago
Crunchyroll - Corporate Communications Manager (Contract)

Crunchyroll

Los Angeles, California, United States (On-Site)
1 Month ago
ByteDance - Research Scientist in Large Multimodal Models Applications - San Diego

ByteDance

San Diego, California, United States (On-Site)
5 Months ago
The Walt Disney Company - Member Experience Professional II - Branch

The Walt Disney Company

Lake Buena Vista, Florida, United States (On-Site)
1 Week ago
Match Group - Product Operations Specialist

Match Group

Palo Alto, California, United States (Hybrid)
5 Months ago
Rackspace Technology - Google Cloud Engineer IV

Rackspace Technology

United States (Remote)
2 Months ago
Studio Wildcard - Senior Engine Programmer

Studio Wildcard

Bellevue, Washington, United States (Remote)
6 Days ago
NVIDIA - Senior Timing Methodology Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Zoox - Staff Autonomy Integration Manager

Zoox

Foster City, California, United States (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Omnissa - Staff Engineer (C++ Windows Internals)

Omnissa

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Tencent - SRE Intern

Tencent

Amsterdam, North Holland, Netherlands (On-Site)
1 Month ago
ByteDance - Senior Site Reliability Engineer - Data Infrastructure (San Jose)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Luxoft - Senior Software Support Engineer

Luxoft

Poland, Ohio, United States (Remote)
4 Months ago
Rackspace Technology - Sr Big Data Engineer Airflow and Oozie (GCP)

Rackspace Technology

United States (Remote)
2 Months ago
Social Discovery Group - ML Ops Engineer (AI Product)

Social Discovery Group

(Remote)
2 Months ago
Egnyte - Staff Software Engineer

Egnyte

Mountain View, California, United States (Hybrid)
4 Months ago
ION - Software Architect - Java Multi-Tenant SAAS Cloud Native

ION

Pune, Maharashtra, India (On-Site)
5 Months ago
Fortis Games - Senior Cloud Security Engineer

Fortis Games

Portugal (On-Site)
1 Month ago
NVIDIA - Senior Site Reliability Engineer

NVIDIA

Westford, Massachusetts, United States (On-Site)
6 Days ago

Get notifed when new similar jobs are uploaded

About The Company

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

Ukraine (Remote)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (Remote)

Sofia, Sofia City Province, Bulgaria (On-Site)

Boston, Massachusetts, United States (On-Site)

Boston, Massachusetts, United States (On-Site)

London, England, United Kingdom (On-Site)

View All Jobs

Get notified when new jobs are added by DraftKings

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug