Director, Site Reliability Engineering

2 Months ago • 10 Years + • $185,000 PA - $205,000 PA

Job Summary

Job Description

The Director, Site Reliability Engineering will lead teams to manage and support hosting services, including colocated hardware and cloud-based services. They will also define and operate processes for change management, financial management, and incident response. Responsibilities include ensuring high availability and security of cloud services, implementing automation and incident management, leading a team of Site Reliability Engineers, developing strategic plans for cloud infrastructure, overseeing infrastructure management for cost-efficiency, maintaining monitoring and alerting systems, collaborating with product development teams, championing DevOps culture, and managing budgets. The role requires a candidate capable of driving process and cultural transformation across organizations.
Must have:
  • 10+ years of cloud engineering experience.
  • 5+ years of experience in a leadership role.
  • Experience managing large-scale AWS cloud platforms.
  • Deep understanding of SRE practices.
  • Experience with cloud infrastructure tools.
  • Excellent leadership and communication skills.
  • Experience driving process and culture transformation.
  • Ability to work with cross-functional teams.
  • Strong problem-solving and decision-making abilities.
Perks:
  • Competitive health plans
  • Paid time-off
  • Company paid holidays
  • 401K retirement program with a Company elected match
  • Other company sponsored programs

Job Details

HHAeXchange is the leading technology platform for home and community-based care. Founded in 2008, HHAeXchange was born out of an idea to create a fully comprehensive end-to-end homecare solution to help people who are aging or have disabilities thrive in their homes and communities. Our employees are passionate about transforming the healthcare space by building the only homecare ecosystem that fully connects patients, personal care providers, managed care organizations, and states. 
 
The Director, Site Reliability Engineering is responsible for leading the teams that manage and support all of our hosting services, including colocated hardware and cloud-based services, as well as defining and operating the processes for change management, financial management and incident response.
 
To perform this job successfully, an individual must be able to perform each essential job duty satisfactorily with or without reasonable accommodation.  Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

Essential Job Duties

    • Ensure high availability, scalability, and security of cloud services across multiple geographies.
    • Implement and improve automation, incident management, and capacity planning practices.
    • Lead and mentor a team of Site Reliability Engineers and leaders. Lead the transformation of the organization to an SRE model.
    • Integrate the technology, practices and policies of disparate organizations into a single cohesive team that supports disparate technologies and platforms with minimal variation in practice.
    • Develop and execute strategic plans for cloud infrastructure and operations to support business growth and acquisitions.
    • Oversee the management and optimization of cloud infrastructure for cost-efficiency.
    • Maintain and improve monitoring, logging, and alerting systems.
    • Collaborate closely with product development teams to facilitate delivery of new functionality and capabilities to our SaaS platform and hosted products.
    • Champion and support the transformation to a DevOps culture.
    • Develop and manage budgets for cloud infrastructure and tooling.
    • Evaluate and implement new technologies and tools to enhance cloud infrastructure and operations.
    • Foster a culture of continuous improvement, collaboration, and innovation.

Other Job Duties

    • Other duties as assigned by supervisor or HHAeXchange leader.

Travel Requirements

    • Travel up to 10%, including overnight travel

Required Education, Experience, Certifications and Skills

    • Bachelor’s or master’s degree in Computer Science, Engineering, or a related field.
    • 10+ years of experience in cloud engineering and operations, with at least 5 years in a leadership role.
    • Proven experience managing large-scale AWS cloud platforms.
    • Deep understanding of modern SRE practices and principles.
    • Experience with cloud infrastructure tools (monitoring, deployment, security).
    • Excellent leadership, communication, and interpersonal skills.
    • Proven experience driving process and culture transformation across organizations.
    • Ability to work effectively with cross-functional teams and stakeholders.
    • Strong problem-solving and decision-making abilities.
The base salary range for this US-based, full-time, exempt position is $185,000-$205,000, not including variable compensation. An employee’s exact starting salary will be based on various factors, including but not limited to experience, education, training, merit, location, and the ability to exemplify the HHAeXchange core values.
 
This is a benefits-eligible position. HHAeXchange offers competitive health plans, paid time off, company-paid holidays, a 401K retirement program with a Company-elected match, and other company-sponsored programs.

HHAeXchange is an equal-opportunity employer. The Company offers employment opportunities to all applicants and employees without regard to race, color, religion, national origin, sex, sexual orientation, gender identity or expression, age, disability, medical condition, marital status, veteran status, citizenship, genetic information, hairstyles, or any other status protected by local or federal law.
