Cloud Site Reliability Staff Developer

2 Months ago • 10 Years + • Devops • $122,000 PA - $162,000 PA

Job Summary

Job Description

Barracuda is seeking a passionate and experienced Staff Site Reliability Engineer (SRE) to join their Managed Service Provider (MSP) and Managed Extended Detection and Response (XDR) business units. This role focuses on ensuring the availability and seamless scaling of high-volume, critical SaaS applications. Responsibilities include designing application infrastructure with a focus on scalability, security, and reliability, developing infrastructure automation tools and templates, leading architectural decisions, and approving major system design changes. The SRE will also be responsible for platform development, service level management, incident response, disaster recovery planning, and overseeing technical design for non-functional requirements. The role involves implementing modern solutions using technologies like AWS, Kubernetes, GitHub Actions, Jenkins, Terraform, and Pulumi, as well as building data infrastructure with Databricks, Spark, and the ELK stack. Participation in on-call rotation and mentoring junior team members is also expected.
Must have:
  • 10+ years infrastructure design experience
  • 5+ years cloud development experience
  • 3+ years SRE/DevOps experience
  • AWS cloud infrastructure expertise
  • Terraform, CloudFormation, or Pulumi experience
  • GitHub Actions, Jenkins, Ansible, or Puppet experience
  • Docker and Kubernetes experience
  • Python, Go, or Ruby programming skills
  • Advanced Linux knowledge
  • New Relic, Elastic APM, CloudWatch, Prometheus, or Grafana experience
  • Databricks, Apache Spark, or Kafka experience
  • Strong debugging and troubleshooting skills
  • Excellent communication skills
Good to have:
  • Experience with blue/green, canary, rolling deployments
  • Experience with Crossplane
  • Experience with Packer or Puppet
  • Experience with EKS in AWS environments
  • Experience with DataStage
  • AWS certifications (Solutions Architect, DevOps)
  • Kubernetes certifications (CKA, CKAD, CKS)
Perks:
  • Equity in the form of non-qualifying options
  • Opportunities for cross-training and internal mobility
  • Inclusive and barrier-free work environment

Job Details

Req ID: 26-124
 
Managed Service Provider (MSP) and Managed Extended Detection and Response (XDR)
Come join our passionate team! Barracuda is a leading cybersecurity company providing complete protection against complex threats. Our platform protects email, data, applications, and networks with innovative solutions, and a managed XDR service, to strengthen cyber resilience. Hundreds of thousands of IT professionals and managed service providers worldwide trust us to protect and support them with solutions that are easy to buy, deploy, and use.
 
We are committed to a candidate selection process and work environment that is inclusive and barrier free. To ensure candidates are assessed in a fair and equitable manner, accommodations will be provided to prospective employees in accordance with the Accessibility for Ontarians with Disabilities Act (AODA) and the Ontario Human Rights Code.
Envision yourself at Barracuda 
We seek a passionate and experienced Site Reliability Staff Engineer (SRE) for the Managed Service Provider (MSP)and Managed XDR business units with great technical acumen and a strong background in operations, automation, implementation, and development.
 
As a Staff SRE, you will be responsible for ensuring the availability of high volume, critical SaaS applications and seamless scaling. The application portfolio ranges from a broad spectrum of MSP and XDR products. 
What will you be working on: 
  • Application Infrastructure Design: Engage with internal customers to understand application design and cloud infrastructure needs, focusing on scalability, security, and reliability
  • Infrastructure Automation: Create and design templates, tools, and accelerators for deployment infrastructure to support development teams
  • Architectural Leadership: Lead architectural decisions and approve major system design changes, implementing contemporary architectural patterns
  • Platform Development: Design and develop self-service platforms for Product Engineering teams
  • Service Level Management: Define, implement, and track SLIs, SLOs, and SLAs across services
  • Incident Management: Lead incident response processes and conduct post-incident learning reviews
  • Disaster Recovery: Develop and maintain disaster recovery and business continuity plans
  • Technical Design: Plan and implement non-functional requirements including security, performance, deployment frequency, and monitoring
  • Solution Architecture: Oversee architecture snapshots, solution design, prototyping, and code reviews
  • Technology Stack Implementation: Drive modern solutions using AWS, Kubernetes, GitHub Actions, Jenkins, Terraform, Pulumi, and other current technologies
  • Data Infrastructure: Build support infrastructure for global data pipeline and storage using Databricks, Spark, and ELK stack
  • Deployment Automation: Lead initiatives to convert manual deployments to automated processes
  • Observability Systems: Build and enhance monitoring and reliability systems
  • On-Call Duties: Participate in on-call rotation to ensure 24/7 system reliability
  • Team Development: Mentor junior team members and foster a positive team culture
What you bring to the role:
  • Technical Expertise: 10+ years hands-on infrastructure design experience, including 5+ years cloud development and 3+ years in SRE/DevOps roles 
  • Cloud Infrastructure: Deep expertise in AWS cloud infrastructure development, security, and operations with proven success in large-scale production environments 
  • Infrastructure as Code: Extensive experience with Terraform, CloudFormation, Pulumi, and Crossplane for cloud infrastructure automation 
  • CI/CD & Automation: Strong background with GitHub, GitHub Actions, Jenkins, Packer, Ansible, and Puppet 
  • Deployment Patterns: Expertise in blue/green, canary, rolling deployments, and draining strategies 
  • Container Orchestration: Comprehensive experience with Docker, Kubernetes, and EKS in AWS environments 
  • Programming: Strong coding abilities in Python, Go, Ruby etc.  
  • Operating Systems: Advanced Linux knowledge including system internals 
  • Observability: Extensive experience with New Relic, Elastic APM, CloudWatch, Prometheus, and Grafana... 
  • Data Engineering: Experience with Databricks, Apache Spark, Kafka, and DataStage 
  • Problem Solving: Strong systematic debugging and troubleshooting capabilities 
  • Communication: Excellent verbal and written communication skills 
  • Certifications: AWS certifications (Solutions Architect, DevOps) and Kubernetes certifications (CKA, CKAD, CKS) a plus 
What you’ll get from us:
A team where you can voice your opinion, make an impact, and where you and your experience are valued. Internal mobility – there are opportunities for cross training and the ability to attain your next career step within Barracuda. In addition, you will receive equity, in the form of non-qualifying options.
 
The anticipated on-target earnings range for this role is CAD 122,000 to CAD 162,000. Actual compensation offered will be dependent upon the individual's skills, experience, and qualifications as they directly relate to the requirements of the position, the budget for the position, and applicable employment laws.
 
#LI-hybrid 
 
 

Similar Jobs

NCR Voyix - Software Engineer II - Frontend

NCR Voyix

Cebu City, Central Visayas, Philippines (On-Site)
3 Weeks ago
Resolver - Account Executive

Resolver

Toronto, Ontario, Canada (Hybrid)
1 Week ago
ElevenLabs - Account Executive - Brazil

ElevenLabs

Brazil (Remote)
3 Months ago
bytedance - Account Executive - Lark - Thailand

bytedance

Bangkok, Bangkok, Thailand (On-Site)
3 Months ago
high radius - Project Manager

high radius

Hyderabad, Telangana, India (On-Site)
2 Days ago
Granicus - Senior Software Engineer (SE4) - Ruby with AWS

Granicus

Bengaluru, Karnataka, India (Remote)
1 Month ago
Salesforce - Account Solution Engineer - Dutch / Flemish speaker

Salesforce

Dublin, County Dublin, Ireland (On-Site)
8 Months ago
Adyen - Solutions Architect

Adyen

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Qualcomm - Platform Security Software Architect

Qualcomm

Santa Clara, California, United States (On-Site)
3 Weeks ago
Palo Alto Networks - Principal DevSecOps Engineer (Cortex Cloud)

Palo Alto Networks

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Figma - Enterprise Solutions Consultant

Figma

San Francisco, California, United States (Hybrid)
2 Weeks ago
CGS Carrers - Platform Engineer I

CGS Carrers

Braga, Braga, Portugal (Remote)
1 Month ago
AiDash - Senior Engineering Manager - Devops

AiDash

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Saviynt - Senior Engineer II, Cloud Operations - Federal Team

Saviynt

Los Angeles, California, United States (Hybrid)
3 Months ago
ElevenLabs - Revenue Partnerships

ElevenLabs

India (Remote)
3 Months ago
CyberArk - R&D Manager for IGA group

CyberArk

Israel (Hybrid)
1 Month ago
Ion - Cloud Engineer Kubernetes

Ion

Italy (Hybrid)
8 Months ago
Forescout Technologies  Inc  - Strategic Account Manager

Forescout Technologies Inc

Dallas, Texas, United States (Remote)
4 Months ago
NCR Voyix - App Dev Engineer I

NCR Voyix

Gurugram, Haryana, India (On-Site)
3 Weeks ago
InFeedo AI - Account Executive

InFeedo AI

San Francisco, California, United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Ottawa, Ontario, Canada

Rockstar Games - Animation Systems Programmer

Rockstar Games

Oakville, Ontario, Canada (On-Site)
3 Months ago
Pragma - Game Services Engineer - Co-Dev

Pragma

Canada (Remote)
3 Months ago
Evolution  - Customer Service - Korean Speaking Online Game Show Host

Evolution

Burnaby, British Columbia, Canada (On-Site)
3 Weeks ago
Ubisoft - Team Lead - Character Modelling

Ubisoft

Toronto, Ontario, Canada (On-Site)
2 Months ago
Sika Group - Merchandiser

Sika Group

Edmonton, Alberta, Canada (On-Site)
3 Weeks ago
HoYoverse - Senior Brand Marketing Manager [CA]

HoYoverse

Montreal, Quebec, Canada (Remote)
1 Year ago
Cineplex - Part Time Cast Member

Cineplex

Nanaimo, British Columbia, Canada (On-Site)
1 Month ago
Jam City - Game Designer

Jam City

Canada (Remote)
1 Month ago
Obsidian Entertainment - Engine Programmer (Staff/Senior)

Obsidian Entertainment

Canada (On-Site)
10 Months ago
WildBrain - Modeling/Surfacing Supervisor, CG

WildBrain

Vancouver, British Columbia, Canada (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Ion - Cloud Engineer/Architect (DevOps)

Ion

Italy (On-Site)
8 Months ago
Netomi - Devops Engineer - II

Netomi

Toronto, Ontario, Canada (Remote)
2 Weeks ago
Saviynt - Senior Solutions Engineer

Saviynt

Singapore (Hybrid)
3 Weeks ago
Vercel - Site Reliability Engineer, Compute

Vercel

(Remote)
1 Month ago
Toast - Senior Full Stack Software Engineer - Communication Platform

Toast

Dublin, County Dublin, Ireland (Hybrid)
2 Weeks ago
GoDaddy - Full Stack Software Engineer - AWS

GoDaddy

Serbia (Remote)
1 Month ago
Axon - Sr. Solutions Architect, Fusus

Axon

Atlanta, Georgia, United States (Hybrid)
1 Month ago
Canonical - Senior Site Reliability / Gitops Engineer

Canonical

(Remote)
1 Month ago
NinjaVan - Automation Engineer Assistant Manager

NinjaVan

Jakarta, Indonesia (On-Site)
2 Weeks ago
Luxoft - Solution Architect

Luxoft

Poland, Ohio, United States (Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Alpharetta, Georgia, United States (On-Site)

Reading, England, United Kingdom (On-Site)

Vienna, Vienna, Austria (On-Site)

Chicago, Illinois, United States (On-Site)

Oregon, United States (On-Site)

Oregon, United States (Remote)

Alpharetta, Georgia, United States (On-Site)

Campbell, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Barracuda

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug