Manager, Cloud Site Reliability Engineering

1 Month ago • 5 Years +

Job Summary

Job Description

The Manager, Cloud Site Reliability Engineering will lead a team to ensure the availability of high-volume, critical SaaS applications and seamless scaling. Responsibilities include designing scalable infrastructure architectures, improving system reliability, leading incident management, and driving automation. This role involves leading a team, fostering innovation, and collaborating across teams to drive operational excellence. The application portfolio includes a broad spectrum of Data Protection and Network Security products.
Must have:
  • 5+ years experience leading SRE/DevOps teams.
  • Deep understanding of distributed systems and cloud platforms.
  • Experience with SLOs, SLIs, and SLAs implementation.
  • Experience in hiring, mentoring, and team development.
  • Ability to develop and execute technical roadmaps.
  • Track record of solving complex technical challenges.
  • Excellence in communicating technical concepts.
Perks:
  • Opportunity to voice your opinion and make an impact.
  • Internal mobility and cross-training opportunities.
  • Equity in the form of non-qualifying options.

Job Details

Req ID: 26-125
 
Come join our passionate team! Barracuda is a leading cybersecurity company providing complete protection against complex threats. Our platform protects email, data, applications, and networks with innovative solutions, and a managed XDR service, to strengthen cyber resilience. Hundreds of thousands of IT professionals and managed service providers worldwide trust us to protect and support them with solutions that are easy to buy, deploy, and use.
 
We know a diverse workforce adds to our collective value and strength as an organization. Barracuda Networks is proud to be an employer that complies with all applicable national, state and local laws pertaining to nondiscrimination and equal opportunity regardless of race, gender, religion, sex, sexual orientation, national origin, or disability.
Envision yourself at Barracuda 
We seek a passionate, experienced Manager, Cloud Site Reliability Engineering for Data Protection and Network Security business units with great technical acumen and a strong background in operations, automation, implementation, and development. 
 
As a Manager, Cloud Site Reliability Engineering, you will be leading a team responsible for ensuring the availability of high volume, critical SaaS applications, and seamless scaling. The application portfolio ranges from a broad spectrum of Data Protection and Network Security products. 
 
What you will be working on: 
  • Platform Architecture: Design and implement scalable infrastructure architectures that support high availability and reliability across multiple cloud environments 
  • Reliability Engineering: Lead initiatives to improve system reliability, establish SLOs, and implement monitoring and alerting strategies
  • Team Leadership: Build, mentor, and grow a high-performing SRE team while fostering a culture of innovation and continuous improvement
  • Incident Management: Establish and optimize incident response processes, lead major incident reviews, and drive systematic improvements
  • Automation Development: Spearhead automation initiatives to reduce manual operations and improve system reliability
  • Performance Optimization: Lead projects to optimize system performance, capacity planning, and cost efficiency
  • Cross-team Collaboration: Work closely with development teams to implement SRE best practices and drive operational excellence
  • Technical Strategy: Develop and execute technical roadmaps aligned with business goals and scaling requirements
  • Security Integration: Ensure security best practices are embedded in infrastructure and operational processes
  • Knowledge Management: Establish documentation standards and knowledge sharing practices across the organization
  • Vendor Management: Evaluate and manage relationships with technical vendors and service providers
  • Operational Excellence: Drive continuous improvement in operational processes, tooling, and methodologies
What you bring to the role: 
  • Technical Leadership Experience: 5+ years of experience leading and managing SRE/DevOps teams, with a proven track record of improving system reliability and performance 
  • Architectural Vision: Deep understanding of distributed systems, cloud platforms (AWS/GCP/Azure), and modern infrastructure technologies
  • Operational Excellence: Strong background in implementing SLOs, SLIs, and SLAs, with expertise in incident management and post-mortem processes
  • Team Development: Experience in hiring, mentoring, and growing high-performing technical teams while fostering a culture of continuous learning
  • Strategic Planning: Ability to develop and execute technical roadmaps aligned with business objectives and scalability requirements
  • Problem-Solving Skills: Track record of solving complex technical challenges and implementing sustainable solutions
  • Communication: Excellence in communicating technical concepts to both technical and non-technical stakeholders
  • Automation Expertise: Strong background in infrastructure automation, CI/CD pipelines, and DevOps practices
  • Risk Management: Experience in capacity planning, disaster recovery, and building resilient systems
  • Cross-functional Collaboration: Proven ability to work effectively with product, development, and business teams
  • Change Management: Experience in managing organizational change and driving adoption of new technologies and practices
  • Budget Management: Skills in resource allocation, cost optimization, and managing operational budgets
What you’ll get from us:
A team where you can voice your opinion, make an impact, and where you and your experiences are valued. Internal mobility – there are opportunities for cross training and the ability to attain your next career step within Barracuda. In addition, you will receive equity, in the form of non-qualifying options.
#LI-hybrid


 
 

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Reading, England, United Kingdom

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Chicago, Illinois, United States (On-Site)

New Hampshire, United States (On-Site)

Miami, Florida, United States (On-Site)

Reading, England, United Kingdom (Hybrid)

Bengaluru, Karnataka, India (On-Site)

Ann Arbor, Michigan, United States (Remote)

United States (Remote)

Illinois, United States (Remote)

Chelmsford, Massachusetts, United States (Hybrid)

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)

View All Jobs

Get notified when new jobs are added by Barracuda Networks Inc

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug