Manager, Site Reliability Engineering

8 Months ago • All levels • Devops

Job Summary

Job Description

Lead a team of SREs responsible for building and maintaining Flexera's Snow Atlas platform infrastructure. Manage the day-to-day execution of SRE best practices ensuring reliability, scalability, instrumentation, automation, and performance of Snow's cloud SaaS products. Must have experience as a Site Reliability Engineering in cloud environments, managing a team of Site Reliability Engineers, managing infrastructure in Azure and managing Kubernetes infrastructure.
Must have:
  • SRE Cloud Experience
  • Team Management
  • Azure Infrastructure
  • Kubernetes
Good to have:
  • Monitoring Observability
  • IaC Containers
  • CI/CD Tooling
  • Security Practices
Perks:
  • Remote Work
  • Team Growth

Job Details

Flexera saves customers billions of dollars in wasted technology spend. A pioneer in Hybrid ITAM and FinOps, Flexera provides award-winning, data-oriented SaaS solutions for technology value optimization (TVO), enabling IT, finance, procurement and cloud teams to gain deep insights into cost optimization, compliance and risks for each business service. Flexera One solutions are built on a set of definitive customer, supplier and industry data, powered by our Technology Intelligence Platform, that enables organizations to visualize their Enterprise Technology Blueprint™ in hybrid environments—from on-premises to SaaS to containers to cloud.

We’re transforming the software industry.  We’re Flexera.  With more than 50,000 customers across the world, were achieving that goal. But we know we can’t do any of that without our team Ready to help us re-imagine the industry during a time of substantial growth and ambitious plans?  Come and see why we’re consistently recognized by Gartner, Forrester and IDC as a category leader in the marketplace. Learn more at flexera.com

Build, grow and lead a team that is responsible for implementing the Site Reliability Engineering practices and tools that continually improve the operational readiness, instrumentation, reliability, performance and scalability of Flexera’s Snow Atlas global cloud infrastructure, platform and products. The team is central to the success of Flexera’s SaaS solutions and stakeholders will rely on your knowledge and expertise of SRE and DevOps practices.

Adopting DevOps principles of delivery, the manager is responsible for the deliverables of the central team and works with stakeholders to enable Site Reliability Engineers. The manager will engage with stakeholders to identify and deliver the highest value / priority work that improves SRE capabilities, tools and services. Generation of actionable insights from qualitative and quantitative metrics to continually improve the operational reliability of Snow’s systems.

What you will be doing:

  • Lead, manage and coach a team of Site Reliability Engineers (SREs) responsible for building and maintaining Flexera’s Snow Atlas platform infrastructure and tooling. Manage the day-to-day execution of high-quality, prioritized, deliverables of SRE best practices ensuring the reliability, scalability, instrumentation, automation and performance of Snow’s cloud SaaS products.
  • Being a passionate advocate of the SRE discipline and DevOps principles you will engage, influence, seek feedback, and evangelize best practices with development, operational and support teams to enable stakeholders to support self-service and “you build-it – you run it”.
  • Manage the operational reliability, fault-tolerance, performance, scalability, observability and efficiency of Flexera’s cloud platforms and products across environments.
  • Work on incidents in conjunction with team members and coordinating with wider stakeholders to resolve customer impacting service issues promptly.
  • Partners with security and other “shared services” teams to align, automate, integrate and orchestrate specialist tooling into a common set of SRE best practices that supports the wider Software Delivery Lifecycle and Product Lifecycle.
  • Plan and execute projects in support of the SRE objectives, and ensure projects are delivered with high quality, on time, and within budget
  • Hire, develop and retain a highly skilled SRE team
  • Evaluate hardware and software technologies to improve efficiency and performance

Responsibilities:

  • Manage a team responsible for supporting an international, 24x7, Azure cloud infrastructure powering Flexera’s customer facing service offerings
  • Participate in the design, implementation, and operation of a scalable and reliable systems infrastructure supporting a fast-growth SaaS offering
  • Ensure proper security, monitoring, alerting, and reporting for the infrastructure
  • Troubleshooting and resolving escalated issues
  • Capacity planning for all aspects of the infrastructure
  • Developing and maintaining processes, tools, and documentation in support of the production environment
  • Participate in evaluation of new software, hardware and infrastructure solutions
  • Participation in an on-call rotation and be available 24x7 in an escalation capacity

Required skills and knowledge:

  • Experience as a Site Reliability Engineering in cloud environments
  • Experience managing a team of Site Reliability Engineers
  • Experience managing infrastructure in Azure
  • Experience managing Kubernetes infrastructure in the cloud.
  • Experience in Monitoring & Observability practices in the cloud including tooling, logging, metrics, tracing, and alerting
  • Experience with IaC and Containers to achieve scalable, reliable, performant and secure SaaS platform infrastructure
  • Experience of CI/CD tooling to automate, orchestrate and integrate continuous delivery pipelines

Flexera is proud to be an equal opportunity employer.  Qualified applicants will be considered for open roles regardless of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by local/national laws, policies and/or regulations. 

Flexera understands the value that results from employing a diverse, equitable, and inclusive workforce. We recognize that equity necessitates acknowledging past exclusion and that inclusion requires intentional effort. Our DEI (Diversity, Equity, and Inclusion) council is the driving force behind our commitment to championing policies and practices that foster a welcoming environment for all.

We encourage candidates requiring accommodations to please let us know by emailing careers@flexera.com.

Similar Jobs

Gameopedia - Senior Backend Developer

Gameopedia

Hyderabad, Telangana, India (Hybrid)
9 Months ago
Luxoft - Data Modeller

Luxoft

Pune, Maharashtra, India (On-Site)
7 Months ago
Playrix - Lead QA Engineer

Playrix

Georgia (Remote)
8 Months ago
The Walt Disney Company - Sr Manager, Software Engineer, Quality Engineering

The Walt Disney Company

Glendale, California, United States (Hybrid)
7 Months ago
Diligent - Software Engineer in Test II

Diligent

Bengaluru, Karnataka, India (On-Site)
10 Months ago
Nielsen - Principal Data Engineer - AWS

Nielsen

Mumbai, Maharashtra, India (Hybrid)
8 Months ago
Google - Engineering Manager, Networking

Google

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Luxoft - PostgreSQL Developer with Oracle

Luxoft

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
Google - Cloud Architect Infrastructure, Global Services Delivery, Google Cloud

Google

Pune, Maharashtra, India (On-Site)
7 Months ago
Inworld AI - Staff Cloud DevOps/Site Reliability Engineer (SRE) - Canada

Inworld AI

Vancouver, British Columbia, Canada (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

N-iX - Senior Automation Test Engineer (JavaScript) (#2338)

N-iX

Ukraine (Remote)
7 Months ago
Astreya - DevOps Engineer

Astreya

Hyderabad, Telangana, India (On-Site)
9 Months ago
ION - Sitecore Performance UI/UX Developer - 446

ION

New York, New York, United States (Hybrid)
8 Months ago
Omnissa - C++ & iOS - Senior MTS & Member of Technical Staff - III

Omnissa

Bengaluru, Karnataka, India (Hybrid)
9 Months ago
PublicisGroupe - Senior Associate Infrastructure L1_AWS

PublicisGroupe

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Smart Working - UI Developer

Smart Working

India (Remote)
9 Months ago
Magic Media - C++ Game Developer - Linux

Magic Media

Rio De Janeiro, State Of Rio De Janeiro, Brazil (Remote)
7 Months ago
Forescout Technologies Inc. - QA Engineer

Forescout Technologies Inc.

Ottawa, Ontario, Canada (On-Site)
7 Months ago
BigID - DevOps Engineer

BigID

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
The Walt Disney Company - Software Engineer II

The Walt Disney Company

San Francisco, California, United States (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in undefined

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Google - Technical Solutions Engineer, Networking, Google Cloud

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
7 Months ago
Meltwater - Content Platform Software Engineer

Meltwater

Hyderabad, Telangana, India (Hybrid)
9 Months ago
Luxoft - Cloud Security Architect

Luxoft

(Remote)
7 Months ago
Google - Google Flights Engineer, Google Cloud Professional Services

Google

Cambridge, Massachusetts, United States (On-Site)
7 Months ago
Google - Systems Development Engineer, Customer Deployments

Google

Munich, Bavaria, Germany (On-Site)
7 Months ago
Cision - Senior Site Reliability Engineer (SRE)

Cision

India (Remote)
8 Months ago
PwC - Manager-Senior Cloud Architect| Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Moon Active - DevOps Engineer

Moon Active

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
9 Months ago
Luxoft - Senior Infrastructure Engineer

Luxoft

(Remote)
7 Months ago
Omnissa - Senior Member of Technical Staff (C++ Windows)

Omnissa

Chennai, Tamil Nadu, India (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded

About The Company

How We Roll - Our Culture is built on these Core ValuesCandorTell it like it is — solve problems by dealing with them head onPassionWhat we do may not be for everyone, but we devour it and love making our customers successfulProfessionalism and EthicsAnyone can just "have a job" — we look for people that strive to “go pro”Keep ScoreAccountability and transparency are vitally importantCelebrate SuccessLife is short and we work hard to keep our company operating at a high levelGive BackWe expect to give back to the communities in which we do business

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (On-Site)

United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

United States (On-Site)

United Kingdom (On-Site)

Bengaluru, Karnataka, India (Hybrid)

United States (Remote)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Flexera Software

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug