Senior Site Reliability Engineer

1 Month ago • 4 Years +

About the job

Summary by Outscal

Senior Site Reliability Engineer with 4+ years of experience in managing mission-critical production systems in AWS. Expertise in DevOps, infrastructure provisioning, container orchestration, Linux, networking, and CI/CD pipelines. Proven ability to automate operations, investigate and resolve issues, and mentor team members.
Must have:
  • AWS Experience
  • DevOps Experience
  • Container Orchestration
  • Linux Expertise
Good to have:
  • RESTful APIs
  • Scripting Languages
  • Go (Golang)
  • Helm
Perks:
  • Inclusive Culture
  • Ambitious Objectives

Flexera saves customers billions of dollars in wasted technology spend. A pioneer in Hybrid ITAM and FinOps, Flexera provides award-winning, data-oriented SaaS solutions for technology value optimization (TVO), enabling IT, finance, procurement and cloud teams to gain deep insights into cost optimization, compliance and risks for each business service. Flexera One solutions are built on a set of definitive customer, supplier and industry data, powered by our Technology Intelligence Platform, that enables organizations to visualize their Enterprise Technology Blueprint™ in hybrid environments—from on-premises to SaaS to containers to cloud.

We’re transforming the software industry. We’re Flexera. With more than 50,000 customers across the world, we’re achieving that goal. But we know we can’t do any of that without our team. Ready to help us re-imagine the industry during a time of substantial growth and ambitious plans? Come and see why we’re consistently recognized by Gartner, Forrester and IDC as a category leader in the marketplace. Learn more at flexera.com.

Senior Site Reliability Engineer (SRE)

Flexera is looking for an experienced Site Reliability Engineer to join our SRE team. We're a fast-growing, category-leading organization with ambitious objectives and a positive, inclusive culture. We're looking for passionate professionals who want to grow their talents and achieve great things. If that sounds like you, we want to talk to you about joining our team.

As a Site Reliability Engineer, you will be tasked with everything from helping with product design to diagnosing issues and writing automated scripts for remediating issues that occur in our production systems. You will be driven to build fault-tolerant, scalable systems and automate away as much operational toil as you can. You align with the goals of the DevOps movement in improving collaboration between the development and operations disciplines.

We are seeking someone with extensive experience working on a SaaS/Cloud product with a microservices architecture.

Responsibilities:

  • Help to eliminate operational toil - seek to automate repetitive operations work.
  • Establish and enhance CI/CD pipelines
  • Create dashboards with Grafana/Prometheus which help communicate the metrics for a given product service.
  • Collaborate with other teams
  • Investigate, debug and provide resolution for customer issues.
  • Mentor team members on cloud computing, infrastructure, and best practices
  • Ensure the security and reliability of shared infrastructure with the Flexera cloud
  • Make reliability a first-class citizen
  • Design, develop and deploy new features for Flexera products/platforms, as defined by goals from the SRE organization.
  • Work with product owners and product engineering teams to perform capacity planning.
  • Work with product engineering teams to understand performance and behavior patterns.
  • Be part of an on-call rotation for alerts that require engineering expertise to diagnose.
  • Help carry out root cause analysis for incidents, and design solutions (both software and human processes) that help ensure the same problem doesn't recur in the same way.

Minimum Qualifications

  • Computer Science degree, or related industry experience managing a mission-critical production system in AWS (or equivalent Azure/Google Cloud) for at least 4 years.

Critical Skills / Competencies

Required:

  • Agile software delivery methodologies
  • Experience managing cloud-based services like AWS or Azure at scale
  • Experience with DevOps
  • Infrastructure provisioning experience
  • Experience deploying to and orchestrating containers (Docker, Kubernetes, etc.)
  • Expertise in Linux and a good understanding of its commands
  • Good networking fundamentals
  • GitHub for collaboration and change management.
  • Experience with AWS services such as EC2, ECS, EKS, S3
  • Database exposure, preferably MySQL, Amazon RDS, and MongoDB

Good to have:

  • Understanding of RESTful APIs and other web-based application concepts
  • Any scripting language experience (Ruby is the current language, but comparable experience in Java, Python, Perl, etc. would suffice)
  • Knowledge of Go (Golang)
  • Knowledge of Helm

Flexera is proud to be an equal opportunity employer.  Qualified applicants will be considered for open roles regardless of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by local/national laws, policies and/or regulations. 

Flexera understands the value that results from employing a diverse, equitable, and inclusive workforce. We recognize that equity necessitates acknowledging past exclusion and that inclusion requires intentional effort. Our DEI (Diversity, Equity, and Inclusion) council is the driving force behind our commitment to championing policies and practices that foster a welcoming environment for all.

We encourage candidates requiring accommodations to please let us know by emailing careers@flexera.com.
