Principal SRE (Site Reliability Engineer)

1 Month ago • 10 Years + • Devops • $176,400 PA - $327,600 PA

Job Summary

Job Description

As a Principal Site Reliability Engineer at SailPoint, you will be a key member of the Reliability Engineering team, driving reliability practices for the Identity Security Cloud platform. You will be an evangelist to engineering teams, helping them improve reliability, capacity, and performance. Responsibilities include coaching teams on observability best practices, analyzing service performance, managing cross-functional requirements, and mentoring on design reviews, code, and automation. You will also influence architectural design for global scale and drive operational excellence.
Must have:
  • 10+ years of SRE or DevOps experience.
  • Experience with cloud infrastructure environments.
  • Proficiency with containerization and Kubernetes.
  • Experience with observability tools like Prometheus.
  • Experience with incident management.
  • Proficiency in one or more programming languages.
  • Strong understanding of Linux, systems, and networking.
  • Excellent communication skills.
  • US Citizenship is required.
Perks:
  • Health and wellness coverage (Medical, dental, vision)
  • Disability coverage (Short-term and long-term)
  • Life insurance and Accidental Death & Dismemberment (AD&D)
  • Flexible vacation policy
  • 401(k) Savings and Investment Plan with company matching
  • Paid parental leave
  • Employee Assistance Program (EAP)

Job Details

SailPoint is the leader in identity security for the cloud enterprise. Our identity security solutions secure and enable thousands of companies worldwide, giving our customers unmatched visibility into the entirety of their digital workforce, ensuring workers have the right access to do their job – no more, no less.  

Built on a foundation of AI and ML, our Identity Security Cloud Platform delivers the right level of access to the right identities and resources at the right time—matching the scale, velocity, and changing needs of today’s cloud-oriented, modern enterprise.

About you and the role:

As a Principal Site Reliability Engineer at SailPoint, you will be a key member of our Reliability Engineering team, driving reliability practices for the Identity Security Cloud platform. We are a team of people who write software to solve scalability, observability, security, reliability, and operability problems. You are passionate about reliability practices and operational excellence. You will be an evangelist to our engineering teams, showing them how to improve the reliability, capacity, and performance of our platform. You enjoy technical leadership and solving complex challenges as well as getting your hands in the code.

What You’ll Make Happen:

  • Make it easy for everyone to create, consume, manage, and scale reliable cloud production services to achieve more

  • Keep up with industry trends to improve end-to-end reliability and maintainability for all services

  • Coach engineering teams on observability best practices, such as setting up well-defined Service Level Objectives (SLOs)

  • Analyze performance of services and recommend infrastructure/code changes that will improve capacity and performance

  • Enable our engineering teams to scale our enterprise operations by providing guidance, best practices and support as part of an SRE Center of Excellence

  • Manage cross-functional requirements working with Engineering, Product, Services, and other departments

  • Be a mentor of quality for design reviews, code, test cases, automation, observability, root cause analysis, and self-healing

  • Influence architectural design, implementation, consolidation, and simplification for global scale

  • Focus on expanding your own skills and improving your teammates' skills

  • Drive operational excellence to deliver frictionless operations, a happy on-call experience, and an optimal customer experience

Roadmap & Timeline for Success:
Within the first 30 days you will:

  • Onboard into your new role, get familiar with our product offering and technology stack

  • If applicable, come up to speed on the Identity Access Management space

  • Get to know your peers, leaders and other engineers to understand current state, challenges and motivations 

  • Get to understand the current state of our reliability practices

  • Interview stakeholders to understand our business goals, product roadmap and challenges ahead of you

By 90 days:

  • You are responsible for the technical architecture of our reliability and capacity planning practices, providing architectural leadership

  • You look beyond the immediate backlog to create and share a forward-thinking technical vision for your team

  • You have a full grasp of our product and are developing a technology strategy for SRE

  • You are prioritizing projects, defining scopes of work, and developing solutions

By 6 months:

  • You are regularly mentoring and coaching members of your team 

  • You own multiple significant projects 

  • You provide technical leadership to your team while also delivering high quality code on your own

  • You lead significant and thoughtful critiques of others’ design documents

  • You consistently achieve targets and meet deadlines, and you ensure the quality of deliverables exceeds expectations

  • You are flexing your lifelong learning muscles, staying abreast of emerging external technologies to understand when to introduce them to your team.

  • You are presenting and communicating reliability gains to executives and other high-level stakeholders

  • With a focus on talent development, you'll mentor and coach new team members, fostering a culture of growth and excellence.

Requirements

  • Due to FedRAMP requirements, US Citizenship is required to be considered for this role

  • 10+ years of experience in SRE or DevOps production operations supporting a highly available environment for a SaaS software or cloud service provider

  • Experience with cloud infrastructure environments, preferably AWS, and Infrastructure as Code, preferably Terraform

  • Strong proficiency with containerization technology and/or Kubernetes

  • In-depth experience with metrics, tracing, and logging observability tools such as Prometheus, Grafana, Honeycomb, and Kibana

  • Experience with incident management, including conducting incident reviews

  • Strong proficiency with one or more programming languages (Java, Python, Go, etc).

  • Strong understanding of Linux, software development, systems, networking, and Cloud concepts

  • A positive and collaborative demeanor, combined with the ability to coach, mentor, and delegate

  • Excellent communication skills

  • Life-long learner – you stay up to date with technology trends, spend time learning new technologies, and share your learnings with your team

  • Bachelor's degree in Computer Science or another technical discipline, or equivalent experience, is preferred but not required

Benefits and Compensation listed vary based on the location of your employment and the nature of your employment with SailPoint.

As a part of the total compensation package, this role may be eligible for the SailPoint Corporate Bonus Plan or a role-specific commission, along with potential eligibility for equity participation. SailPoint maintains broad salary ranges for its roles to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect SailPoint’s differing products, industries, and lines of business. Candidates are typically placed into the range based on the preceding factors as well as internal peer equity. We estimate the base salary, for US-based employees, will be in this range from (min-mid-max, USD):

$176,400 - $252,000 - $327,600

Base salaries for employees based in other locations are competitive for the employee’s home location.

Benefits Overview

1. Health and wellness coverage: Medical, dental, and vision insurance

2. Disability coverage: Short-term and long-term disability

3. Life protection: Life insurance and Accidental Death & Dismemberment (AD&D)

4. Additional life coverage options: Supplemental life insurance for employees, spouses, and children

5. Flexible spending accounts for health care and dependent care; limited purpose flexible spending account

6. Financial security: 401(k) Savings and Investment Plan with company matching

7. Time off benefits: Flexible vacation policy

8. Holidays: 8 paid holidays annually

9. Sick leave

10. Parental support: Paid parental leave

11. Employee Assistance Program (EAP) and Care Counselors

12. Voluntary benefits: Legal Assistance, Critical Illness, Accident, Hospital Indemnity and Pet Insurance options

13. Health Savings Account (HSA) with employer contribution

SailPoint is an equal opportunity employer and we welcome all qualified candidates to apply to join our team.  All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other category protected by applicable law.  

Alternative methods of applying for employment are available to individuals unable to submit an application through this site because of a disability.  Contact hr@sailpoint.com or mail to 11120 Four Points Dr, Suite 100, Austin, TX 78726, to discuss reasonable accommodations.

About The Company

SailPoint is a leading provider of identity security for the modern enterprise. Enterprise security starts and ends with identities and their access, yet the ability to manage and secure identities today has moved well beyond human capacity. Using a foundation of artificial intelligence and machine learning, the SailPoint Identity Security Platform delivers the right level of access to the right identities and resources at the right time—matching the scale, velocity, and environmental needs of today’s cloud-oriented enterprise.
