Sr Observability Engineer (SRE)

SailPoint

Maharashtra, India (On-site)

4 Months ago • 5 Years + • Devops

Job Summary

Job Description

As a Site Reliability Engineer (SRE), you will be embedded within the development team to ensure system reliability, scalability, and performance. Your responsibilities include designing and implementing solutions to improve system reliability, owning operational metrics, developing monitoring and alerting, and collaborating with teams for capacity planning and performance optimization. The role requires participation in on-call rotations and incident management, as well as troubleshooting and documentation of systems and processes. This role requires at least 5 years of experience.

Must have:

5+ years of SRE experience
Understanding of SRE principles and practices
Experience with cloud platforms (AWS, GCP, or Azure)
Proficiency in scripting languages (e.g., Python)
Experience with monitoring and logging tools (e.g., Prometheus)
Experience with containerization and orchestration technologies (e.g., Kubernetes)
Understanding of network protocols and security best practices
Experience with DevOps culture and practices and with CI/CD toolchains
Experience with incident response processes and config management tools (PagerDuty, Git)
Strong problem-solving and troubleshooting skills
Excellent communication and collaboration skills

Good to have:

Experience with Kafka, relational databases
Experience with Grafana K6 – Continuous Performance Tool
Experience with Infrastructure as Code (IaC) tools (e.g., Terraform)

20 skills required for this role

communication

problem-solving

github

game-texts

prototyping

incident-response

aws

azure

prometheus

ansible

terraform

grafana

ci-cd

docker

kubernetes

git

python

bash

java

system-design

Job Details

We are seeking a highly motivated and experienced Site Reliability Engineer (SRE) to join our [Team Name] software development team. This is an embedded role, meaning you will be a full member of the development team, working closely with software engineers, infrastructure platform services, engineering managers, and other stakeholders to ensure the reliability, scalability, and performance of teams’ services. You will be responsible for leveraging the infrastructure, tooling, and processes that support our applications in dev and production, as well as participating in on-call rotations. This role offers a unique opportunity to directly influence the design and architecture of our systems from a reliability and performance perspective.

Responsibilities:

Work with developers and service owners at the intersection of development and operations to solve performance issues and ensure system scalability.

Reliability Engineering: Design, develop, and implement solutions to improve the reliability, availability, performance, and scalability of our systems. Work with technical leaders and infrastructure platform services to develop alerts and dashboards.
Operational Excellence: Own and improve key operational metrics (SLIs, SLOs, error budgets, monitoring and alerting) for team-related services, and drive continuous improvement through post-incident reviews and blameless postmortems of non-functional issues. Create and maintain dashboards, conducting ongoing reviews to identify and address gaps. Improve operational processes and team practices, working with technical leaders and the NOC team.
Monitoring and Alerting: Develop and maintain comprehensive monitoring and alerting to proactively identify and resolve issues.
Capacity Planning: Collaborate with technical leads, DevOps/SRE and infra teams to forecast capacity needs and ensure sufficient resources are available to support growth.
Performance Optimization: Collaborate with performance SMEs to identify and address production performance bottlenecks through profiling, tuning, and optimization of services and infrastructure.
Automation: Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
Collaboration: Work closely with Software, Performance and Test Engineers to influence system design and architecture for operability and reliability.
Documentation: Create and maintain clear and concise documentation for systems, processes, runbooks, and procedures.
On-Call: Participate in on-call rotation.
Incident Management: Participate in on-call rotations and lead incident response efforts, ensuring timely resolution and effective communication. Conduct in-depth incident analysis and help drive completion of post-incident actions.
Troubleshooting skills: Excellent diagnostic and problem-solving skills, with the ability to analyze complex systems and data.

Qualifications:

Bachelor’s degree in computer science, a related field, or equivalent practical experience.
5+ years of proven SRE experience.
Strong understanding of SRE principles and practices.
Experience with cloud platforms (AWS, GCP, or Azure).
Proficiency in at least one scripting language (e.g., Python, Bash, Go).
Experience with monitoring and logging tools (e.g., Prometheus, Grafana).
Coding experience beyond simple scripts in at least one programming language such as Go, Java, or Python, in support of reliability engineering.
Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
Understanding of network protocols and security best practices.
Familiarity with DevOps culture and practices and experience with CI/CD toolchains.
Experience with incident response processes and config management tools (PagerDuty, Git).
Strong problem-solving and troubleshooting skills.
Excellent communication and collaboration skills.
Ability to work independently and as part of a team to achieve the SRE agenda.

Preferred Qualifications:

Experience with Kafka, relational databases, and performance tuning (JVM, Go).
Experience with Grafana K6 – Continuous Performance Tool.
Experience with Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation, Ansible).

What success looks like in the role
Within the first 30 days you will:

Onboard into your new role, get familiar with our product offering and technology, proactively meet peers and stakeholders, set up your test and development environment.
Seek to deeply understand business problems or common engineering challenges and propose software architecture designs to solve them elegantly by abstracting useful common patterns.

By 90 days:

Proactively collaborate on, discuss, debate and refine ideas, problem statements, and software designs with different (sometimes many) stakeholders, architects and members of your team.
Take a committed approach to prototyping and co-implementing systems alongside less experienced engineers on your team—there’s no room for ivory towers here.

By 6 months:

Share support of critical team systems by participating in on-call, learning the characteristics of currently running systems, and participating in improvements.
Occasionally serve as a debugging and implementation expert during escalations of system issues that less experienced engineers have been unable to solve in a timely manner.
Collaborate with Support Management and the Engineering Manager to resolve escalations quickly.

SailPoint is an equal opportunity employer and we welcome all qualified candidates to apply to join our team. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other category protected by applicable law.

Alternative methods of applying for employment are available to individuals unable to submit an application through this site because of a disability. Contact hr@sailpoint.com or mail to 11120 Four Points Dr, Suite 100, Austin, TX 78726, to discuss reasonable accommodations.

About The Company

SailPoint

SailPoint is a leading provider of identity security for the modern enterprise. Enterprise security starts and ends with identities and their access, yet the ability to manage and secure identities today has moved well beyond human capacity. Using a foundation of artificial intelligence and machine learning, the SailPoint Identity Security Platform delivers the right level of access to the right identities and resources at the right time—matching the scale, velocity, and environmental needs of today’s cloud-oriented enterprise.
