Jobs Courses Resources Companies Placements

Home >

Jobs >

Sr. Observability Engineer

Sailpoint

Maharashtra, India (On-site)

Sr. Observability Engineer

4 Months ago • 5 Years +

Job Summary

Job Description

As a Sr. Observability Engineer - UI, you will enhance the monitoring, visibility, and performance of front-end applications. Responsibilities include designing and implementing observability solutions for NodeJS based services and micro front-end applications, improving system reliability and scalability, owning key operational metrics, developing monitoring and alerting systems, optimizing performance, automating tasks, collaborating with engineers, and creating documentation. You will also participate in on-call rotations and lead incident response efforts. Within the first 30 days, you will onboard and understand the technology. By 90 days, you'll collaborate on designs. By 6 months, you'll support critical systems and assist with issue resolution.

Must have:

5+ years of SRE experience
Strong understanding of SRE principles
Experience with cloud platforms
Proficiency in scripting languages
Experience with monitoring and logging tools
Experience with NodeJS services and front-end tooling
Coding experience in various languages
Experience with containerization and orchestration
Understanding of network protocols and security best practices
Familiarity with DevOps culture and CI/CD
Experience with Incident Response processes
Strong problem-solving and troubleshooting skills
Excellent communication and collaboration skills

15 skills required

15 skills required for this role

Add these skills to join the top 1% applicants for this job

front-end

problem-solving

python

communication

java

bash

javascript

typescript

prototyping

ci-cd

npm

github

kubernetes

nestjs

grafana

Job Details

Sr .Observability Engineer - UI: We are seeking a Front-End focused Site Reliability Engineer to enhance the monitoring, visibility, and performance of our front-end applications. This roles bridges UI software development with observability best practices, ensuring our enterprise applications deliver seamless, high-performance experiences. You will be responsible for designing and implementing observability and reliability solutions that provide actionable insights into user experience, performance bottlenecks, and system reliability.Responsibilities:
Observability Solutions:
Design and implement observability solutions (logs, metrics, traces, dashboards, alerts) tailored for NodeJS based services and micro front-end applications. Collaborate with front-end developers to instrument code for better observability and debugging.
Reliability Engineering: Design, develop, and implement solutions to improve the reliability, availability, performance, and scalability of our systems. Work with technical leaders and infrastructure platform services to develop alerts and dashboards.
Operational Excellence: Own and improve key operational metrics (SLIs, SLOs, Error Budgets, monitoring and alerting) for team related services and drive continuous improvement through post-incident reviews and blameless postmortems of non-functional issues. Develop and maintain comprehensive monitoring, alerting to proactively identify and resolve issues. Conduct ongoing reviews to address and optimize gaps. Work with technical leaders and NOC team to improve operational processes and team practices.
Monitoring and Alerting: Develop and maintain comprehensive monitoring and alerting to proactively identify and resolve issues.
Performance Optimization: Collaborate with performance subject matter experts to identify and address production performance bottlenecks through profiling, tuning, and optimization of services and infrastructure.
Automation: Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
Collaboration: Work closely with Software, Performance and Test Engineers to influence system design and architecture for operability and reliability.
Documentation: Create and maintain clear and concise documentation for systems, processes, runbooks, and procedures.
On-Call: Participate in on-call rotation.
Incident Management: Participate in on-call rotations and lead incident response efforts, ensuring timely resolution and effective communication. Conduct in-depth incident analysis and help drive completion of post-incident action.
Troubleshooting skills: Excellent diagnostic and problem-solving skills, with the ability to analyze complex systems and data
Qualifications:

Bachelor’s degree in computer science, a related field, or equivalent practical experience.
Proven 5+ years of SRE or similar experience
Strong understanding of SRE principles and practices.
Experience with cloud platforms (AWS, GCP, or Azure).
Proficiency in at least one scripting language (e.g., Python, Bash, Go).
Experience with monitoring and logging tools (e.g., Prometheus, Grafana).
Experience with NodeJS based services (e.g. NestJS) and front-end tooling (e.g. webpack, npm, single-spa)
Level of coding experience beyond simple scripts with programming languages such as Java, JavaScript, TypeScript, Go, or Python to help build reliability engineering
Experience with containerization and orchestration technologies (e.g., Docker, Kubernetes).
Understanding of network protocols, and security best practices
Familiarity with DevOps culture and practices and experience with CI/CD toolchains
Experience with Incidence Response processes and config management tools (PagerDuty, Git),
Strong problem-solving and troubleshooting skills.
Excellent communication and collaboration skills.
Ability to work independently and as part of a team to achieve the SRE agenda

What success looks like in the role
Within the first 30 days you will:

Onboard into your new role, get familiar with our product offering and technology, proactively meet peers and stakeholders, set up your test and development environment.
Seek to deeply understand business problems or common engineering challenges and propose software architecture designs to solve them elegantly by abstracting useful common patterns.

By 90 days:

Proactively collaborate on, discuss, debate and refine ideas, problem statements, and software designs with different (sometimes many) stakeholders, architects and members of your team.
Take a committed approach to prototyping and co-implementing systems alongside less experienced engineers on your team—there’s no room for ivory towers here.

By 6 months:

Share support of critical team systems by participating in call, learning the characteristics of currently running systems, and participating in improvements.
Occasionally serve as a debugging and implementation expert during escalations of systems issues that have evaded the ability of less experienced engineers to solve in a timely manner.
Collaborates with Support Management and Engineering Manager to quick resolution of escalation.

SailPoint is an equal opportunity employer and we welcome all qualified candidates to apply to join our team. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other category protected by applicable law.

Alternative methods of applying for employment are available to individuals unable to submit an application through this site because of a disability. Contact hr@sailpoint.com or mail to 11120 Four Points Dr, Suite 100, Austin, TX 78726, to discuss reasonable accommodations.

Similar Jobs

Software Engineer, Fullstack

Grab

Beijing, China (On-Site)

• 4 Months ago

R&D Verification Engineer

Ansys

(Remote)

• 3 Months ago

Software Engineer III, Full Stack, Core

Google

Mexico City, Mexico City, Mexico (On-Site)

• 4 Months ago

Staff Software Engineer - Front End

Illuminia

Singapore (On-Site)

• 3 Months ago

Software Developer

Trend Micro

Austin, Texas, United States (On-Site)

• 3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Embedded Software Engineer

Motorola solutions

Penang, Malaysia (On-Site)

• 3 Months ago

Senior/Lead Full Stack Engineer (.NET+React)

N-ix

Colombia (Remote)

• 6 Months ago

Technical Leader Fullstack

Thales

Nantes, Pays De La Loire, France (On-Site)

• 3 Months ago

Principal Software Engineer

Insight Software

Hyderabad, Telangana, India (On-Site)

• 3 Months ago

Principal Full Stack Software Engineer

GoDaddy

Colombia (Remote)

• 3 Months ago

Software Development Engineer 2 - React Native

DMG

Bengaluru, Karnataka, India (On-Site)

• 5 Months ago

Senior Software Engineer

Immutable

Australia (Hybrid)

• 5 Months ago

Senior Software Engineer, Full Stack, Google Cloud

Google

(On-Site)

• 9 Months ago

Software Engineer (Frontend)

bazzar voice

Belfast, Northern Ireland, United Kingdom (Hybrid)

• 5 Months ago

(Senior) Frontend Engineer

Cognite

Phoenix, Arizona, United States (Hybrid)

• 4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Pune, Maharashtra, India

Drupal Developer

hogarth

Chennai, Tamil Nadu, India (On-Site)

• 4 Months ago

Oracle HCM Cloud Fusion Consultant

Capgemini

India (On-Site)

• 3 Months ago

Senior Software Engineer Core MSG

sinch

Noida, Uttar Pradesh, India (Hybrid)

• 3 Months ago

Assistant Manager:: QCN Sales

Qube Cinema

Surat, Gujarat, India (On-Site)

• 3 Months ago

NOC Manager

Rockstar Games

Bengaluru, Karnataka, India (On-Site)

• 3 Months ago

Software Development Engineer

quience

Bengaluru, Karnataka, India (On-Site)

• 3 Months ago

Consultant - RDC TC MSOFT

PwC

Kolkata, West Bengal, India (On-Site)

• 11 Months ago

Customer Insights (CDP) Consultant

Hitachi

Pune, Maharashtra, India (Remote)

• 10 Months ago

Order to Cash Operations Senior Analyst

Accenture

Bengaluru, Karnataka, India (On-Site)

• 4 Months ago

IT Manager

nextgen-clearing

Ahmedabad, Gujarat, India (On-Site)

• 6 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Sailpoint

107 Active Jobs

SailPoint is a leading provider of identity security for the modern enterprise. Enterprise security starts and ends with identities and their access, yet the ability to manage and secure identities today has moved well beyond human capacity. Using a foundation of artificial intelligence and machine learning, the SailPoint Identity Security Platform delivers the right level of access to the right identities and resources at the right time—matching the scale, velocity, and environmental needs of today’s cloud-oriented enterprise.

Get notified when new jobs are added by Sailpoint

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

A global community of game builders. Helping people upskill and land jobs in the best gaming studios.

Company

Key Links

hello@outscal.com

Made in INDIA 💛💙

Sr. Observability Engineer

Job Summary

Job Description

15 skills required

15 skills required for this role

Job Details

Similar Jobs

Software Engineer, Fullstack

R&D Verification Engineer

Software Engineer III, Full Stack, Core

Staff Software Engineer - Front End

Software Developer

Similar Skill Jobs

Embedded Software Engineer

Senior/Lead Full Stack Engineer (.NET+React)

Technical Leader Fullstack

Principal Software Engineer

Principal Full Stack Software Engineer

Software Development Engineer 2 - React Native

Senior Software Engineer

Senior Software Engineer, Full Stack, Google Cloud

Software Engineer (Frontend)

(Senior) Frontend Engineer

Jobs in Pune, Maharashtra, India

Drupal Developer

Oracle HCM Cloud Fusion Consultant

Senior Software Engineer Core MSG

Assistant Manager:: QCN Sales

NOC Manager

Software Development Engineer

Consultant - RDC TC MSOFT

Customer Insights (CDP) Consultant

Order to Cash Operations Senior Analyst

IT Manager

Similar Category Jobs

Looks like we're out of matches

About The Company

Sales Executive

Recruiter

Senior Salesforce Developer

Customer Success Manager

Account Executive

Senior Software Engineering Manager

GTM System Administrator – IT Front Door

Digital Sales Representative

Sales Executive

Sales Executive

Level Up Your Career in Game Development!