Senior Site Reliability Engineer

1 Month ago • 7 Years + • Devops • $175,000 PA - $190,000 PA

Job Summary

Job Description

As a Senior Site Reliability Engineer at Exabeam, you will be responsible for ensuring the high availability, reliability, security, and scalability of Exabeam products and services. Your responsibilities will include maintaining the 24x7 production environment, creating and monitoring dashboards for key infrastructure metrics, ensuring services are designed for 24/7 availability, automating infrastructure management, and implementing automation for cloud services. You will also be involved in documenting actions, defining non-functional requirements, and resolving service defects. This role requires close collaboration with software engineering, architecture, and operations teams.
Must have:
  • Maintain 24x7 production environment with high service availability.
  • Create and monitor dashboards and alerts for key infrastructure metrics.
  • Ensure services are designed with 24/7 availability and readiness.
  • Develop processes, tools, automation, and software changes.
  • Automate infrastructure management and maintenance.
Good to have:
  • Professional experience leveraging public cloud solutions.
  • Experience with Kubernetes & Docker and deployment via pipeline.
  • Experience with cloud architecture patterns.
  • Experience in massive scale web operations.
Perks:
  • Extensive medical, dental and vision coverage.
  • Generous 401(k) employer match.
  • Paid Time off including flex time, volunteer day, birthday, holidays.
  • Learning center for career planning and skill development.
  • A culture of passionate, diverse, committed professionals.

Job Details

Description

Exabeam is a leader in intelligence and automation that powers security operations for the world’s smartest companies. As a global cybersecurity innovator, Exabeam provides industry-proven, security-focused, and flexible solutions for faster, more accurate threat detection, investigation, and response (TDIR). Learn more at www.exabeam.com.  
You’re someone who enjoys being directly accountable for the reliability of a business-critical, large-scale enterprise system. You’re comfortable guiding and making decisions with limited information and are capable of operating within the trade-offs present when solving for immediate needs versus solving with bigger scale solutions. You might be considered a subject matter expert in systems reliability and you feel rewarded by working to develop operability culture in a quickly growing and changing environment. You’re comfortable owning a wide and diverse set of problem areas and are willing to go out of your lane to affect change. You may have developed one or more metrics, log aggregation or performance analysis systems in your career.          
          
This is a fantastic opportunity to work and collaborate closely with our software engineering, architecture and operations teams at Exabeam. Our Site Reliability Engineers are responsible for ensuring Exabeam products and services are highly available, reliable, secure and scalable. The ideal candidates are fluent in systems programming and/or automation and can leverage their experience to solve complex problems associated with running production environments at massive scale in multi-tenant environments. We’re creating cool, disruptive products …come join us!   
       
What You'll Do          
  • Maintain 24x7 production environment with a high level of service availability. Perform quality reviews, manage operational issues
  • Create and monitor dashboards and alerts for key infrastructure metrics, and business KPIs that relate to site reliability. Make monitoring and alerting alert on symptoms and not on outages.
  • Ensure services are designed with 24/7 availability and operational readiness and rigor
  • Develop processes, tools, automation, and software changes to address operational issues
  • Automate infrastructure management and maintenance with the aim of empowering the team and ensuring site reliability
  • Implement automation and orchestration for manual processes required to operate and deploy cloud services, be at the heart of developing new ideas into internal OPS/SRE tools by working closely with advanced technology
  • Document every action so your findings turn into repeatable actions–and then into automation.
  • Define non-functional requirements as part of the product lifecycle to influence the new designs, standards, and methods for scalable, highly available distributed systems
  • Resolution of product/service defects or design changes, infrastructure changes, or operational changes
  • Identifies, evaluates and executes preventive measures to minimize/avoid impact to the customers experience. Proactive v/s Customer escalated
Who You Are          
  • A self-starter who's comfortable working independently without a ton of supervision
  • A software engineer with a curiosity for operations, or an operations engineer that wants to work closely with software engineers to help improve response times, scalability and availability.
  • You're obsessive compulsive, in a good way. Your systems and scripts are clean, well-documented and comprehensible.
  • You hate doing the same thing twice, you'd rather spend the time to automate a problem away rather than having to spend time on it again.
  • You are collaborative and are excited to empower the engineering team to work better and faster
  • Fluency with at least one current generation scripting language used by DevOps professionals (Python, Perl, PHP, Ruby) + Java Development
  • You have a passion for learning when it comes to working with new technologies or languages
  • You live and breathe scalable web architectures.
  • You're cool in a crisis and can align with others to ensure complex problems meet a timely and effective resolution.
  • You've worked with Linux, containers/namespaces, and system automation tools for Unix and/or cloud platforms.
  • You have 7+ years of relevant technical experience
  • BS in Computer Science, Computer Engineering, Math, or equivalent professional experience
Bonus Points For          
  • Professional experience leveraging public cloud solutions, with an emphasis on AWS
  • Experience with Kubernetes & Docker and automated deployment via pipeline
  • Experience with elastically scalable, fault tolerance and other cloud architecture patterns
  • Deep understanding of the software delivery process with the ability to implement and enforce that process across the organization
  • Demonstrated strength in SaaS services, experience in massive scale web operations
  • You have experience infrastructure-as-code tooling and approaches
  • Advanced knowledge of Unix/Linux systems: feel very comfortable at the command line
  • In-depth understanding of web operations best practices + infrastructure as code
  • Experience with DevOps (+ DevSecOps) methodologies is a plus
Exabeam Total Rewards offers you: 
(Subject to applicable eligibility requirements)
    • Extensive medical, dental and vision coverage to meet your healthcare needs and employer Health Savings Account contribution to help pay for health expenses now or in the future
    • Generous 401(k) employer match to help you save for your future
    • Paid Time off including “take what you need” flex time, volunteer day of service, your birthday, parental leave, holidays and more
    • Widespread learning center for career planning and skill development to grow your career
    •  A culture of passionate, diverse, committed professional
 
The annual starting salary for this position is between $175,000-$190,000 annually depending on experience and other qualifications of the successful candidate.
 
Bring your Whole Self to Work!
Diversity, equity, and inclusion are at the core of who we are. At Exabeam, we know that diverse perspectives spark innovation, improve creativity, and position our team for success. Creating a culture where all are welcomed, valued, and empowered to achieve their full potential is important to who we are today and in the future. We hire the best of the best and do not discriminate based on race, gender, age, religion, sexual orientation, identity, or other personal factors. 

Similar Jobs

CyberArk - Account Executive

CyberArk

Calgary, Alberta, Canada (On-Site)
1 Month ago
Canva - Senior Product Marketing Manager - Monetization (12 Month Fixed-Term Contract)

Canva

Auckland, Auckland, New Zealand (Remote)
6 Days ago
USE Insider - Senior Marketing Manager, North America

USE Insider

United States (Remote)
4 Months ago
Trek - IT Procurement Analyst - Level I

Trek

Haryana, India (On-Site)
4 Months ago
attentive - Account Manager, Mid-Market Sales

attentive

United States (Remote)
2 Weeks ago
GoTo Group - Senior DevOps Engineer

GoTo Group

Jakarta, Indonesia (On-Site)
2 Months ago
Sailpoint - Senior SRE (Site Reliability Engineer)

Sailpoint

Mexico (Remote)
1 Week ago
Netomi - Devops Engineer - II

Netomi

Toronto, Ontario, Canada (Remote)
1 Week ago
Enphase Energy - Senior Staff Engineer, Energy Management Cloud (Backend)

Enphase Energy

Bengaluru, Karnataka, India (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Dialpad AI - AI Productivity Analyst

Dialpad AI

San Ramon, California, United States (On-Site)
4 Weeks ago
clevertap - Senior Account Executive (New Business)

clevertap

Jakarta, Indonesia (Hybrid)
7 Months ago
Zenoti - Project Manager, Customer Onboarding (SaaS)

Zenoti

Seattle, Washington, United States (On-Site)
1 Month ago
Sailpoint - Digital Sales Representative

Sailpoint

London, England, United Kingdom (Hybrid)
1 Month ago
Cognite - Portfolio Manager

Cognite

Oslo, Oslo, Norway (Hybrid)
4 Months ago
Postman - People Resource Business Partner

Postman

San Francisco, California, United States (Hybrid)
1 Month ago
Agara labs - Senior Enterprise Account Executive

Agara labs

California City, California, United States (Remote)
1 Month ago
HHA Exchange - Customer Success Manager

HHA Exchange

New York, New York, United States (On-Site)
1 Month ago
Glean - Solutions Architect

Glean

Seattle, Washington, United States (On-Site)
1 Month ago
Make - Business Development Representative

Make

Raleigh, North Carolina, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in United States

The Walt Disney Company - Parks Signature Fine Dining Culinary - Full Time

The Walt Disney Company

Anaheim, California, United States (On-Site)
1 Month ago
Open Systems Technologies - Delivery Driver

Open Systems Technologies

Big Bear Lake, California, United States (On-Site)
1 Week ago
Jane Street - FTR Trader

Jane Street

New York, United States (On-Site)
1 Month ago
Meta - Software Engineer, Intern/Co-op

Meta

Menlo Park, California, United States (On-Site)
7 Months ago
Intel  - GPU IP Verification Engineer

Intel

Folsom, California, United States (Hybrid)
1 Month ago
rivos - GPGPU Runtime Software Engineer

rivos

Santa Clara, California, United States (Hybrid)
1 Month ago
Zinnia - Senior Manager, Commercial Strategy

Zinnia

Greenwich, Connecticut, United States (Hybrid)
1 Month ago
Hawkeye Innovations - NBA Operations Lead - Officiating

Hawkeye Innovations

Atlanta, Georgia, United States (Hybrid)
2 Months ago
Expedia - Sr Manager, People Partner

Expedia

San Francisco, California, United States (On-Site)
3 Weeks ago
Fandom  - YouTube Audience Specialist

Fandom

Los Angeles, California, United States (Remote)
6 Days ago

Get notifed when new similar jobs are uploaded

Devops Jobs

T systems - IT Process Software Architect

T systems

Pune, Maharashtra, India (On-Site)
3 Months ago
UXBERT Labs - Senior Solution Architect (IoT/Bluetooth Integration)

UXBERT Labs

Riyadh, Riyadh Province, Saudi Arabia (Hybrid)
5 Months ago
Penn Interactive - Senior Machine Learning Engineer, Platform

Penn Interactive

Philadelphia, Pennsylvania, United States (On-Site)
3 Weeks ago
bytedance - Cloud Technical Support Engineer

bytedance

Singapore (On-Site)
3 Months ago
NVIDIA - Senior Software Architect, Accelerated Computing SDN

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Wind River - Senior Solutions Architect

Wind River

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Apple - Sr. Software Engineer - Cloud Platform, Kubernetes (ASE)

Apple

Cupertino, California, United States (On-Site)
4 Weeks ago
Ion - Site Reliability Engineer

Ion

Milan, Lombardy, Italy (Hybrid)
8 Months ago
bytedance - Senior Software Engineer, Traffic Platform

bytedance

San Jose, California, United States (On-Site)
7 Months ago
Nagarro - SAP SuccessFactors Solution Architect (m/f/d)

Nagarro

Germany (Remote)
8 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Workplace equality and inclusion are not just words or topics for LogRhythm, they are part of our core values, beliefs, and essential to our culture. We hire the best of the best and do not discriminate based on race, gender, age, religion, sexual orientation, identity, or other personal factors. LogRhythm was built on the principals of innovation, dedication, creativity, and commitment. It is through these essential areas we were able to grow as an equal and inclusive workplace, one where our employees feel respected and safe in.

Pune, Maharashtra, India (On-Site)

United States (On-Site)

Broomfield, Colorado, United States (On-Site)

United States (On-Site)

Maidenhead, England, United Kingdom (On-Site)

Indonesia (On-Site)

Minato City, Tokyo, Japan (On-Site)

Pune, Maharashtra, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Logrhytm

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug