Staff Engineer - DevOps Site Reliability

6 Months ago • All levels • Devops

Job Summary

Job Description

Experienced L3 SRE engineer needed for a business-critical SaaS application. Responsibilities include L3 support across the full stack (infra, backend, frontend), automating SRE tools, proactive monitoring, handling business pressure, communicating effectively with various teams and end-users, incident/problem management, and working with multitenant applications. Requires strong understanding of networking, CI/CD, Python, and AWS services (especially EKS, serverless technologies, and databases). Experience with Kubernetes, Prometheus, and monitoring/logging tools is essential.
Must have:
  • EKS
  • Github Actions
  • Python (Strong)
  • Kubernetes (Expert)
  • Prometheus
  • L3 support across full-stack
  • Automation of SRE tools
  • Incident/Problem Management
Good to have:
  • GenAI/LLM application experience
  • AWS Managed Services
  • FastAPI and NextJS
  • Websockets
  • Cloud security concepts
  • Terraform

Job Details

Company Description

We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (19000+ experts across 33 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!

Job Description

  • Experienced L3 SRE engineer based on business-critical SaaS application.
  • Capacity to L3 across the full stack including infra, backend and front-end, before escalation to engineering business unit.
  • Capacity to automate SRE tools to provide proactive.
  • L3 support, close to our tech monitoring strategy.
  • Capacity to work under business pressure for business critical applications.
  • Capacity to communicate accordingly with L1,L2, Engineering, Product managers, leadership and end-users during troubleshooting.
  • Capacity to communicate accordingly.
  • Experience with incident and problem management.
  • Experience with multitenant applications.
  • Solid understanding of networking concepts(TCP/IP, DNS, Routing, etc) like VPCs, subnets, firewalls, and load balancing, TLS and SSL.
  • Experience with CI/CD pipelines (e.g., Jenkins, Github Actions) & version control.
  • Python, react/next.
  • Monitoring and logging to analyze & track resource utilization, application performance, and identify potential issues, Grafana, Prometheus, Loki or ELK.
  • Experience with AWS, particularly EKS, serverless, queue & various databases.
  • Solid knowledge Kubernetes.

Qualifications

Must have Skills: EKS, Github Actions, Python (Strong), Kubernetes (Expert), Prometheus.

Good to Have Skills: 

  • Previous experience building a user-facing GenAI/LLM software application.
  • Security best practices in cloud environments. - AWS Managed Services (RDS, Batch, Lambda, Fargate, Step Functions, SQS/SNS, etc.).
  • FastAPI and NextJS experience (if we're still using the latter).
  • Websockets, Server-Side Events, Pub/Sub (RabbitMQ, Kafka, etc.).
  • Cloud security concepts (IAM, access control).
  • Terraform experience. 

Similar Jobs

Survay Monkey - Staff Site Reliability Engineer

Survay Monkey

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Ajmera Infotech - Site Reliability Engineer (SRE) - Kubernetes

Ajmera Infotech

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Survay Monkey - Senior Software Engineer in Test II

Survay Monkey

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Riot Games - Manager, Software Engineering - Infrastructure / Cloud Foundations

Riot Games

Los Angeles, California, United States (On-Site)
6 Months ago
Rockstar Games - Senior DevOps Engineer

Rockstar Games

Edinburgh, Scotland, United Kingdom (On-Site)
10 Months ago
Interactive Brokers - Senior DevOps/Software Engineer

Interactive Brokers

Greenwich, Connecticut, United States (Hybrid)
9 Months ago
N-iX - Senior Data Engineer

N-iX

Kyiv, Kyiv City, Ukraine (Hybrid)
3 Months ago
Luxoft - Senior DevOps Engineer (Azure)

Luxoft

New Delhi, Delhi, India (Remote)
8 Months ago
Rackspace Technology - DEVOP Engineer (AWS Terraform)-PSDE III

Rackspace Technology

India (Remote)
8 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

SS8 - Senior Software Engineer

SS8

Toronto, Ontario, Canada (Hybrid)
4 Months ago
Sumo Logic - Senior Software Engineer II, Open Telemetry Projects

Sumo Logic

Noida, Uttar Pradesh, India (On-Site)
3 Months ago
Lytx - Staff DevSecOps Engineer

Lytx

India (On-Site)
3 Months ago
Tide - Senior Engineer, Backend

Tide

(Remote)
3 Months ago
Thatgamecompany - Live Ops Engineer

Thatgamecompany

United States (Remote)
4 Months ago
Interface AI - SDET IV

Interface AI

(Remote)
3 Months ago
Rackspace Technology - SOC Analyst L3 (Sentinel is mandatory) - R-19060

Rackspace Technology

Gurugram, Haryana, India (Hybrid)
9 Months ago
IManage - Senior AI Software Engineer

IManage

London, England, United Kingdom (Hybrid)
5 Months ago
N-iX - Senior AQA Engineer (Python + Robot)

N-iX

Colombia (Remote)
3 Months ago
Unity - Mobile Automation Engineer

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Colombia

Easygo - Legal & Compliance Senior Officer

Easygo

Bogotá, Bogota, Colombia (On-Site)
4 Months ago
N-iX - Senior AQA Engineer

N-iX

Colombia (Remote)
3 Months ago
N-iX - Key Accounts Engagement Manager

N-iX

Medellín, Antioquia, Colombia (Flexible)
3 Months ago
N-iX - Junior Automation QA Engineer (with Python)

N-iX

Colombia (Remote)
3 Months ago
N-iX - Lead Full Stack Engineer (.NET+React) (#2638)

N-iX

Colombia (Remote)
7 Months ago
Unisys - Vulnerability Analyst - Darktrace Specialist

Unisys

Colombia (On-Site)
3 Months ago
Nagarro - Associate Principal Engineer - Project Manager

Nagarro

Colombia (Remote)
3 Months ago
In labs - Ionic Framework - Hybrid Mobile Developer

In labs

Bogotá, Bogota, Colombia (Hybrid)
3 Months ago
Evolution - Account Payables Specialist

Evolution

Medellín, Antioquia, Colombia (On-Site)
7 Months ago
Nagarro - Senior Staff Engineer - IBM i Systems Administrator

Nagarro

Colombia (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

ByteDance - Software Engineer - Compute Infrastructure (Orchestration & Scheduling)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Google - Customer Engineer, Data Management, Google Cloud

Google

Riyadh, Riyadh Province, Saudi Arabia (On-Site)
3 Months ago
Trend Micro - (Sr.) Cloud Developer (Vision One)

Trend Micro

Taipei City, Taiwan (On-Site)
10 Months ago
Passion Gaming - AWS DevOps Engineer

Passion Gaming

Haryana, India (On-Site)
1 Year ago
The Walt Disney Company - Senior Software Engineer

The Walt Disney Company

London, England, United Kingdom (On-Site)
4 Months ago
Rackspace Technology - Data Architect

Rackspace Technology

Vietnam (Remote)
6 Months ago
ByteDance - Backend Software Engineer - Foundational Technology

ByteDance

Singapore (On-Site)
4 Months ago
Playground Games - Build Engineer - Contract

Playground Games

England, United Kingdom (Hybrid)
4 Months ago
Google - Software Engineer, Site Reliability Engineering, User Data

Google

Sydney, New South Wales, Australia (On-Site)
3 Months ago
Google - Software Engineering Manager II

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

Abu Dhabi, Abu Dhabi, United Arab Emirates (On-Site)

View All Jobs

Get notified when new jobs are added by Nagarro

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug