Incident Management

2 Months ago • 6 Years +

Job Summary

Job Description

The Incident Management role involves overseeing major incident management, ensuring rapid response and resolution within the organization. Responsibilities include collaborating with cross-functional teams to identify, analyze, and remediate incidents, communicating updates to stakeholders, and proactively identifying potential issues. The role requires utilization of DataDog for monitoring and incident detection, and awareness of AWS/OCI cloud operations, Micro Services, including node and pod management. Furthermore, this position demands performing L1 troubleshooting based on logs and documented processes to save time for development teams. This role is crucial for maintaining system stability and minimizing downtime.
Must have:
  • 6+ years of experience in incident management
  • Proven work experience in incident management within a streaming or media organization
  • Strong understanding of troubleshooting DataDog, AWS K8S, and other microservices
  • Proactive mindset and ability to work under pressure
Good to have:
  • Familiarity with Slack for team communication
  • Knowledge of JIRA for ticketing and documentation
  • Knowledge of PagerDuty for incident response
  • Understanding of Tableau for data visualization

Job Details

Key Responsibilities

  • Oversee major incident management within the organization, ensuring rapid response and resolution.
  • Collaborate with cross-functional teams to identify, analyze, and remediate incidents.
  • Communicate effectively with stakeholders and provide timely updates.
  • Proactively identify potential issues and prepare mitigation strategies.
  • Utilize DataDog for monitoring and incident detection
  • Aware of AWS / OCI cloud operations, Micro Services, including node and pod management
  • Perform L1 troubleshooting based on logs, documented processes to save time for development teams.

Qualifications

  • 6+ years experience
  • Proven work experience in incident management within a streaming or media organization.
  • Strong understanding and troubleshooting on DataDog, AWS K8S and other microservices.
  • Proactive mindset and ability to work under pressure.

Good to Have

  • Familiarity with Slack for team communication.
  • Knowledge of JIRA for ticketing and documentation
  • Knowledge of PagerDuty for incident response.
  • Understanding of Tableau for data visualization.

Similar Jobs

Social Discovery Group - Jira Developer

Social Discovery Group

(Remote)
2 Months ago
Aristocrat Gaming - Delivery Manager - Technical Projects

Aristocrat Gaming

Sofia, Sofia City Province, Bulgaria (Hybrid)
3 Months ago
Wargaming - Art Project Manager (World of Warships Franchise)

Wargaming

Belgrade, Serbia (Hybrid)
2 Months ago
Jumio - Customer Escalation Engineer

Jumio

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Epic Games - Senior Producer (Machine Learning)

Epic Games

London, England, United Kingdom (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

roof games - Chief Operating Officer (COO)

roof games

İstanbul, İstanbul, Türkiye (On-Site)
2 Months ago
Info Stretch - Programmer Analyst 5

Info Stretch

Lansing, Michigan, United States (Hybrid)
7 Months ago
Ubisoft - Internal Communication Assistant - Intranet Project - Internship (12 months)

Ubisoft

Paris, Île-de-France, France (On-Site)
2 Months ago
Globalization Partners - Benefits Specialist

Globalization Partners

(Remote)
4 Months ago
Boomi - Software Quality Engineer (Automation)

Boomi

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Ten Square Games - Junior Game Designer

Ten Square Games

Wrocław, Lower Silesian Voivodeship, Poland (Hybrid)
1 Month ago
DNEG - Project Coordinator

DNEG

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Rockstar Games - Production Coordinator: Technology

Rockstar Games

Edinburgh, Scotland, United Kingdom (On-Site)
1 Month ago
Zynga - Senior Software Engineer - Website Development

Zynga

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Epic Games - Senior Tools Programmer - Interoperability

Epic Games

(On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Mexico City, Mexico City, Mexico

Google - Product Marketing Manager, Android

Google

Mexico City, Mexico City, Mexico (On-Site)
2 Months ago
Xepelin - Operations Associate

Xepelin

Mexico City, Mexico (Hybrid)
1 Month ago
Scale AI - Growth Recruiter (Mexico)

Scale AI

Mexico City, Mexico (On-Site)
2 Months ago
Philips - Demand Planning Intern

Philips

Huixquilucan De Degollado, State Of Mexico, Mexico (On-Site)
1 Month ago
McDonald's Corporation - Software Engineer I - Android

McDonald's Corporation

Mexico City, Mexico (On-Site)
2 Months ago
LTI Mindtree - Salesforce Release Engineering

LTI Mindtree

Mexico City, Mexico City, Mexico (On-Site)
2 Months ago
London stock Exchange - Specialist Solutions Developer

London stock Exchange

Mexico City, Mexico (On-Site)
1 Month ago
Valeo - Product Technical Leader

Valeo

Querétaro, Mexico (On-Site)
2 Months ago
Lionbridge Games - Test Manager

Lionbridge Games

Mexico City, Mexico City, Mexico (On-Site)
2 Months ago
Marsh McLennan - Bilingual Korean Sales Executive

Marsh McLennan

Mexico City, Mexico (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!