Director, IT Incident and Problem Management

4 Months ago • 10-15 Years • Operations

Job Summary

Job Description

Job Details

The Director, IT Incident and Problem Management is responsible for overseeing the processes related to incident and problem management within the organization. This role ensures that incidents are resolved efficiently and effectively and that root causes of problems are identified and addressed to prevent recurrence. The Director will lead a team, collaborate with various departments, inclusive of Engineering, Product Management Customer Support, to maintain a high standard of service delivery
You will leverage your expertise in ITIL framework and Google Site Reliability Engineering (SRE) methodologies to maintain high availability and reliability of our SaaS platform through effective incident response and robust problem management strategies.

What will you do?

    • Provide strategic direction and oversight for the IT incident and problem management function, ensuring 24/7 coverage and effective response to incidents.
    • Develop and refine IT incident and problem management strategies aligned with ITIL and Google SRE methodologies to enhance service reliability and minimize business impact.
    • Lead major incident and problem resolution efforts, conducting thorough root cause analysis and implementing preventive actions based on Google SRE principles.
    • Collaborate closely with cross-functional teams including IT operations, development, and customer support to ensure coordinated incident and problem resolution efforts.
    • Define and monitor key performance indicators (KPIs) and metrics related to incident and problem management, driving continuous improvement initiatives.
    • Present incident and problem management reports to stakeholders, including senior executives and Product Managers, offering insights into trends, risks, and opportunities for improvement. Additionally, develop and deliver customer-facing metrics and reports.

What will you bring?

    • Experience in IT Incident, Problem Management or SRE roles: 10-15 years of experience in IT, with at least 5 years in incident, problem management or SRE and least 3 years in a managerial position.
    • Experience in SaaS Environments: Proven experience in IT incident, problem management or SRE for B2B SaaS providers, ideally within the FinTech sector.
    • Leadership: Proven track record in senior leadership roles, with the ability to inspire and empower cross-functional teams to achieve operational excellence and drive continuous improvement.
    • IT Incident Management: Deep understanding of ITIL framework with extensive hands-on experience in incident identification, prioritization, resolution, and escalation.
    • Problem Management: Expertise in leading comprehensive root cause analysis and problem resolution efforts, incorporating Google SRE principles for preventive actions.
    • Google SRE Methodologies: In-depth knowledge of Google SRE philosophies, including error budget management, service level indicators/objectives (SLIs/SLOs), and effective incident response strategies.
    • Technical Acumen:
    • Broad technical understanding across IT infrastructure, networks, applications and their incident and problem management practices.
    • Broad technical understanding of modern cloud technologies (AWS, Azure, GCP) and their incident and problem management practices.
    • Analytical Skills: Strong ability to analyze incidents and problems, identify root causes, and drive the implementation of effective solutions.
    • Communication and Stakeholder Management: Excellent communication skills, with the ability to engage and influence stakeholders at all levels, including technical teams and senior management.
    • Collaboration: Effective collaboration skills to work with cross-functional teams and stakeholders.
    • Strategic Thinking: Strong analytical and strategic thinking abilities, capable of driving alignment between incident and problem management processes and organizational goals.
The above salary range represents Smarsh's good faith and reasonable estimate of the range of possible base compensation at the time of posting.

Any applicable bonus programs will be discussed during the recruiting process.

The salary for this role will be set based on a variety of factors, including but not limited to, internal equity, experience, education, location, specialty and training. Local cost of living assessments are done for each new hire at the time of offer.

Similar Jobs

OpenGov - Sr. Manager, Engineering

OpenGov

Boston, Massachusetts, United States (Hybrid)
4 Months ago
Luxoft - Senior Data Analyst/Data QA

Luxoft

New Delhi, Delhi, India (Remote)
3 Months ago
Vontier - Device Management Lead

Vontier

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Ness Digital - Java & React Senior Engineers II (T17)

Ness Digital

Timișoara, Timiș, Romania (Remote)
1 Month ago
 Betfair - Customer Service Operator (Part-Time, Day Team)

Betfair

Darwin City, Northern Territory, Australia (On-Site)
2 Months ago
Rank group - Team Leader

Rank group

Wednesbury, England, United Kingdom (On-Site)
2 Months ago
Ubisoft - Productrice associée, Producteur associé [Codev Outsourcing - Services]

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago
Netflix - Workplace Specialist, Media Space

Netflix

Los Angeles, California, United States (On-Site)
1 Month ago
Playtika - Loyalty Manager

Playtika

Israel (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

In The Pocket - SOLUTION ARCHITECT

In The Pocket

Ghent, Flanders, Belgium (On-Site)
3 Months ago
Microsoft - Senior Research Engineer, MSR AI for Science

Microsoft

North Holland, Netherlands (On-Site)
1 Month ago
Microsoft - Senior Software Engineer - LLM Performance

Microsoft

(On-Site)
1 Month ago
Axinous - Architect, Software Development

Axinous

San Jose, California, United States (Hybrid)
3 Months ago
Kaseya - Senior Engineer - Cloud Ops

Kaseya

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Growe - Cybersecurity Engineer

Growe

(Remote)
1 Month ago
Ubisoft Blue Byte - LOCAL IT TECHNICIAN (F/M/D)

Ubisoft Blue Byte

Berlin, Berlin, Germany (Hybrid)
3 Months ago
Microsoft - Senior Software Engineer - Azure Agents

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Month ago
HP - Machine Learning Engineer

HP

Palo Alto, California, United States (On-Site)
5 Months ago
PwC - D365 Finance-Associate

PwC

Mumbai, Maharashtra, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Atlanta, Georgia, United States

Amazon Games - Senior Software Developer, Amazon Games AI

Amazon Games

San Diego, California, United States (On-Site)
1 Month ago
Intel Corporation - Sr. Infrastructure Engineer - Linux OS

Intel Corporation

Hillsboro, Oregon, United States (On-Site)
2 Months ago
Google - Software Engineer III, Google Ads

Google

Kirkland, Washington, United States (On-Site)
3 Months ago
Netflix - Engineering Manager, DevEx

Netflix

United States (Remote)
1 Month ago
Next Level Business Services - Sr. SAP WM/Shipping Consultant

Next Level Business Services

Chicago, Illinois, United States (On-Site)
4 Months ago
Glean - Product Manager, Glean for Engineering

Glean

Palo Alto, California, United States (On-Site)
3 Months ago
The Walt Disney Company - Lead Software Engineer (Identity)

The Walt Disney Company

Burbank, California, United States (On-Site)
3 Months ago
Unity - Sr. Growth Partnerships Manager

Unity

United States (Remote)
4 Months ago
Google - Staff Software Engineer, Core Machine Learning, Google Cloud

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
Aristocrat Gaming - Field Service Technician

Aristocrat Gaming

Phoenix, Arizona, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Operations Jobs

PhonePe - CX Associate Manager, VKYC

PhonePe

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Tesla - Store Manager

Tesla

Budapest, Hungary (On-Site)
3 Weeks ago
Nintendo - Manager, Retail Operations

Nintendo

San Francisco, California, United States (Hybrid)
7 Months ago
The Walt Disney Company - Manager, Studio Design & Development

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
3 Months ago
CloudHire - Operations Support Specialist

CloudHire

Philippines (Remote)
4 Months ago
Grindr - Director of Product Management, Trust & Safety

Grindr

Los Angeles, California, United States (Hybrid)
2 Months ago
PwC - TLS | Associate Legal Sevilla

PwC

Seville, Andalusia, Spain (On-Site)
4 Months ago
Tesla - Charging Operations Specialist, APAC Charging

Tesla

Bangkok, Bangkok, Thailand (On-Site)
3 Weeks ago
ComeOn Group - Scandinavian Speaking Customer Experience Agent

ComeOn Group

St. Julian's, Malta (Hybrid)
1 Month ago
Sporty Group - IN Project Manager

Sporty Group

India (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

United States (On-Site)

London, England, United Kingdom (On-Site)

Portland, Oregon, United States (Hybrid)

New York, New York, United States (Hybrid)

Pleasanton, California, United States (Hybrid)

Atlanta, Georgia, United States (Hybrid)

United States (Remote)

India (Hybrid)

View All Jobs

Get notified when new jobs are added by Smarsh

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug