Lead Problem Manager - Platform Azure

2 Minutes ago • 5 Years +

Job Summary

Job Description

Join SimCorp, a leading FinTech innovator, as a Lead Problem Manager within the Platform Azure Operations team. This pivotal role involves identifying, managing, and resolving systemic and recurring issues across cloud infrastructure and services. You will lead root cause analysis, prevent recurring problems, and continuously improve platform stability and performance. Responsibilities include deep-dive investigations, ensuring RCA completion, mitigating systemic risks, developing tooling integrations, implementing automation, maintaining knowledge bases, and delivering analytics to drive operational excellence.
Must have:
  • Lead deep-dive investigations into major and recurring issues.
  • Facilitate root cause analysis (RCA) sessions.
  • Coordinate across engineering and operations.
  • Maintain thorough problem records.
  • Ensure RCAs are completed and delivered within agreed service level agreements (SLAs).
  • Recognize systemic risks and take steps to mitigate them.
  • Prevent recurrence of high-impact issues.
  • Develop and maintain integrations across incident, change, and problem management systems.
  • Ensure seamless data flow and traceability from incidents to RCAs and preventive actions.
  • Partner with Service Enablement specialists, SREs and platform engineers to design and implement automation.
  • Create and maintain clear RCA documentation, known error databases, and self-service materials.
  • Deliver visibility into recurring issues, mean time to resolution (MTTR), and problem trends via dashboards and reports.
  • Drive post-incident reviews and retrospectives.
  • Champion a culture of learning and accountability through feedback loops and operational best practices.
Perks:
  • Attractive salary
  • Bonus scheme
  • Pension
  • Flexible working hours
  • Hybrid workplace model
  • Personalized approach to professional development

Job Details

WHAT MAKES US, US

Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology. If you are an innovative, curious, collaborative person who embraces challenges and wants to grow, learn and pursue outcomes with our prestigious financial clients, say Hello to SimCorp!

At its foundation, SimCorp is guided by our values — caring, customer success-driven, collaborative, curious, and courageous. Our people-centered organization focuses on skills development, relationship building, and client success. We take pride in cultivating an environment where all team members can grow, feel heard, valued, and empowered.

If you like what we’re saying, keep reading!

WHY THIS ROLE IS IMPORTANT FOR US

As part of the Deutsche Börse Group, and with delivery centers worldwide (Manila, Noida, Kyiv, Warsaw, and Mexico City), we support our clients 24/7. Our strategy focuses on Platform leadership, SaaS acceleration, and Ecosystem scaling by 2025.

Within this journey, Platform Azure Operations plays a vital role by ensuring reliability, availability, and operational excellence across our cloud-native services. The Lead Problem Manager is a pivotal role in this mission, responsible for driving root cause analysis, preventing recurring issues, and continuously improving platform stability and performance.

The Problem Manager is organized in the Delivery Excellence team, which is a part of the Platform organization.

As Lead Problem Manager you will be responsible for identifying, managing, and resolving systemic and recurring issues across our cloud infrastructure and services. You’ll work across monitoring tools, ticketing systems (e.g., Opsgenie, Salesforce, Cadalys etc.), and operational platforms (Azure DevOps, Grafana, etc.) to surface insights, lead problem investigations, and drive cross-functional corrective actions.

You will establish problem management workflows, track key metrics, and ensure that lessons learned from incidents are institutionalized into future improvements, including automation opportunities. This position plays a vital role in ensuring a data-driven approach to reliability and operational excellence.

WHAT YOU WILL BE RESPONSIBLE FOR

Problem Analysis & Root Cause Identification:

  • Lead deep-dive investigations into major and recurring issues. Facilitate root cause analysis (RCA) sessions, coordinate across engineering and operations, and maintain thorough problem records.
  • Ensure RCAs are completed and delivered within agreed service level agreements (SLAs) to meet compliance and stakeholder expectations.

Platform Stability & Preventive Action:

  • Recognize systemic risks and take steps to mitigate them through changes in infrastructure, updates to configurations, or automation solutions. Prevent recurrence of high-impact issues.

Tooling & Process Integration:

  • Develop and maintain integrations (in collaboration with our Service Enablement team) across incident, change, and problem management systems. Ensure seamless data flow and traceability from incidents to RCAs and preventive actions.

Automation & Self-Healing Initiatives:

  • Partner with Service Enablement specialists, SREs and platform engineers to design and implement automation that detects, mitigates, or resolves known errors and platform vulnerabilities.

Knowledge Enablement & RCA Documentation:

  • Create and maintain clear RCA documentation, known error databases, and self-service materials to upskill engineering and support teams.

Analytics & Reporting:

  • Deliver visibility into recurring issues, mean time to resolution (MTTR), and problem trends via dashboards and reports. Use insights to influence prioritization of technical debt and improvement initiatives.

Continuous Improvement Leadership:

  • Drive post-incident reviews and retrospectives. Champion a culture of learning and accountability through feedback loops and operational best practices.

WHAT WE VALUE

We are looking for candidates with the following qualifications:

  • 5+ years of experience in Problem Management, Service Management in general (ITIL) or platform operations, beneficially in cloud environments.
  • Well-versed in ITIL practices, especially in problem, incident, and change management
  • Experience with cloud monitoring and alerting platforms (e.g. Opsgenie, Grafana, Prometheus)
  • Proficiency in RCA methodologies (e.g., 5 Whys, Fishbone, Pareto) and problem tracking systems (e.g., Salesforce, Cadalys, ServiceNow, Jira)
  • Familiarity with automation and self-healing frameworks for cloud operations
  • Experience with reporting and data analysis tools (e.g., Power BI, Azure Log Analytics)
  • Proficient communication, facilitation, and cross-functional collaboration skills
  • Ability to prioritize effectively in a diverse, global, and multicultural environment.

BENEFITS

Attractive salary, bonus scheme, and pension are essential for any work agreement. However, in SimCorp, we believe we can offer more. Therefore, in addition to the traditional benefit scheme, we provide a good work and life balance: flexible working hours and a hybrid workplace model. SimCorp follows a global hybrid policy, asking employees to work from the office two days each week while allowing remote work on other days.

There is never just only one route - we practice a personalized approach to professional development to support the direction you want to take.

NEXT STEPS

Please send us your application in English via our career site as soon as possible, we process incoming applications continually. Please note that only applications sent through our system will be processed. At SimCorp, we recognize that bias can unintentionally occur in the recruitment process. To uphold fairness and equal opportunities for all applicants, we kindly ask you to exclude personal data such as photo, age, or any non-professional information from your application. Thank you for aiding us in our endeavor to mitigate biases in our recruitment process.

If you are interested in being a part of SimCorp but are not sure this role is suitable, submit your CV anyway. SimCorp is on an exciting growth journey, and our Talent Acquisition Team is ready to assist you discover the right role for you. The approximate time to consider your CV is three weeks.

We are eager to continually improve our talent acquisition process and make everyone’s experience positive and valuable. Therefore, during the process we will ask you to provide your feedback, which is highly appreciated.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Warsaw, Masovian Voivodeship, Poland

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

SimCorp is a provider of industry-leading integrated investment management solutions for the global buy side. Founded in 1971, with more than 3,000 employees across five continents, SimCorp is a truly global technology leader that empowers more than half of the world’s top 100 financial companies through its integrated platform, services, and partner ecosystem. SimCorp is a subsidiary of Deutsche Börse Group. As of 2024, SimCorp includes Axioma, the leading provider of risk and management and portfolio optimization solutions for the global buy side.

Copenhagen, Denmark (Hybrid)

Bad Homburg, Hessen, Germany (Hybrid)

Warsaw, Masovian Voivodeship, Poland (Hybrid)

Copenhagen, Denmark (Hybrid)

Mexico City, Mexico (Hybrid)

Copenhagen, Denmark (Hybrid)

Manila, Metro Manila, Philippines (Hybrid)

Warsaw, Masovian Voivodeship, Poland (Hybrid)

View All Jobs

Get notified when new jobs are added by Simcorp

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug