Lead Site Reliability Engineer

23 Minutes ago • 5-8 Years • Devops

Job Summary

Job Description

The Lead Site Reliability Engineer (SRE) will spearhead efforts to enhance the reliability, scalability, and performance of SimCorp's products and services. This role involves close collaboration with product development teams and other stakeholders. The SRE will leverage in-depth expertise in Azure Cloud to tackle infrastructure challenges, implement automation, and boost operational efficiency. Key responsibilities include developing SRE solutions, designing reliability strategies, optimizing application performance, managing incident response, and driving continuous improvement.
Must have:
  • Lead development of SRE solutions, including monitoring, anomaly detection, self-healing, and reliability testing strategies.
  • Design and implement reliability, scalability, and performance strategies, leading initiatives for capacity planning and automation.
  • Collaborate with product development teams to optimize application performance and infrastructure.
  • Manage incident response and root cause analysis, ensuring timely resolution of outages.
  • Maintain high-quality documentation for operational processes and system configurations.
  • Drive continuous improvement and adoption of best practices, including change management and observability.
  • Stay current with technology trends through formal and self-directed learning.
  • Mentor and guide junior SREs, promoting a culture of collaboration and knowledge-sharing.
  • Participate in on-call rotations, including weekends, as needed.
  • Bachelor’s or Master’s degree in Computer Science or a related field.
  • 5–8+ years in SRE or Cloud Infrastructure leadership.
  • Extensive expertise in Microsoft Azure, including production-grade design, operations, and cloud-native reliability practices.
  • Proficiency in Infrastructure-as-Code tools such as Bicep, Terraform, ARM, Ansible.
  • Comprehensive understanding of monitoring and incident frameworks.
  • Direct experience leading incident response and applying ITIL practices (problem, change, incident management).
  • Broad technical knowledge across Azure DevOps, Kubernetes, Docker, CI/CD, APIs, scripting, SQL, Cosmos DB, and MongoDB Atlas.
  • Established expertise in mentoring engineers, steering architectural decisions, and synchronizing long-term strategies.
Good to have:
  • Financial industry experience.
Perks:
  • Hybrid work policy (2 days in office, rest remote)
  • Benefits package (details vary by country)

Job Details

WHAT MAKES US, US

Join some of the most innovative thinkers in FinTech as we lead the evolution of financial technology. If you are an innovative, curious, collaborative person who embraces challenges and wants to grow, learn and pursue outcomes with our prestigious financial clients, say Hello to SimCorp!

At its foundation, SimCorp is guided by our values – caring, customer success-driven, collaborative, curious, and courageous. Our people-centered organization focuses on skills development, relationship building, and client success. We take pride in cultivating an environment where all team members can grow, feel heard, valued, and empowered.

If you like what we’re saying, keep reading!

WHY THIS ROLE IS IMPORTANT TO US

Lead Site Reliability Engineer (SRE) will be responsible for leading the efforts to maintain and improve the reliability, scalability, and performance of various SimCorp products and services. This individual will work collaboratively with product development teams across several lines of business and other stakeholders as a requirement to fulfill their responsibilities effectively. The candidate will need in-depth expertise and experience in Azure Cloud and associated technologies to address infrastructure challenges, implement automation solutions, and boost overall operational effectiveness

WHAT YOU WILL BE RESPONSIBLE FOR

  • Lead the development of SRE solutions, including monitoring and alerting, machine learning-based anomaly detection, self-healing mechanisms, and reliability testing strategies.
  • Design and implement reliability, scalability, and performance strategies, while leading the development of initiatives for capacity planning, resource management, and automation opportunities across systems and onboarding pipelines.
  • Collaborate with product development teams to optimize application performance and infrastructure, applying design-thinking and agile methodologies in cross-functional environments.
  • Manage incident response and root cause analysis, ensuring timely resolution of outages and performance issues, and maintaining high-quality documentation for operational processes and system configurations.
  • Drive continuous improvement and adoption of best practices, including change management, observability, and operational excellence, while staying current with technology trends through formal and self-directed learning.
  • Mentor and guide junior SREs, promoting a culture of collaboration and knowledge-sharing, and participate in on-call rotations (including weekends) as needed.

WHAT WE VALUE

  • Bachelor’s or Master’s degree in Computer Science or a related field, with 5–8+ years in SRE or Cloud Infrastructure leadership.
  • Extensive expertise in Microsoft Azure, including production-grade design, operations, and cloud-native reliability practices.
  • Proficiency in Infrastructure-as-Code tools such as Bicep, Terraform, ARM, Ansible, and comprehensive understanding of monitoring and incident frameworks.
  • Direct experience leading incident response, platform-wide reliability improvements, and applying ITIL practices (problem, change, incident management).
  • Broad technical knowledge across Azure DevOps, Kubernetes, Docker, CI/CD, APIs, scripting, SQL, Cosmos DB, and MongoDB Atlas.
  • Established expertise in mentoring engineers, steering architectural decisions, and synchronizing long-term strategies with actionable implementation
  • Financial industry experience is a plus.

Benefits

SimCorp offers several benefits that might play a significant factor in considering whether to accept a job offer. Since SimCorp operates in 30+ offices worldwide, the benefits package may vary from country to country.

Simcorp follows a global hybrid policy, asking employees to work from the office two days each week while allowing remote work on other days.

NEXT STEPS

Please send us your application in English via our career site as soon as possible, we process incoming applications continually. Please note that only applications sent through our system will be processed. At SimCorp, we recognize that bias can unintentionally occur in the recruitment process. To uphold fairness and equal opportunities for all applicants, we kindly ask you to exclude personal data such as photo, age, or any non-professional information from your application. Thank you for aiding us in our endeavor to mitigate biases in our recruitment process.

If you are interested in being a part of SimCorp but are not sure this role is suitable, submit your CV anyway. SimCorp is on an exciting growth journey, and our Talent Acquisition Team is ready to assist you discover the right role for you. The approximate time to consider your CV is three weeks.

We are eager to continually improve our talent acquisition process and make everyone’s experience positive and valuable. Therefore, during the process we will ask you to provide your feedback, which is highly appreciated.

WHO WE ARE

For over 50 years, we have worked closely with investment and asset managers to become the world’s leading provider of integrated investment management solutions. We are 3,000+ colleagues with a broad range of nationalities, educations, professional experiences, ages, and backgrounds in general.

SimCorp is an independent subsidiary of the Deutsche Börse Group. Following the recent merger with Axioma, we leverage the combined strength of our brands to provide an industry-leading, full, front-to-back offering for our clients.

SimCorp is an equal opportunity employer. We are committed to building a culture where diverse perspectives and expertise are integrated in our everyday work. We believe in the continual growth and development of our employees, so that we can provide best-in-class solutions to our clients.

#LI-Hybrid

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Mexico City, Mexico

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

SimCorp is a provider of industry-leading integrated investment management solutions for the global buy side. Founded in 1971, with more than 3,000 employees across five continents, SimCorp is a truly global technology leader that empowers more than half of the world’s top 100 financial companies through its integrated platform, services, and partner ecosystem. SimCorp is a subsidiary of Deutsche Börse Group. As of 2024, SimCorp includes Axioma, the leading provider of risk and management and portfolio optimization solutions for the global buy side.

Mexico City, Mexico (Hybrid)

Noida, Uttar Pradesh, India (Remote)

Mexico City, Mexico (Hybrid)

Noida, Uttar Pradesh, India (Hybrid)

Helsinki, Uusimaa, Finland (Hybrid)

The Hague, South Holland, Netherlands (Hybrid)

Metro Manila, Philippines (Hybrid)

Metro Manila, Philippines (Hybrid)

Metro Manila, Philippines (Hybrid)

View All Jobs

Get notified when new jobs are added by Simcorp

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug
Contact Us
hello@outscal.com
Made in INDIA 💛💙