Senior Systems Engineer, SRE

7 Minutes ago • 6 Years +
Devops

Job Description

Iron Mountain is seeking a highly experienced and results-driven Global Operations and Support Engineer to join our Global Service Delivery team. This role involves leading the technical strategy, ensuring application reliability, and managing global support operations for critical Records Management and Data Management applications. The engineer will optimize MTTR, collaborate with SRE and other teams for escalations, and manage operational processes for various systems, ensuring compliance and supporting billing cycles.
Must Have:
  • Lead technical strategy for core infrastructure, alerting, and monitoring systems, optimizing Mean Time To Resolution (MTTR).
  • Collaborate with Engineering, Development, SRE teams, Customer Care, and Global Account Management to manage observability workstreams, triage technical issues, and handle high-level customer escalations.
  • Build and manage operational processes for Cloud/Web/4GL/DataCenter systems, ensuring compliance with organizational standards, and supporting critical monthly, quarterly, and annual billing cycles.
  • Minimum 6 years of experience as an Operations and Support Engineer in a global environment.
  • Strong knowledge of Records Management Applications and Data Management Applications, including capabilities like storing, archiving, shredding, and asset transfer.
  • Proven ability in Site Reliability Engineering (SRE) management, including defining log-based metrics, Service Level Objectives (SLO), Service Level Indicators (SLI), Error Budgets, and creating event dashboards.
  • Bachelor of Science in Computer Science and Engineering (4-year degree) or equivalent experience.
  • Experience supporting applications built on Linux/Windows, Apache/Tomcat, and Java.
  • Experience with Cloud-native applications in Google Cloud Platform (GCP).

Add these skills to join the top 1% applicants for this job

account-management
data-analytics
game-texts
linux
google-cloud-platform
mean
swift
java

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways.

Are you curious about being part of our growth stor​y while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

Job Summary

Iron Mountain is seeking a highly experienced and results-driven Global Operations and Support Engineer to join our Global Service Delivery team. In this role, you will be responsible for leading the technical strategy, ensuring application reliability, and managing global support operations for our critical Records Management and Data Management applications. You will be a key player on a collaborative team that values a "no-blame" approach to problem-solving and continuous improvement.

What You'll Do

In this role, you will:

  • Lead Technical Strategy and Incident Resolution: Direct the technical strategy for our core infrastructure, alerting, and monitoring systems, focusing on optimizing Mean Time To Resolution (MTTR) targets for swift incident resolution.
  • Collaborate and Triage Global Escalations: Partner with Engineering, Development, Site Reliability Engineering (SRE) teams, Customer Care, and Global Account Management to manage observability workstreams, triage technical issues, and handle high-level customer escalations.
  • Ensure Compliance and Billing Integrity: Build and manage operational processes for Cloud/Web/4GL/DataCenter systems, ensuring compliance with organizational standards, and supporting critical monthly, quarterly, and annual billing cycles that impact the company's financial health.

What You'll Bring

The ideal candidate will have:

  • Minimum 6 years of experience as an Operations and Support Engineer in a global environment.
  • Strong knowledge of Records Management Applications and Data Management Applications, including capabilities like storing, archiving, shredding, and asset transfer.
  • Proven ability in Site Reliability Engineering (SRE) management, including defining log-based metrics, Service Level Objectives (SLO), Service Level Indicators (SLI), Error Budgets, and creating event dashboards.
  • Bachelor of Science in Computer Science and Engineering (4-year degree) or equivalent experience, with experience supporting applications built on Linux/Windows, Apache/Tomcat, and Java, as well as Cloud-native applications in Google Cloud Platform (GCP)

Call to Action

If you are a driven SRE engineer ready to lead the global reliability of mission-critical applications, apply now and help us protect what matters most!

Category: Information Technology

Set alerts for more jobs like Senior Systems Engineer, SRE
Set alerts for new jobs by Iron Mountain
Set alerts for new Devops jobs in India
Set alerts for new jobs in India
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙