Site Reliability Engineer (SRE)

7 Minutes ago • 5+ Years • $150,000 PA - $180,000 PA
DevOps

Job Description

The Site Reliability Engineer (SRE) leads technology teams to deliver scalable, resilient, and secure infrastructure platforms and services. This role empowers SS&C’s global business units to innovate with confidence, accelerate application modernization, and effectively manage tech debt. It is crucial for establishing trust and improving collaboration between product, engineering, and operations teams by embedding reliability, automation, and compliance into every product developed and hosted. This position is part of the Global Technology Infrastructure SRE team.
Good To Have:
  • 3+ years in financial services or other regulated industries preferred
  • Certifications such as TOGAF, AWS Certified Solutions Architect, VMware VCP, or Red Hat Certified Architect are a plus
  • Familiarity with ISO 27001, NIST 800-53, and other security frameworks is a plus
Must Have:
  • 5+ years of professional experience in an SRE role
  • Minimum Bachelor’s degree in Computer Science, Engineering, or a related field
  • Proven expertise in architecting, designing, and operating private cloud environments and Kubernetes clusters
  • Hands-on experience with building, deploying, and operating infrastructure as code platforms, CI/CD pipelines, and observability platforms
  • Strong understanding of modern systems reliability standards and practices, including KPIs, SLAs, and SLOs
  • Familiarity with financial services regulatory frameworks
  • Experience with financial-grade network segmentation, micro-segmentation, and zero-trust architecture
  • Outstanding organization, project management skills, and attention to detail
  • Powerful verbal and written communication skills
  • Flexibility to work non-traditional hours as needed
Perks:
  • Hybrid Work Model and Business Casual Dress Code, including jeans
  • 401k Matching Program
  • Professional Development Reimbursement
  • Flexible Personal/Vacation Time Off, Sick Leave, Paid Holidays
  • Medical, Dental, Vision, Employee Assistance Program, Parental Leave
  • Committed to Celebrating the Variety of Backgrounds, Talents, and Experiences of Our Employees
  • Hands-On, Team-Customized Training, including SS&C University
  • Discounts on fitness clubs, travel and more!

Job Description

Site Reliability Engineer (SRE)

Locations: San Francisco, CA / Boston, MA / New York / Denver, CO / Dallas, TX | Hybrid

Get to know us

SS&C Technologies is a global provider of investment and financial services software for the financial services and healthcare industries. Named to the Fortune 1000 list of top U.S. companies by revenue, SS&C is headquartered in Windsor, Connecticut, and has over 28,000 employees in more than 100 offices across 35 countries. Approximately 18,000 financial services and healthcare organizations, from the world’s largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

About the Role

The Site Reliability Engineer (SRE) is responsible for leading technology teams to deliver scalable, resilient, and secure infrastructure platforms and services. This role empowers SS&C’s global business units to innovate with confidence, accelerate application modernization, and effectively manage tech debt. It is key to establishing trust and improving collaboration between our product, engineering, and operations teams as we embed reliability, automation, and compliance into every product we develop and host.

This role is part of, and reports to, the Global Technology Infrastructure SRE team.

Why You Will Love It Here!

  • Flexibility: Hybrid Work Model and Business Casual Dress Code, including jeans
  • Your Future: 401k Matching Program, Professional Development Reimbursement
  • Work/Life Balance: Flexible Personal/Vacation Time Off, Sick Leave, Paid Holidays
  • Your Wellbeing: Medical, Dental, Vision, Employee Assistance Program, Parental Leave
  • Wide Ranging Perspectives: Committed to Celebrating the Variety of Backgrounds, Talents, and Experiences of Our Employees
  • Training: Hands-On, Team-Customized, including SS&C University
  • Extra Perks: Discounts on fitness clubs, travel and more!

What You Will Get To Do

  • Collaborate with Technology Infrastructure teams to build and operate reusable, cloud-native platforms that abstract complexity and accelerate delivery while incorporating reliability from design through operations.
  • Work with business units and technical teams to improve application availability, observability, and reliability as our business applications are migrated to the Private Cloud.
  • Enhance platform reliability through automatic problem detection, self-healing systems, and well-architected notification and escalation protocols.
  • Use SLOs, SLIs, and KPIs to guide prioritization, measure impact, and drive continuous improvement.
  • Eliminate toil using intelligent automation and agentic workflows.
  • Conduct blameless retrospectives and share learnings across the organization.
  • Foster a culture of ownership, positive thinking, and continuous learning while remaining grounded in practicality, experimentation, and engineering excellence.
  • Integrate DevSecOps, zero-trust principles, and policy-as-code into every pipeline.
  • Produce and promote Architecture Decision Records (ADRs) and Cloud Well-Architected Frameworks that our business units can adopt to improve our technology standardization.
  • Maintain 24x5 active coverage with seamless regional handoffs and weekend escalation protocols.

What You Will Bring

  • 5+ years of professional experience in an SRE role, with 3+ years in financial services or other regulated industries preferred.
  • Minimum Bachelor’s degree in Computer Science, Engineering, or a related field.
  • Proven expertise in architecting, designing, and operating private cloud environments (e.g., VMware, OpenStack, OpenShift Virtualization) and Kubernetes clusters at scales ranging from micro to global.
  • Hands-on experience with building, deploying, and operating infrastructure as code platforms, CI/CD pipelines, and observability platforms (e.g., Prometheus, Splunk).
  • Strong understanding of modern systems reliability standards and practices, including establishing KPIs, monitoring and reporting on SLAs and SLOs, and cutting through the noise to surface actionable insights.
  • Familiarity with various financial services regulatory frameworks and their impact on infrastructure design and operations.
  • Familiarity with structured naming conventions and asset management for global infrastructure.
  • Experience with financial-grade network segmentation, micro-segmentation, and zero-trust architecture.
  • Certifications such as TOGAF, AWS Certified Solutions Architect, VMware VCP, or Red Hat Certified Architect are a plus.
  • Familiarity with ISO 27001, NIST 800-53, and other security frameworks is a plus.

Our Expectations

  • Outstanding organization, project management skills, and attention to detail with a proven track record of effective decision-making and problem-solving.
  • Tenacious problem solver and continuous learner who can make routine decisions, solve complex problems, and readily adapt to new technologies in a fast-paced, complex, and demanding environment.
  • Powerful verbal and written communicator, able to articulate concepts and ideas, break through barriers, engage and encourage people, and work effectively with others under pressure.
  • Quickly establish credibility with multiple technical stakeholders, including executives, clients, product, engineering, systems, and cybersecurity teams.
  • Treat confidential information in a discreet and appropriate manner.
  • Uphold and adhere to our compliance and regulatory requirements.
  • Demonstrate a commitment to highly professional and ethical standards in a diverse workplace.
  • Flexibility to work non-traditional hours as needed, including nights, weekends, and local holidays.
  • Occasional work travel as needed (< 25%), including to US domestic and international locations, corporate offices, clients, and conferences.
