Site Reliability Engineer

9 Hours ago • All levels

Job Summary

Job Description

The Site Reliability Engineer (SRE) will be responsible for ensuring the scalability, reliability, and performance of systems, infrastructure, and applications. This role involves collaborating with software engineers, system administrators, and DevOps professionals to design and implement solutions that improve uptime, system availability, and overall service health. The SRE will maintain scalable and reliable infrastructure, design and implement monitoring and alerting systems, develop automation tools, conduct root cause analysis, participate in on-call rotations, and continuously improve deployment processes.
Must have:
  • Experience with Linux/Unix systems administration.
  • Proficient in scripting and programming languages (Python, Go, or Bash).
  • Hands-on experience with cloud platforms such as AWS.
  • Experience with infrastructure-as-code tools (Terraform, Ansible).
  • Experience with containerization and orchestration tools (Docker, Kubernetes).
  • Deep understanding of CI/CD pipelines, networking, and security best practices.
  • Excellent troubleshooting and problem-solving skills.

Job Details

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

Job Description

We are looking for a highly skilled Site Reliability Engineer (SRE) to join our engineering team. As an SRE, you will be responsible for ensuring the scalability, reliability, and performance of our systems, infrastructure, and applications.

You will collaborate closely with software engineers, system administrators, and DevOps professionals to design and implement solutions that improve uptime, system availability, and overall service health.

Key Responsibilities:

  • Maintain a scalable and reliable infrastructure to support mission-critical systems.
  • Design and implement monitoring, alerting, and incident response systems to ensure high availability and performance.
  • Develop tools and automation to eliminate manual operations and improve system efficiency.
  • Collaborate with development teams to ensure that reliability and performance are considered from the outset.
  • Conduct root cause analysis and postmortems to learn from system failures and prevent recurrence.
  • Participate in on-call rotations and respond to incidents, minimising downtime and customer impact.
  • Continuously improve deployment, configuration, and observability processes.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • Strong experience with Linux/Unix systems administration.
  • Proficient in scripting and programming languages such as Python, Go, or Bash.
  • Hands-on experience with cloud platforms such as AWS and infrastructure-as-code tools (Terraform, Ansible)
  • Experience with containerization and orchestration tools (Docker, Kubernetes, Linux, Windows, Citrix).
  • Deep understanding of CI/CD pipelines, networking, and security best practices.
  • Excellent troubleshooting and problem-solving skills.
  • Strong communication and collaboration abilities.

Preferred Qualifications:

  • Experience with large-scale distributed systems.
  • Familiarity with SLAs, SLOs, and SLIs.
  • Previous experience in a DevOps or SRE role in a production environment.

We encourage applications from people of all backgrounds and particularly welcome applications from under-represented groups, to enable us to bring a diversity of perspectives to our thinking and conversation. It's important to us that we strive to have a workforce that is diverse in the widest sense.

Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in London, England, United Kingdom

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

London, England, United Kingdom (On-Site)

Bangkok, Thailand (Hybrid)

Dublin, Ohio, United States (Hybrid)

Auckland, Auckland, New Zealand (Remote)

Toronto, Ontario, Canada (Hybrid)

Houston, Texas, United States (Remote)

Jersey City, New Jersey, United States (Hybrid)

New York, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by SSC Technologies

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug