Site Reliability Engineer

1 Month ago • All levels • Devops

Job Summary

Job Description

The Site Reliability Engineer (SRE) will be responsible for ensuring the scalability, reliability, and performance of systems, infrastructure, and applications. This role involves collaborating with software engineers, system administrators, and DevOps professionals to design and implement solutions that improve uptime, system availability, and overall service health. The SRE will maintain scalable and reliable infrastructure, design and implement monitoring and alerting systems, develop automation tools, conduct root cause analysis, participate in on-call rotations, and continuously improve deployment processes.
Must have:
  • Experience with Linux/Unix systems administration.
  • Proficient in scripting and programming languages (Python, Go, or Bash).
  • Hands-on experience with cloud platforms such as AWS.
  • Experience with infrastructure-as-code tools (Terraform, Ansible).
  • Experience with containerization and orchestration tools (Docker, Kubernetes).
  • Deep understanding of CI/CD pipelines, networking, and security best practices.
  • Excellent troubleshooting and problem-solving skills.

Job Details

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

Job Description

We are looking for a highly skilled Site Reliability Engineer (SRE) to join our engineering team. As an SRE, you will be responsible for ensuring the scalability, reliability, and performance of our systems, infrastructure, and applications.

You will collaborate closely with software engineers, system administrators, and DevOps professionals to design and implement solutions that improve uptime, system availability, and overall service health.

Key Responsibilities:

  • Maintain a scalable and reliable infrastructure to support mission-critical systems.
  • Design and implement monitoring, alerting, and incident response systems to ensure high availability and performance.
  • Develop tools and automation to eliminate manual operations and improve system efficiency.
  • Collaborate with development teams to ensure that reliability and performance are considered from the outset.
  • Conduct root cause analysis and postmortems to learn from system failures and prevent recurrence.
  • Participate in on-call rotations and respond to incidents, minimising downtime and customer impact.
  • Continuously improve deployment, configuration, and observability processes.

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • Strong experience with Linux/Unix systems administration.
  • Proficient in scripting and programming languages such as Python, Go, or Bash.
  • Hands-on experience with cloud platforms such as AWS and infrastructure-as-code tools (Terraform, Ansible)
  • Experience with containerization and orchestration tools (Docker, Kubernetes, Linux, Windows, Citrix).
  • Deep understanding of CI/CD pipelines, networking, and security best practices.
  • Excellent troubleshooting and problem-solving skills.
  • Strong communication and collaboration abilities.

Preferred Qualifications:

  • Experience with large-scale distributed systems.
  • Familiarity with SLAs, SLOs, and SLIs.
  • Previous experience in a DevOps or SRE role in a production environment.

We encourage applications from people of all backgrounds and particularly welcome applications from under-represented groups, to enable us to bring a diversity of perspectives to our thinking and conversation. It's important to us that we strive to have a workforce that is diverse in the widest sense.

Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.

Similar Jobs

Google - Software Engineer III, Embedded Systems/Firmware, AR

Google

Austin, Texas, United States (On-Site)
2 Months ago
Scale AI - SEAL Research Scientist, Scalable Oversight

Scale AI

San Francisco, California, United States (On-Site)
2 Months ago
logifuture - C# Tech Lead

logifuture

Bucharest, Bucharest, Romania (Hybrid)
3 Months ago
nubank - Regulatory Compliance Senior Analyst

nubank

Mexico City, Mexico (On-Site)
1 Month ago
Spruce Systems - Sr Software Engineer, Cross-Platform Rust

Spruce Systems

United States (Remote)
2 Months ago
Cadence - Cloud Engineer / Full Stack Developer

Cadence

San Jose, California, United States (On-Site)
1 Month ago
bytedance - Site Reliability Engineer Graduate (Technical Infrastructure) - 2025 Start (BS/MS)

bytedance

San Jose, California, United States (On-Site)
8 Months ago
bytedance - Staff Frontend Software Engineer - Customer Service Platform - Seattle

bytedance

Seattle, Washington, United States (On-Site)
8 Months ago
NetEase Games - Infrastructure Engineer

NetEase Games

Quebec, Canada (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

SEGA - Senior Development Manager

SEGA

Horsham, England, United Kingdom (Hybrid)
1 Week ago
White board games - QA Analyst (SSR)

White board games

Argentina (Remote)
2 Months ago
Capgemini - R2R Delivery Lead

Capgemini

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Rockstar Games - UI Tools Programmer

Rockstar Games

Dundee, Scotland, United Kingdom (On-Site)
1 Month ago
Jane Street - Machine Learning Researcher

Jane Street

London, England, United Kingdom (On-Site)
3 Days ago
PayPal - Staff Software Development Manager

PayPal

Bengaluru, Karnataka, India (Hybrid)
3 Weeks ago
Activision - Expert Animation Engineer

Activision

Los Angeles, California, United States (On-Site)
2 Months ago
Guardian - Lead Engineer - IT

Guardian

Chennai, Tamil Nadu, India (On-Site)
1 Month ago
Dialpad AI - IT Support Specialist

Dialpad AI

Pasig, Metro Manila, Philippines (On-Site)
1 Month ago
Riot Games - Staff Software Engineer, Gameplay

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

fuse games - Distributed Development Artist

fuse games

Guildford, England, United Kingdom (Hybrid)
3 Months ago
Cloud Imperium Games - VFX Artist

Cloud Imperium Games

Manchester, England, United Kingdom (On-Site)
4 Months ago
GlobalStep - Director of Sales

GlobalStep

United Kingdom (On-Site)
8 Months ago
Vercel - Enterprise Account Executive

Vercel

United Kingdom (Remote)
5 Months ago
LeoVegas - Senior Commercial Analyst

LeoVegas

Newcastle Upon Tyne, England, United Kingdom (Hybrid)
2 Months ago
Thales - Manufacturing Technician - Test

Thales

Glasgow, Scotland, United Kingdom (On-Site)
2 Months ago
Dentsu - Planning Manager

Dentsu

London, England, United Kingdom (Hybrid)
1 Month ago
Moloco - Senior Data Scientist, Growth Analytics

Moloco

London, England, United Kingdom (On-Site)
1 Month ago
bytedance - Creator Operations Manager (DE)

bytedance

London, England, United Kingdom (On-Site)
2 Months ago
Varonis  - Customer Success Operations Manager

Varonis

London, England, United Kingdom (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Devops Jobs

AiDash - Software Development Engineer II - DevOps

AiDash

Bengaluru, Karnataka, India (Hybrid)
2 Weeks ago
PhonePe - Site Reliability Engineer

PhonePe

Pune, Maharashtra, India (On-Site)
1 Month ago
Visa - Sr. Site Reliability Engineer - ServiceNow

Visa

Ashburn, Virginia, United States (Hybrid)
4 Weeks ago
NVIDIA - Senior Software and System Architect

NVIDIA

New York, New York, United States (Remote)
4 Months ago
Nagarro - Associate Staff Engineer, Mobile Cross Platform

Nagarro

Riyadh, Riyadh Province, Saudi Arabia (On-Site)
8 Months ago
zoox - Staff/Senior Staff Software Platform Engineer

zoox

Foster City, California, United States (Hybrid)
8 Months ago
Ubisoft - Vulnerability DevOps Specialist

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
1 Month ago
Cadence - Solutions Engineer II

Cadence

Hsinchu, Hsinchu City, Taiwan (On-Site)
3 Weeks ago
bytedance - Site Reliability Engineer, Traffic Infrastructure

bytedance

Singapore (On-Site)
8 Months ago
Power Integrations - DevOps Engineer

Power Integrations

Pasig, Metro Manila, Philippines (On-Site)
1 Year ago

Get notifed when new similar jobs are uploaded

About The Company

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology.

Los Angeles, California, United States (Hybrid)

Gurugram, Haryana, India (On-Site)

Bangkok, Thailand (Hybrid)

New York, United States (Hybrid)

Melbourne, Victoria, Australia (Hybrid)

Mumbai, Maharashtra, India (On-Site)

Mumbai, Maharashtra, India (On-Site)

London, England, United Kingdom (Hybrid)

Dallas, Texas, United States (Hybrid)

Basildon, England, United Kingdom (On-Site)

View All Jobs

Get notified when new jobs are added by SSC Technologies

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug