Site Reliability Engineer - III (UK Shift)

2 Months ago • All levels • Devops

Job Summary

Job Description

Rackspace is seeking a Site Reliability Engineer / Observability Engineer for their Professional Services Center of Excellence. The role focuses on Application Performance Monitoring Suites, helping customers accelerate digital transformation by building next-generation modern applications. You will solve complex business problems, improve customer experiences by leveraging modern interpretations of SRE and Observability using tools like Datadog, New Relic, AppDynamics, or Dynatrace. The position involves working with customers, implementing observability solutions, building scalable systems, developing monitoring tools, and analyzing data for anomaly detection and performance tuning.
Must have:
  • Bachelor’s degree in engineering/computer science or equivalent
  • Senior-level SRE/DevOps experience
  • AWS infrastructure design, implementation, and optimization
  • Automation for deployment, scaling, and reliability
  • Experience with observability tools (Splunk, Datadog, etc.)
  • Experience with AWS ecosystem
  • Proactive problem-solving skills
  • Proficiency in scripting languages (Python, PHP, Perl, Ruby, Linux Shell)
  • Experience with Terraform or Cloud Formation
  • Experience with configuration management (Ansible, Chef, Puppet)
  • Experience with Git and agile development
  • Understanding of AWS pricing models
  • Knowledge of network & system management solutions
  • Excellent organizational and project management skills
  • Excellent communication, critical thinking & analytical skills

Job Details

Site Reliability Engineer / Observability Engineer
Public Cloud - Offerings and Delivery – Workforce Mgmt & Delivery Ops /
Full - Time / Remote
Rackspace is building up its Professional Services Center of Excellence on Application Performance Monitoring Suites.  
If you enjoy solving complex business problems and can contribute to building next generation of modern applications for our customers helping them understand the connections between application performance, user experience and business outcomes creating amazing customer experiences, with modern interpretations of SRE, Observability using Datadog, New Relic, AppDynamics or Dynatrace, working with their suite of products and integrations, then join us!  
Rackspace enables businesses to accelerate digital transformation through our innovative data, integration solutions tools that help you fix problems quickly, maintain complex systems and improve code. We believe Datadog, AppDynamics or New Relic will be a large contributor to what we do, and we want talented, creative, and thoughtful individuals to join our team to shape Observability Engineering for our customers.


You Will
  • Work with customers and implement Observability solutions
  • Build and maintain scalable systems and robust automation that supports engineering goals.
  • Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance
  • Proactively gather and analyze both metric and log data from systems and applications to perform anomaly detection, performance tuning, capacity planning and fault isolation.
  • Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability, security and performance standards
  • Collaborate with team members to document and share solutions 
  • Maintain a deep understanding of the customer’s business as well as their technical environment 
  • Identifying performance bottlenecks, identifying anomalous system behavior, and resolving root cause of service issues


You Have:
  • Bachelor’s degree in engineering/computer science or equivalent
  • Senior-level experience with Site Reliability Engineering, DevOps, Code level application support and troubleshooting, AWS Infrastructure design, implementation and optimization, Automation for deployment, scaling and reliability.  
  • Experience with observability solutions tools like Splunk, Datadog, SignalFx, etc.
  • Experience deploying, maintaining and supporting software applications/services in the AWS ecosystem
  • Proactive approach to identifying problems and solutions
  • Experience writing code with one or more interpreted languages such as Python, PHP, Perl, Ruby,Linux  Shell
  • Experience with Terraform or Cloud Formation scripting
  • Experience with configuration management tools like Ansible, Chef or Puppet
  • Experience with standard software development best practices and tools such as code repositories (Git preferred)
  • Experience executing in an agile software development environment
  • Good understanding of pricing/cost models across AWS services, especially compute, storage, and database offerings
  • A clear understanding of network & system Management solutions
  • Excellent organizational and project management skills
  • Excellent communication, critical thinking & analytical skills


About Rackspace Technology
  • We are the multicloud solutions experts. We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.


More on Rackspace Technology
  • Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.


Similar Jobs

quience - Retail Facilities & Workplace Manager

quience

San Francisco, California, United States (On-Site)
1 Month ago
Dentsu - Account Director

Dentsu

Singapore (On-Site)
2 Months ago
Cubic corporation - EMEA Project Control Manager

Cubic corporation

Salfords, England, United Kingdom (On-Site)
2 Months ago
zoox - Software Engineer - Mission Planning

zoox

Foster City, California, United States (Hybrid)
2 Years ago
AECOM - Civil / Highway Engineer - Transportation Design

AECOM

Pittsburgh, Pennsylvania, United States (Hybrid)
2 Months ago
Loft Orbital - Senior Site Reliability Engineer

Loft Orbital

Golden, Colorado, United States (Remote)
3 Months ago
binance - DevOps Engineer

binance

Asia, Lima Region, Peru (Remote)
5 Months ago
Accenture - Solution Architect

Accenture

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Turbulent - DevOps Senior

Turbulent

Montreal, Quebec, Canada (On-Site)
1 Month ago
BigID - Site Reliability Engineer

BigID

Hyderabad, Telangana, India (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

kaizen gaming  - Backoffice Specialist

kaizen gaming

São Paulo, Brazil (Hybrid)
2 Months ago
WongDoody - PRODUCT SERVICE DESIGNER, TAIWAN

WongDoody

Taipei City, Taiwan (On-Site)
9 Months ago
WebFX - Digital Marketing Specialist - Account Manager- Ft Myers, FL

WebFX

Fort Myers, Florida, United States (On-Site)
10 Months ago
Toast - Senior Data Analyst Talent Operations

Toast

Chennai, Tamil Nadu, India (Hybrid)
2 Months ago
Adobe - Senior Organizational Effectiveness Partner

Adobe

San Jose, California, United States (On-Site)
3 Months ago
Blinkhealth - People and Culture Partner, Pharmacy Operations

Blinkhealth

Chesterfield, Missouri, United States (On-Site)
3 Months ago
Tesla - Training Coordinator - Parts Operations

Tesla

Barcelona, Catalonia, Spain (On-Site)
6 Months ago
Devoteam - Customer Experience Consultant

Devoteam

Amman, Amman Governorate, Jordan (On-Site)
3 Months ago
Square - Manager, Consolidation & Long-Range Planning

Square

Charlotte, North Carolina, United States (On-Site)
3 Weeks ago
London stock Exchange - Customer Success Manager

London stock Exchange

New York, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in India

zeta - Site Reliability Engineer I / II

zeta

Hyderabad, Telangana, India (On-Site)
1 Year ago
smarsh - Full Stack Engineer (SE III)

smarsh

India (Hybrid)
1 Month ago
WebTech Corporation - LEAD - Finance

WebTech Corporation

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Capgemini - SAP YL

Capgemini

Mumbai, Maharashtra, India (On-Site)
3 Months ago
DNEG - FX Lead (DNEG Animation)

DNEG

Mumbai, Maharashtra, India (On-Site)
10 Months ago
T systems - Control-M L3 Engineer

T systems

Pune, Maharashtra, India (On-Site)
4 Weeks ago
Accenture - Software Development Lead

Accenture

Hyderabad, Telangana, India (On-Site)
2 Months ago
Capgemini - Java Developer with Springboot

Capgemini

Gurugram, Haryana, India (On-Site)
3 Months ago
PhonePe - Head, Business Marketing

PhonePe

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Rackspace Technology - AWS Devops III

Rackspace Technology

Bengaluru, Karnataka, India (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Nagarro - Senior Engineer, DevOps

Nagarro

India (Remote)
10 Months ago
Flexra Software - Member of Technical Staff, Site Reliability Engineer

Flexra Software

India (Remote)
3 Months ago
Workato - Intern - Platform Solutions Engineer

Workato

Tokyo, Japan (On-Site)
3 Months ago
bytedance - Backend Software Engineer (SRE) Intern

bytedance

Singapore (On-Site)
3 Months ago
Flexera Software - Principal Architect - ITAM Solutions

Flexera Software

United Kingdom (Remote)
1 Month ago
Arkose Labs - Site Reliability Engineer

Arkose Labs

San José Province, Costa Rica (Remote)
3 Weeks ago
PhonePe - Site Reliability Engineer - Systems

PhonePe

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Google - Software Engineer III, Full Stack, Google Cloud Business Platforms

Google

Kirkland, Washington, United States (On-Site)
4 Months ago
Brillio - Informatica Intelligent Cloud Services (IICS) Developer

Brillio

McLean, Virginia, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded