Site Reliability Engineer

3 Months ago • 3-5 Years • Devops

Job Summary

Job Description

As a Site Reliability Engineer at Constant Contact, you will be responsible for maintaining the reliability and uptime of critical services, focusing on CentOS servers, Java application support, incident management, change management, and Kubernetes administration. You will monitor production systems, applications, and overall performance while conducting security checks and routine system and application maintenance. The role involves responding to operational alerts, collaborating with developers and operations personnel to resolve issues, and participating in post-mortem meetings to prevent future incidents. You will also be expected to write and maintain policy and procedure documents, write scripts or code to develop tools and/or services, and manage service-level objectives.
Must have:
  • Administer Kubernetes clusters with ArgoCD.
  • Monitor and manage applications on CentOS servers.
  • Manage incidents and perform root cause analysis.
  • Use basic Linux scripting for automation.
  • Knowledge of Project Management Tools like JIRA/Confluence.
  • Experience with database systems like MySQL and DB2.
  • Drive incidents using Incident Management processes.
  • Execute change management procedures.
  • Experience as a Linux (CentOS / RHEL) administrator.
  • Experience with managing deployments using Jenkins.
  • Experience working with monitoring tools like New Relic, Splunk and Nagios.
  • Experience with log aggregation tools like Splunk, Loki or Grafana.

Job Details

About Us

Aeries Technology is a Nasdaq-listed global professional services and consulting partner, headquartered in Mumbai, India, with centers in the USA, Mexico, Singapore, and Dubai. We provide mid-size technology companies with the right mix of deep vertical specialty, functional expertise, and the right systems and solutions to scale, optimize, and transform their business operations with unique customized engagement models. Aeries is Great Place to Work certified by GPTW India, reflecting our commitment to fostering a positive and inclusive workplace culture for our employees. Read about us at https://aeriestechnology.com

About Business Unit

Constant Contact is a technology product company headquartered in Waltham, Massachusetts, United States. We are one of the top 2 providers of email marketing, social media marketing, event marketing, and online survey tools. We support 0.5 million SMBs in growing their businesses by building stronger relationships with their customers, with a wide range of intuitive marketing applications designed to help small businesses and nonprofits expand their customer bases and nurture relationships. Read about us at https://www.constantcontact.com/about

In 2021, Constant Contact partnered with Aeries to set up its GTC with the aim of consolidating Constant Contact’s global operations in Bengaluru (Bangalore), India, with teams set up in the areas of IT, Engineering, Customer Support, and other General and Administrative functions. The GTC is a dedicated center focused on providing best practices, research, support, and training for specific business functions.

Big Reasons to Support Small: https://constantcontact.wistia.com/medias/pmlrsyb6hu

Roles and Responsibility

At Constant Contact, we’re looking for individuals well rounded in several aspects of Technical Operations. You will take on the role of a responder to operational alerts and monitoring within Constant Contact. This role requires you to work with both developers and operations personnel to address and resolve issues and requests.

We are looking for a highly skilled and motivated Site Reliability Engineer to join our team. The successful candidate will be responsible for maintaining the reliability and uptime of critical services, with a focus on CentOS servers, Java application support, incident management, change management, and Kubernetes administration. The ideal candidate will possess strong skills in ArgoCD for Kubernetes management and in Linux, basic scripting knowledge, and familiarity with modern monitoring, alerting, and automation tools. We are looking for someone who is self-motivated, possesses excellent communication skills (both oral and written), and is able to work both independently and collaboratively.

What you’ll do:
  • Conduct regular routine tasks for system and application maintenance.
  • Follow SOPs to correct and prevent issues.
  • Monitor production systems, applications, and overall performance. Observability is a process that prepares the software team for uncertainties when the software goes live for end users. Site reliability engineering uses tools to detect abnormal behaviors in the software and, more importantly, to collect information that helps developers understand what causes the problem.
  • Conduct security checks.
  • Run meetings with our business partners, following the processes and procedures in place.
  • Write, update, and maintain policy and procedure documents.
  • Write scripts or code as necessary to develop tools and/or services that support the product.
  • Learn from post-mortems and prevent new incidents from occurring.
  • Perform admin work on various tools and applications such as JIRA and New Relic.
  • Maintain service-level objectives: specific and quantifiable goals related to maintaining the parameters set for our “Golden Metrics”.

Who you are:
  • 3-5+ years of experience working in a SaaS and Cloud environment.
  • Administer Kubernetes clusters, including management of applications using ArgoCD.
  • Monitor, maintain, and manage applications on CentOS servers, ensuring high availability and performance.
  • Respond to and manage running incidents, including running post-mortem meetings, performing root cause analysis, and ensuring timely resolution.
  • Use basic Linux scripting to automate routine tasks and improve operational efficiency.
  • Knowledge of Project Management Tools like JIRA/Confluence.
  • Knowledge of database systems like MySQL and DB2.
  • Understand and drive incidents using Incident Management processes and procedures.
  • Execute change management procedures, run change management meetings, and enforce safe and compliant changes to production environments.
  • Experience as a Linux (CentOS / RHEL) administrator.
  • Deep knowledge of on-call responsibilities and awareness of time management, including maintaining on-call management tools such as xMatters.
  • Experience with managing deployments using Jenkins.
  • Experience working with a suite of monitoring tools including New Relic, Splunk and Nagios.
  • Experience with log aggregation tools like Splunk, Loki or Grafana.
  • You must be comfortable troubleshooting and debugging web applications across the entire stack (i.e. the application layer, the database layer, the OS).
  • Production MySQL experience: replication, performance tuning, query optimization.
  • You should have familiarity with Ansible or other configuration management tools like Puppet.
