Site Reliability Engineer

5 Hours ago • 5 Years +
Devops

Job Description

GoDaddy is seeking an exceptional Site Reliability Engineer in India to build and maintain infrastructure powering millions of entrepreneurs. This role focuses on driving reliability, observability, and cost efficiency across large-scale systems. You will design for resilience, automate operations, and proactively prevent incidents to ensure flawless system performance. The position involves implementing end-to-end observability, architecting AWS infrastructure, managing containerized workloads, and leading incident management.
Good To Have:
  • Bachelor’s degree or equivalent experience in computer science, engineering, or a related technical field.
Must Have:
  • Implement end-to-end observability using Prometheus, Grafana, CloudWatch, and ServiceNow.
  • Define and maintain SLIs/SLOs/SLAs across infrastructure and applications.
  • Architect and automate AWS infrastructure using CDK, CloudFormation, Python, Go, or Bash.
  • Manage and troubleshoot containerized workloads across Docker, Kubernetes (EKS), ECS, and Fargate.
  • Ensure configuration consistency through Ansible, Puppet, or Chef.
  • Design, build, deploy, and maintain large-scale, production-grade systems in AWS.
  • Drive platform reliability by proactively identifying risks and planning for scale and performance.
  • Lead incident management with blameless postmortems and standardized SOPs.
  • Enhance infrastructure and CI/CD pipelines to improve performance and cost-effectiveness.
  • 5+ years of SRE experience supporting production-scale systems.
  • Strong understanding of SLIs/SLOs, distributed systems reliability, and troubleshooting complex production issues.
  • Deep hands-on expertise with AWS services (EKS, ECS, Fargate, EC2, S3, RDS, SQS, SNS, CloudFormation, CDK, IAM, CloudWatch).
  • Proficient in incident management tools (BigPanda, Site24x7), ServiceNow integration, and configuration management (Ansible, Puppet, Chef).
  • Strong automation skills in Python/Go/Bash with expertise in CI/CD pipelines using GitHub Actions, Jenkins, and container orchestration.
  • Skilled in monitoring and observability tools including Prometheus, Grafana, and CloudWatch.
Perks:
  • Paid time off
  • Retirement savings (e.g., 401k, pension schemes)
  • Bonus/incentive eligibility
  • Equity grants
  • Participation in employee stock purchase plan
  • Competitive health benefits
  • Family-friendly benefits including parental leave
  • Employee Resource Groups
  • Support for entrepreneurs/side hustles

At GoDaddy the future of work looks different for each team. Some teams work in the office full-time; others have a hybrid arrangement (they work remotely some days and in the office some days) and some work entirely remotely.

This is a remote position, so you’ll be working remotely from your home. You may occasionally visit a GoDaddy office to meet with your team for events or meetings.

Join our team...

At GoDaddy, we are searching for an outstanding Site Reliability Engineer to join our ambitious team. This role offers the chance to create, build, and maintain the infrastructure that powers the dreams of millions of entrepreneurs worldwide!

You will be at the forefront of driving reliability, observability, and cost efficiency across our large-scale systems. By designing for resilience, automating operations, and proactively preventing incidents, you will ensure that our systems run flawlessly!

What you'll get to do...

  • Implement end-to-end observability using Prometheus, Grafana, CloudWatch, and ServiceNow while defining and maintaining SLIs/SLOs/SLAs across infrastructure and applications.
  • Architect and automate AWS infrastructure using CDK, CloudFormation, Python, Go, or Bash, with deployments orchestrated via GitHub Actions or Jenkins.
  • Manage and troubleshoot containerized workloads across Docker, Kubernetes (EKS), ECS, and Fargate while ensuring configuration consistency through Ansible, Puppet, or Chef.
  • Design, build, deploy, and maintain large-scale, production-grade systems in AWS, with full ownership of end-to-end system reliability, performance, and availability.
  • Drive platform reliability by proactively identifying risks, planning for scale and performance, and collaborating with engineering teams to embed reliability and cost awareness into all builds.
  • Lead incident management with blameless postmortems and standardized SOPs for response, deployments, capacity, DR, and security using tools like BigPanda, Site24x7, and ServiceNow.
  • Enhance infrastructure and CI/CD pipelines to improve performance and cost-effectiveness, taking ownership of capacity planning, forecasting, and governance.

Your experience should include...

  • 5+ years of proven SRE experience supporting production-scale systems with strong understanding of SLIs/SLOs, distributed systems reliability, and troubleshooting complex production issues.
  • Deep hands-on expertise with AWS services (EKS, ECS, Fargate, EC2, S3, RDS, SQS, SNS, CloudFormation, CDK, IAM, CloudWatch).
  • Proficient in incident management tools (BigPanda, Site24x7), ServiceNow integration, and configuration management (Ansible, Puppet, Chef).
  • Strong automation skills in Python/Go/Bash with expertise in CI/CD pipelines using GitHub Actions, Jenkins, and container orchestration.
  • Skilled in monitoring and observability tools including Prometheus, Grafana, and CloudWatch.

You might also have...

  • Bachelor’s degree or equivalent experience in computer science, engineering, or a related technical field.

We've got your back...

We offer a range of total rewards that may include paid time off, retirement savings (e.g., 401k, pension schemes), bonus/incentive eligibility, equity grants, participation in our employee stock purchase plan, competitive health benefits, and other family-friendly benefits including parental leave. GoDaddy’s benefits vary based on individual role and location and can be reviewed in more detail during the interview process.

We also embrace our diverse culture and offer a range of Employee Resource Groups. Have a side hustle? No problem. We love entrepreneurs! Most importantly, come as you are and make your own way.

About us...

GoDaddy is empowering everyday entrepreneurs around the world by providing the help and tools to succeed online, making opportunity more inclusive for all. GoDaddy is the place people come to name their idea, build a professional website, attract customers, sell their products and services, and manage their work. Our mission is to give our customers the tools, insights, and people to transform their ideas and personal initiative into success. To learn more about the company, visit About Us.

At GoDaddy, we know diverse teams build better products—period. Our people and culture reflect and celebrate that sense of diversity and inclusion in ideas, experiences and perspectives. But we also know that’s not enough to build true equity and belonging in our communities. That’s why we prioritize integrating diversity, equity, inclusion and belonging principles into the core of how we work every day—focusing not only on our employee experience, but also our customer experience and operations. It’s the best way to serve our mission of empowering entrepreneurs everywhere, and making opportunity more inclusive for all. To read more about these commitments, as well as our representation and pay equity data, check out our Diversity and Pay Parity annual report which can be found on our Diversity Careers page.

GoDaddy is proud to be an equal opportunity employer. GoDaddy will consider for employment qualified applicants with criminal histories in a manner consistent with local and federal requirements. Refer to our full EEO policy.

Our recruiting team is available to assist you in completing your application. If they could be helpful, please reach out to myrecruiter@godaddy.com.

GoDaddy doesn’t accept unsolicited resumes from recruiters or employment agencies.

Apply Now and Join the #GoDaddyLife today!

Please review these directions prior to submitting your application:

  • To ensure we address you correctly throughout the process, please enter your Preferred Name if different than your Legal Name. We may ask you to confirm your Legal Name at a later stage when necessary.
  • If you do not have a legal first name, please input FNU where you are asked to input your first name.
  • If you do not have a legal last name/surname, please input LNU where you are asked to input your last name/surname.
  • Submit your resume and/or cover letter in PDF or Docx format.
  • For phone number, please enter your mobile number.

Colorado Residents: In any materials you submit, you may redact or remove age-identifying information such as age, date of birth, or dates of school attendance or graduation. You will not be penalized for redacting or removing this information.

Massachusetts Residents: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
