Lead Site Reliability Engineer

1 Month ago • 7 Years +

Job Summary

Job Description

As a Lead Site Reliability Engineer at Coupa, you will be responsible for ensuring the availability, scalability, and performance of the company's cloud applications. You will combine software engineering, infrastructure knowledge, and operations expertise to build automation, improve observability, and proactively solve system issues. Responsibilities include administering Linux and Windows environments, writing code for automation, collaborating with teams on releases, monitoring systems, and participating in incident response.
Must have:
  • 7+ years managing large-scale applications
  • 5+ years experience with Linux and/or Windows administration
  • Proficiency in scripting or programming (Python, PowerShell, Bash)
  • Cloud expertise in AWS or Azure
  • Understanding of container orchestration (Kubernetes, EKS, AKS)

Job Details

Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.


Why join Coupa?


🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.

🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.

🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other. 


Learn more on Life at Coupa blog and hear from our employees about their experiences working at Coupa. 


The Impact of a Lead Site Reliability Engineer to Coupa: 


As a Lead SRE, you will ensure the availability, scalability, and performance of Coupa’s mission-critical cloud applications. Your work will combine software engineering, infrastructure knowledge, and operations expertise to build automation, improve observability, and proactively solve system issues before they impact customers.


What You’ll Do:
  • Own the reliability and uptime of Coupa’s production services and continuously enhance automation and recovery strategies
  • Administer Linux and Windows environments, web servers, databases, and application servers
  • Write code and scripts (Python, Bash, PowerShell, or similar) to build scalable automation solutions
  • Collaborate with product, support, and engineering teams to plan and deploy releases with minimal risk
  • Monitor systems proactively and improve alerting, observability, and incident response capabilities
  • Participate in an on-call rotation, triage and resolve critical incidents, and drive root cause analysis (RCA) documentation
  • Support and improve containerization efforts (Docker, Kubernetes, EKS/AKS)
  • Lead or support large-scale infrastructure projects that improve performance and cost efficiency
  • Drive operational excellence through CI/CD, configuration management, and IAC best practices


What you will bring to Coupa:
  • 7+ years of experience managing large-scale, customer-facing applications
  • 5+ years of hands-on experience with Linux and/or Windows administration
  • Proficiency in scripting or programming (Python, PowerShell, Bash, or any object-oriented language)
  • Cloud expertise in AWS or Azure (preferred), with experience in infrastructure-as-code (Terraform, Chef, Ansible)
  • Familiarity with CI/CD tools like Jenkins, Octopus, or Rundeck
  • Strong understanding of container orchestration (Kubernetes, EKS, AKS)
  • Skilled in monitoring and observability platforms (New Relic, Datadog, Splunk)
  • Experience with incident/change/problem management frameworks (ITIL, JIRA)
  • Good knowledge of DNS, load balancers, and network troubleshooting
  • Exposure to database operations (especially MS SQL Server)


Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees. 


Please be advised that inquiries or resumes from recruiters will not be accepted.


By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain your application in our Privacy Policy.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Pune, Maharashtra, India

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Pune, Maharashtra, India (Hybrid)

Pune, Maharashtra, India (On-Site)

Bogota, Colombia (Hybrid)

Bengaluru, Karnataka, India (Remote)

Pune, Maharashtra, India (On-Site)

Denmark (Remote)

Hyderabad, Telangana, India (Hybrid)

Taipei City, Taiwan (On-Site)

View All Jobs

Get notified when new jobs are added by Coupa

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug