Lead Site Reliability Engineer - Data Platforms - 10799

4 Minutes ago • 8 Years + • $125,000 PA - $162,000 PA
Devops

Job Description

Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform. This role is for a Lead Site Reliability Engineer - Data Platforms, passionate about new technologies and continuously expanding knowledge. The Cloud team seeks an engineer ready to question the status quo with system design, code development, deployment, automation, networking, and experience in managing big data/Machine Learning/GenAI platforms. The role involves managing end-to-end data pipelines, AWS infrastructure, supporting containerized applications, and collaborating with ML teams to deliver AI-driven features.
Must Have:
  • Manage end-to-end data pipelines (ETL), AWS infrastructure (S3, EMR, Redshift), and IaC tools like Terraform and Chef.
  • Support containerized applications (ECS, Docker) and administer Linux-based systems.
  • Collaborate with ML teams to manage ML/GenAI infrastructure and deliver AI-driven features.
  • Enable monitoring and observability across systems and applications.
  • Provide support for data pipelines, including on-call rotation and release planning with Dev teams.
  • Participate in design/code reviews, troubleshoot complex issues, and document root cause analyses (RCAs).
  • 8+ years of experience in Big Data technologies, data pipelines, and Linux administration; strong scripting skills in Bash or Python.
  • 5+ years managing cloud platforms (AWS, Azure), including tools like ECS, EKS, AKS, Terraform, and Helm.
  • Hands-on experience with Infrastructure as Code, CI/CD tools (Chef, Ansible, Jenkins), and source control (Git).
  • Familiarity with Generative AI tools (SageMaker, Bedrock, Azure ML), vector databases, and a strong interest in AI technologies.
  • Solid knowledge of networking (DNS, load balancers), MySQL, Apache Spark, and BI/data lake platforms (e.g., Looker).

Add these skills to join the top 1% applicants for this job

team-management
communication
data-analytics
github
talent-acquisition
game-texts
mysql
networking
dns
linux
aws
azure
ansible
terraform
chef
helm
looker
spark
ci-cd
docker
git
python
bash
jenkins
system-design
machine-learning

Privacy Notice

This website uses cookies to improve your web experience. By clicking Accept, you agree to the use of cookies. Coupa Cookie Policy

Coupa makes margins multiply through its community-generated AI and industry-leading total spend management platform for businesses large and small. Coupa AI is informed by trillions of dollars of direct and indirect spend data across a global network of 10M+ buyers and suppliers. We empower you with the ability to predict, prescribe, and automate smarter, more profitable business decisions to improve operating margins.

Why join Coupa?

🔹 Pioneering Technology: At Coupa, we're at the forefront of innovation, leveraging the latest technology to empower our customers with greater efficiency and visibility in their spend.

🔹 Collaborative Culture: We value collaboration and teamwork, and our culture is driven by transparency, openness, and a shared commitment to excellence.

🔹 Global Impact: Join a company where your work has a global, measurable impact on our clients, the business, and each other.

Learn more on Life at Coupa blog and hear from our employees about their experiences working at Coupa.

The Impact of a Lead Site Reliability Engineer - Data Platforms:

If you are passionate about new technologies, have a strong technical background and you are looking for an environment where you can continuously expand your knowledge, you are the right fit for this role. At Coupa, the “Cloud team” is looking for an engineer who is ready to constantly question the status quo with a mixture of system design, code development, deployment, automation, networking, and experience in managing big data/ Machine Learning/GenAI platforms.

What You'll Do:

  • Manage end-to-end data pipelines (ETL), AWS infrastructure (S3, EMR, Redshift), and IaC tools like Terraform and Chef.
  • Support containerized applications (ECS, Docker) and administer Linux-based systems.
  • Collaborate with ML teams to manage ML/GenAI infrastructure and deliver AI-driven features.
  • Enable monitoring and observability across systems and applications.
  • Provide support for data pipelines, including on-call rotation and release planning with Dev teams.
  • Participate in design/code reviews, troubleshoot complex issues, and document root cause analyses (RCAs).

What You Will Bring to Coupa:

  • 8+ years of experience in Big Data technologies, data pipelines, and Linux administration; strong scripting skills in Bash or Python.
  • 5+ years managing cloud platforms (AWS, Azure), including tools like ECS, EKS, AKS, Terraform, and Helm.
  • Hands-on experience with Infrastructure as Code, CI/CD tools (Chef, Ansible, Jenkins), and source control (Git).
  • Familiarity with Generative AI tools (SageMaker, Bedrock, Azure ML), vector databases, and a strong interest in AI technologies.
  • Solid knowledge of networking (DNS, load balancers), MySQL, Apache Spark, and BI/data lake platforms (e.g., Looker).
  • Strong communication skills, self-driven with global thinking, and capable of independently resolving complex issues and delivering projects.

The estimated pay range for this role is $125,000 - $162,000

The starting salary for the successful candidate will be based on permissible, non-discriminatory factors such as skills, experience, and geographic location.

#LI-Remote

#LI-TC1

Coupa complies with relevant laws and regulations regarding equal opportunity and offers a welcoming and inclusive work environment. Decisions related to hiring, compensation, training, or evaluating performance are made fairly, and we provide equal employment opportunities to all qualified candidates and employees.

Please be advised that inquiries or resumes from recruiters will not be accepted.

By submitting your application, you acknowledge that you have read Coupa’s Privacy Policy and understand that Coupa receives/collects your application, including your personal data, for the purposes of managing Coupa's ongoing recruitment and placement activities, including for employment purposes in the event of a successful application and for notification of future job opportunities if you did not succeed the first time. You will find more details about how your application is processed, the purposes of processing, and how long we retain our application in our Privacy Policy.

Set alerts for more jobs like Lead Site Reliability Engineer - Data Platforms - 10799
Set alerts for new jobs by Coupa
Set alerts for new Devops jobs in United States
Set alerts for new jobs in United States
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙