Staff Site Reliability Engineer

undefined ago • 7 Years + • Devops

Job Summary

Job Description

The Staff Site Reliability Engineer will architect, implement, and maintain robust cloud-native infrastructure and deployment pipelines on AWS, focusing on reliability, scalability, and automation. This role involves managing Kubernetes (EKS), GitOps with ArgoCD, and CI/CD pipelines, utilizing languages like Kotlin, Python, and Bash. The engineer will collaborate with development and operations teams to ensure continuous delivery, system reliability, and rapid incident response for millions of customers.
Must have:
  • 7+ years in DevOps, SRE, or related roles in cloud-native environments
  • 5+ years direct experience managing AWS infrastructure at scale
  • Proficiency in deploying, managing, and troubleshooting Kubernetes clusters (AWS EKS, networking, RBAC, Helm)
  • Advanced hands-on experience with ArgoCD for GitOps-based Kubernetes deployments
  • Strong development and scripting skills in Kotlin, Python, and Bash
  • Deep knowledge of CI/CD concepts and tools
  • Demonstrated ability to design and implement IaC using Terraform and/or AWS CloudFormation
  • Strong problem-solving, root cause analysis, and incident management skills
  • Excellent communication and collaboration abilities
  • Advanced English Level
Good to have:
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field or equivalent experience
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator)
  • Experience with additional DevOps tools such as Jenkins, Ansible, Prometheus, Datadog
  • Familiarity with security and compliance standards in cloud environments
Perks:
  • Flexible payment in USD, Crypto, Euro, ARS via Deel
  • Remote work
  • 15 days of vacation each year from the start date
  • 16 fully paid Argentinean holidays
  • Healthcare Benefit: Monthly stipend to use in your preferred healthcare provider
  • 5-year Sabbatical: After 5 years, get a 4-week paid sabbatical
  • Paid Family leave
  • Compassionate Leave: 3-5 days each time the need arises
  • Customizable benefits including learning opportunities, wellness memberships, delivery apps
  • Personalized English coach

Job Details

About the Team:

The Infrastructure team is responsible for maintaining our highly available infrastructure that services our millions of customers, guaranteeing availability, reliability, and confidentiality. The team services the requests of the engineering organization related to CICD pipelines, builds, infrastructure, and security.

The role:

The DevOps/Site Reliability Engineer (SRE) is responsible for architecting, implementing, and maintaining robust cloud-native infrastructure and deployment pipelines with a focus on reliability, scalability, and automation. This role requires experience with AWS, Kubernetes (EKS), ArgoCD, GitHub, and familiarity with development languages such as Kotlin, Python, and Bash. The engineer will collaborate closely with software development and operations teams to ensure continuous delivery, system reliability, and rapid incident response in a dynamic environment.

Responsibilities:

  • Architect, deploy, and manage highly available and scalable infrastructure on AWS, leveraging services such as EC2, VPC, S3, IAM, and EKS.
  • Design, implement, and maintain Kubernetes clusters (EKS) and oversee the deployment of containerized applications using best practices for security, scaling, and automation.
  • Develop and manage GitOps workflows using ArgoCD for automated, reliable, and auditable application deployments to Kubernetes.
  • Write and maintain infrastructure as code (IaC) using tools such as Terraform.
  • Build, optimize, and troubleshoot CI/CD pipelines to support rapid, reliable software delivery, integrating with ArgoCD and other modern DevOps toolchains.
  • Develop robust automation scripts and tools in languages such as Kotlin, Python, and/or Bash to streamline operational processes, monitoring, and incident response.
  • Proactively monitor system performance, reliability, and security, responding to incidents and participating in on-call rotations as needed.
  • Collaborate with software engineers to improve deployment strategies, system observability, and overall site reliability.
  • Implement and enforce security best practices across all infrastructure and deployment workflows.
  • Maintain comprehensive documentation of infrastructure, processes, and procedures for operational transparency and team knowledge sharing.
  • Experience using GitHub and GitHub Actions to automate, testing and deployments.

Qualifications:

  • 7+ years in DevOps, SRE, or related roles in cloud-native environments, with at least 5 years of direct experience managing AWS infrastructure at scale
  • Proficiency in deploying, managing, and troubleshooting Kubernetes clusters, especially AWS EKS, including networking, RBAC, and Helm.
  • Advanced English Level
  • Advanced hands-on experience with ArgoCD for GitOps-based Kubernetes deployments, including setup, configuration, and troubleshooting.
  • Strong development and scripting skills in Kotlin, Python, and Bash, with the ability to build automation tools and integrate with APIs.
  • Deep knowledge of CI/CD concepts and tools, with proven experience building and maintaining pipelines for cloud-native applications.
  • Demonstrated ability to design and implement infrastructure as code using Terraform and/or AWS CloudFormation.
  • Strong problem-solving skills, including root cause analysis and incident management in distributed, cloud-based systems,
  • Excellent communication and collaboration abilities, working effectively across development, QA, and operations teams.

Preferred requirements:

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field or equivalent experience.
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator).
  • Experience with additional DevOps tools such as Jenkins, Ansible, Prometheus, Datadog.
  • Familiarity with security and compliance standards in cloud environments.

Learn More About the Company

We believe great leadership starts with alignment on vision, values, and ways of working. To give you deeper insight into who we are and what we’re looking for, we invite you to explore: the company's Leadership Principles

– The values and behaviors that guide how we operate, collaborate, and scale.

We hope this provides valuable insight into our culture and product vision. If this excites you, we’d love to connect!

Benefits

💸 Get paid in USD, Crypto, Euro, ARS. Whatever your choice! We use Deel to make things easier for you!

🗺 Work remotely: design the life that you want

⛱ Enjoy 15 days of vacation each year from the start date

🎄 16 fully paid Argentinean holidays

🩺 Healthcare Benefit: Monthly stipend to use in your preferred healthcare provider

🗓️ 5- year Sabbatical: After 5 years with the company, you get a 4-week paid sabbatical

🐣 Paid Family leave

🕯 Compassionate Leave: 3-5 days each time the need arises

🧘🏽‍♀️ Customize the benefits that suit your needs! Access a range of perks tailored to you, including learning opportunities, wellness memberships, delivery apps, and more through our comprehensive benefit platform

🧑‍🏫 Personalized English coach

If you’re interested in this role, please submit your application and if we think you might be a fit, we'll get in touch with you. Thank you for your time!

The company is an Equal Opportunity Employer. We are dedicated to creating a community of inclusion and an environment free from discrimination or harassment. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, sexual orientation, gender identity, national origin, citizenship status, protected veteran status, genetic information, or physical or mental disability.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Argentina

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

New York, United States (Hybrid)

New York, United States (Remote)

Buenos Aires, Buenos Aires, Argentina (Remote)

New York, United States (On-Site)

New York, United States (On-Site)

New York, United States (On-Site)

Toronto, Ontario, Canada (On-Site)

View All Jobs

Get notified when new jobs are added by CookUnity

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug