Cloud Infrastructure Engineer (AWS / Kubernetes / SRE)

1 Month ago • 5 Years + • Devops

Job Summary

Job Description

We’re looking for a Cloud Infrastructure Engineer who thrives at the crossroads of classic Linux and AWS infrastructure and modern Site Reliability Engineering. This is a high-impact, hybrid role designed for someone who can manage cloud resources, harden Kubernetes clusters, and shape a more reliable and developer-friendly platform. You’ll take over key responsibilities from our current Infra Lead who is transitioning to a software-focused role, giving you immediate ownership and space to shine.
Must have:
  • Maintain and harden AWS infrastructure (EC2, ALB/NLB, WAF, IAM, CloudWatch)
  • Operate and evolve EKS clusters powering Python-based AI services
  • Migrate existing services to Kubernetes using Terraform and Helm
  • Codify infrastructure with Terraform and manage host-level automation via Ansible
  • Build and improve CI/CD pipelines with GitHub Actions
  • Own observability efforts: Prometheus, Grafana, alerting, and on-call readiness
  • Support OS-level patching, certs, WAF rules, and general infra hygiene
  • Partner with engineers to guide best practices and drive platform reliability
  • Create clean, maintainable infrastructure documentation and playbooks
  • Occasionally support rare off-hours incidents
Good to have:
  • Strong Ansible skills beyond the basics
  • PostgreSQL or Amazon RDS tuning and operations experience
  • Deep understanding of observability tools (Prometheus, Grafana, Loki, etc.)
  • Familiarity with PHP production environments
  • Experience with TDD, CI/CD best practices, and agile development
  • Any previous SRE-like exposure such as building resilience, automation, or incident tooling
Perks:
  • Comprehensive health insurance for both you and your family
  • Professional development budget for conference tickets, online courses, and other relevant resources
  • Flexible benefits package
  • Hybrid work
  • Generous leave options
  • In-office perks, including free meals and snacks
  • Company-funded sport activities
  • Annual offsites
  • Team-building events

Job Details

**WHO WE ARE 🌍

Manychat is a leading Chat Marketing platform. We help businesses engage with their customers on Instagram, Facebook Messenger, WhatsApp, and Telegram.

Trusted by over 1 million brands in 170+ countries, we're an official Meta Business Partner, backed by top investors, including Bessemer Venture Partners.

With 200+ teammates across international offices in Barcelona, Austin, Amsterdam, São Paulo, and Yerevan — Manychat helps businesses across the globe improve their ROI and grow faster.

ABOUT THE ROLE 🚀

We’re looking for a Cloud Infrastructure Engineer who thrives at the crossroads of classic Linux and AWS infrastructure and modern Site Reliability Engineering. This is a high-impact, hybrid role designed for someone who can manage cloud resources, harden Kubernetes clusters, and shape a more reliable and developer-friendly platform.

We need you not just to maintain but to rethink and evolve our infrastructure, balancing hands-on operations with strategic improvements that future-proof our growing AI product landscape.

You’ll take over key responsibilities from our current Infra Lead who is transitioning to a software-focused role, giving you immediate ownership and space to shine.

WHY THE ROLE IS SPECIAL 💡

You won’t be a cog in a massive SRE org. You’ll be the bridge between Infrastructure and Engineering, shaping how we scale Kubernetes, how we approach platform reliability, and how developers ship fast without fear. You’ll get autonomy, ownership, and a smart, humble team excited to learn with you.

WHAT YOU’LL DO 🤖

  • Maintain and harden AWS infrastructure (EC2, ALB/NLB, WAF, IAM, CloudWatch)
  • Operate and evolve our EKS clusters powering Python-based AI services
  • Migrate existing services to Kubernetes using Terraform and Helm
  • Codify infrastructure with Terraform and manage host-level automation via Ansible
  • Build and improve CI/CD pipelines with GitHub Actions
  • Own observability efforts: Prometheus, Grafana, alerting, and on-call readiness
  • Support OS-level patching, certs, WAF rules, and general infra hygiene
  • Partner with engineers to guide best practices and drive platform reliability
  • Create clean, maintainable infrastructure documentation and playbooks
  • Occasionally support rare off-hours incidents (don’t worry, really rare)

WHAT YOU’LL BRING 💥

  • 5+ years of experience managing Linux in production (Ubuntu, Amazon Linux)
  • Strong experience with Kubernetes (ideally EKS), Helm, and Terraform
  • Comfort with running and debugging Python workloads in containers
  • Solid understanding of networking, IAM, and cloud security best practices
  • Hands-on Nginx experience (Ingress and reverse proxy setups)
  • Excellent communication skills; you can explain complex infra to devs clearly

NICE TO HAVE SKILLS 🛠️

  • Strong Ansible skills beyond the basics
  • PostgreSQL or Amazon RDS tuning and operations experience
  • Deep understanding of observability tools (Prometheus, Grafana, Loki, etc.)
  • Familiarity with PHP production environments
  • Experience with TDD, CI/CD best practices, and agile development
  • Any previous SRE-like exposure such as building resilience, automation, or incident tooling

WHAT WE OFFER 🤗

We care deeply about your growth, well-being, and comfort:

  • 💙 Comprehensive health insurance for both you and your family.
  • 📚 Professional development budget for conference tickets, online courses, and other relevant resources to help you grow.
  • 🫶 Flexible benefits package to tailor perks that matters most for you.
  • 🪴 Hybrid work and generous leave options to prioritize your work-life balance.
  • 🍽️ In-office perks, including free meals and snacks.
  • 🤝 Company-funded sport activities, annual offsites, and team-building events.

Manychat is an Equal Opportunity Employer. We’re committed to building a diverse and inclusive team. We do not discriminate against qualified employees or applicants because of race, color, religion, gender identity, sex, sexual preference, sexual identity, pregnancy, national origin, ancestry, citizenship, age, marital status, physical disability, mental disability, medical condition, military status, or any other characteristic protected by local law or ordinance.

This commitment is also reflected through our candidate experience. If you have individual needs that may require an accommodation during the interview process, please indicate this in your application. We will do our best to provide assistance throughout your interview process to ensure you’re set up for success.

With my application, I accept the Manychat Privacy Policy.

Similar Jobs

Juego Studios - Associate Technical Lead

Juego Studios

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Next Level Business Services - Android Developer

Next Level Business Services

Redwood City, California, United States (On-Site)
10 Months ago
flying wild hog - Lead Technical Artist

flying wild hog

Brussels, Brussels, Belgium (Hybrid)
1 Year ago
Trackman - Customer Service Specialist (Tier 1)

Trackman

(On-Site)
4 Months ago
P99 soft - IAM Engineer

P99 soft

Hyderabad, Telangana, India (On-Site)
2 Months ago
bytedance - Cloud Native Infrastructure Engineer

bytedance

Singapore (On-Site)
3 Months ago
Veeam Software - Cloud Platform Engineer

Veeam Software

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Mistral AI - Solutions Architect, Partner - EMEA

Mistral AI

London, England, United Kingdom (Hybrid)
5 Months ago
Collaborative Robotics - Software Engineer, Build and Deploy

Collaborative Robotics

Santa Clara, California, United States (On-Site)
3 Months ago
Glean - Solutions Architect

Glean

United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Qualcomm - Staff Manager of Game Studio Relationships

Qualcomm

Shanghai, Shanghai, China (On-Site)
3 Months ago
Hitachi - Senior Project Manager

Hitachi

San José, San José Province, Costa Rica (Remote)
10 Months ago
Saronic Technologies - Material Handler

Saronic Technologies

Franklin, Louisiana, United States (On-Site)
4 Weeks ago
zeta - Lead Software Development Engineer - Backend

zeta

Hyderabad, Telangana, India (On-Site)
4 Months ago
Tesla - Sales Advisor

Tesla

Bolzano, Trentino-South Tyrol, Italy (On-Site)
6 Months ago
Discord - Senior Financial Analyst, Business Partnership

Discord

San Francisco, California, United States (On-Site)
3 Months ago
WebFX - Sr  Quality Assurance Engineer (Philippines )

WebFX

Philippines (Remote)
9 Months ago
bytedance - Strategy Intern, BytePlus

bytedance

Singapore (On-Site)
6 Months ago
PwC - Senior Associate - IFS - Property & Facilities Management

PwC

Jakarta, Jakarta, Indonesia (On-Site)
10 Months ago
Any Desk - C++ Software Developer

Any Desk

Tampa, Florida, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Amsterdam, North Holland, Netherlands

grendel games - Technical art intern

grendel games

Leeuwarden, Friesland, Netherlands (Hybrid)
3 Months ago
Thales - Network Software Engineer

Thales

Hengelo, Overijssel, Netherlands (Hybrid)
3 Months ago
Tesla - Senior Software Engineer - Full Stack React & PHP

Tesla

North Holland, Netherlands (On-Site)
6 Months ago
Tesla - Sr. Product Manager, Financial Services

Tesla

North Holland, Netherlands (On-Site)
6 Months ago
Square - Tax Data Analytics Internship

Square

Amsterdam, North Holland, Netherlands (On-Site)
1 Month ago
Devoteam - Manager Bedrijfsvoering Publieke Sector

Devoteam

Amsterdam, North Holland, Netherlands (On-Site)
2 Months ago
Newzoo - Product Manager - Business & Store Intelligence

Newzoo

Amsterdam, North Holland, Netherlands (Hybrid)
3 Months ago
Adyen - AML & Screening Monitoring Investigator

Adyen

Amsterdam, North Holland, Netherlands (On-Site)
1 Month ago
Tesla - Tesla Roadside Support Specialist - Amsterdam (Arabic Speaking)

Tesla

North Holland, Netherlands (On-Site)
6 Months ago
Tesla - Electrical Equipment Engineer

Tesla

North Brabant, Netherlands (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Rackspace Technology - Senior GCP Cloud Engineer

Rackspace Technology

United States (Remote)
4 Months ago
NVIDIA - Senior Software Architect, Accelerated Computing SDN

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
5 Months ago
Salesforce - Senior, Specialist Solution Engineer

Salesforce

London, England, United Kingdom (Hybrid)
4 Weeks ago
Canva - Senior Frontend Engineer - Apps API Platform

Canva

Melbourne, Victoria, Australia (Remote)
4 Months ago
Nagarro - Associate Principal Engineer, Performance and Site Reliability

Nagarro

Sri Lanka (Remote)
10 Months ago
Flow - Senior/Staff Platform Engineer/SRE

Flow

Palo Alto, California, United States (Hybrid)
6 Months ago
Cursor - Infrastructure Engineer

Cursor

San Francisco, California, United States (On-Site)
1 Month ago
DraftKings - Senior Software Engineer, Automation

DraftKings

Sofia, Sofia City Province, Bulgaria (On-Site)
2 Months ago
Motorola solutions - Senior AWS Bedrock AI & NodeJS engineer

Motorola solutions

Sydney, New South Wales, Australia (On-Site)
4 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Barcelona, Catalonia, Spain (Hybrid)

Barcelona, Catalonia, Spain (Hybrid)

Amsterdam, North Holland, Netherlands (Hybrid)

Austin, Texas, United States (Hybrid)

Austin, Texas, United States (Hybrid)

Austin, Texas, United States (Hybrid)

Amsterdam, North Holland, Netherlands (Hybrid)

Amsterdam, North Holland, Netherlands (Hybrid)

Barcelona, Catalonia, Spain (Hybrid)

Barcelona, Catalonia, Spain (On-Site)

View All Jobs

Get notified when new jobs are added by Many Chat Inc.

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug