Site Reliability Engineer

10 Minutes ago • 1 Years +
Devops

Job Description

Lingokids is seeking a Senior Site Reliability Engineer to join their Infrastructure team. The role involves scaling Lingokids' platform, which serves over 1M+ unique users daily, optimizing AWS infrastructure, ensuring 99.9%+ uptime, and pioneering new technologies to enhance platform efficiency and developer productivity. The ideal candidate will contribute to redefining the future of education through Playlearning™.
Good To Have:
  • EdTech or consumer mobile app infrastructure experience
  • AWS certifications (Solutions Architect or DevOps Engineer)
  • Experience with infrastructure cost optimization at scale
  • Familiarity with compliance requirements (COPPA, GDPR)
Must Have:
  • Design, implement, and maintain highly available AWS infrastructure supporting 1M+ daily users
  • Optimize cloud costs through resource right-sizing and architectural improvements
  • Manage Infrastructure as Code using Terraform with GitOps workflows
  • Implement CI/CD pipelines to make our Product Engineers more productive using GitHub Actions and Jenkins
  • Ensure 99.9%+ uptime through proactive capacity planning and redundancy
  • Build comprehensive monitoring
  • Create and maintain runbooks for common operational scenarios
  • Lead incident response, root cause analysis, and post-mortem processes
  • Research and prototype new technologies for efficiency gains
  • Design infrastructure to support 2-3x user growth without linear cost increase
  • Implement chaos engineering practices to improve resilience
  • Drive adoption of platform engineering best practices
  • Partner with product teams to understand and meet infrastructure needs
  • Create self-service tools and documentation for engineering teams
  • Conduct infrastructure reviews for new features and services
  • Share knowledge through tech talks and documentation
  • 1+ years in SRE/DevOps roles with production responsibilities
  • Strong AWS expertise (EC2, RDS, S3, CloudFront, ECS/EKS, Lambda)
  • Infrastructure as Code proficiency with Terraform (2+ years)
  • Monitoring/observability tools experience (Datadog, Prometheus, or similar)
  • Scripting proficiency (Python, Bash, or Go)
  • CI/CD pipeline design and optimization experience (Jenkins, GitHub Actions)
  • Experience supporting high-traffic consumer applications (500k+ daily users)
  • Ability to remain calm and focused under pressure during incident response
  • Excellent written communication
  • English proficiency for global team collaboration
Perks:
  • Career Growth: up to €2,000 per year for books and training
  • Remote-Friendly: Work from home or our offices in Madrid, anywhere within a 2-hour difference from Spain (GMT+1)
  • Stock Options: Opportunity to own part of the company and share in its success
  • Home Office Setup: €400 allowance for setup and €35/month for remote work expenses
  • Meal Allowances: €60/month on your Cobee card
  • Flexible Compensation: Manage meal, transport and childcare expenses easily with Cobee
  • Health Insurance: Access private health coverage at exclusive rates through Adeslas
  • Language Lessons: Free language classes in Spanish and English
  • Visa Sponsorship: If you need a visa to work in the EU, we’ll handle the process and cover the costs
  • Company events: Team gatherings and off-sites in different corners of Spain

Add these skills to join the top 1% applicants for this job

team-management
problem-solving
oops
github
game-texts
incident-response
aws
prometheus
terraform
ci-cd
python
github-actions
bash
jenkins

We’re looking for passionate professionals to join our team, make a real impact, and contribute to something truly meaningful. Let’s build the future of education together! 🚀

Apply for this job

We usually respond within a week

Lingokids is on a mission to help families raise amazing kids through Playlearning™. Ready to join us on this exciting journey? 🚀

Lingokids is a global leader in educational technology, helping over 185 million families worldwide raise amazing kids through Playlearning™—our unique approach that blends education with play. Our mission is to empower children with modern learning experiences, combining educational subjects with essential life skills to help them grow into confident, conscious, and resilient lifelong learners.

Beyond our award-winning app, we’ve built a multi-platform educational universe, including our Baby Bot” and “Baby Bot’s Backyard Tales” shows, Podcasts, and Music Publishing. Our content, developed in collaboration with top education experts and Oxford Press University, ensures an engaging, high-quality learning experience in a safe, ad-free environment. This dedication to excellence has earned Lingokids multiple industry awards across app, podcast, and video categories, including Best Original Learning App by Kidscreen Awards, National Parenting Product Awards by NAPPA Awards, and Best Parenting Product by Good Housekeeping, among many others!

We're seeking a Senior Site Reliability Engineer to join our Infrastructure team and help scale Lingokids' platform serving 1M+ unique users daily. You'll be instrumental in optimizing our AWS infrastructure, ensuring 99.9%+ uptime, and pioneering new technologies that enhance our platform's efficiency and developer productivity.

If you want to be part of a team that's redefining the future of education, we’d love to hear from you!

Core Responsibilities

Infrastructure Excellence (40%)

  • Design, implement, and maintain highly available AWS infrastructure supporting 1M+ daily users
  • Optimize cloud costs through resource right-sizing and architectural improvements
  • Manage Infrastructure as Code using Terraform with GitOps workflows
  • Implement CI/CD pipelines to make our Product Engineers more productive using GitHub Actions and Jenkins
  • Ensure 99.9%+ uptime through proactive capacity planning and redundancy

Observability & Incident Response (30%)

  • Build comprehensive monitoring
  • Create and maintain runbooks for common operational scenarios
  • Lead incident response, root cause analysis, and post-mortem processes

Innovation & Scaling (20%)

  • Research and prototype new technologies for efficiency gains
  • Design infrastructure to support 2-3x user growth without linear cost increase
  • Implement chaos engineering practices to improve resilience
  • Drive adoption of platform engineering best practices

Collaboration & Enablement (10%)

  • Partner with product teams to understand and meet infrastructure needs
  • Create self-service tools and documentation for engineering teams
  • Conduct infrastructure reviews for new features and services
  • Share knowledge through tech talks and documentation

Required Qualifications

Must-Have Technical Skills

  • 1+ years in SRE/DevOps roles with production responsibilities
  • Strong AWS expertise (EC2, RDS, S3, CloudFront, ECS/EKS, Lambda)
  • Infrastructure as Code proficiency with Terraform (2+ years)
  • Monitoring/observability tools experience (Datadog, Prometheus, or similar)
  • Scripting proficiency (Python, Bash, or Go)
  • CI/CD pipeline design and optimization experience (Jenkins, GitHub Actions)

Must-Have Soft Skills

  • Experience supporting high-traffic consumer applications (500k+ daily users)
  • Ability to remain calm and focused under pressure during incident response
  • Excellent written communication
  • English proficiency for global team collaboration

Nice-to-Have

  • EdTech or consumer mobile app infrastructure experience
  • AWS certifications (Solutions Architect or DevOps Engineer)
  • Experience with infrastructure cost optimization at scale
  • Familiarity with compliance requirements (COPPA, GDPR)

English is a must: We’re a multicultural team providing a service in English, so while certifications aren’t necessary, fluency is essential. As a fully remote company, clear and effective spoken and written communication—especially in asynchronous, long-form formats—is key to collaborating successfully.

Life at Lingokids

=================

📚 Career Growth: Your growth drives our success! We invest in your development up to €2,000 per year for books and training—so you can keep learning and growing with us.

🏡 Remote-Friendly: Work from where you’re most productive—home or our offices in Madrid—anywhere within a 2-hour difference from Spain (GMT+1). The choice is yours!

📈 Stock Options: Your contribution matters! You'll receive stock options, giving you the opportunity to own part of the company and share in its success.

**🖥 Home Office Setup: Create your ideal workspace with a €400 allowance for setup and €35/month** for remote work expenses—because comfort fuels creativity!

**🍲 Meal Allowances: Get €60/month** on your Cobee card to enjoy meals at restaurants or food delivery—good food makes everything better!

**💳 Flexible Compensation: Manage your meal, transport and childcare expenses easily with Cobee**

, integrating them directly into your payroll.

**🩺 Health Insurance: Access private health coverage** at exclusive rates through Adeslas, seamlessly deducted from your payroll—quality care made simple.

💬 Language Lessons: Learning never stops! Enjoy free language classes in Spanish and English, to sharpen your skills and stay connected in a global team.

🌍 Visa Sponsorship: If you need a visa to work in the EU, we’ll handle the process and cover the costs to make your transition seamless.

🎉 Company events: Yes! We’re a fully remote team spread across different countries, but we love getting together from time to time in different corners of Spain, for team gatherings and recharging at our amazing off-sites!

Passion matters more than perfection

Didn’t check every box for this role? No worries! We’re looking for passionate, driven individuals who believe in our mission. If that sounds like you, we’d still love to hear from you!

Diversity, Equity, and Inclusion

At Lingokids, diversity isn’t just a checkbox—it’s at the heart of everything we do. Just like we teach kids to embrace differences, we celebrate them in our team, knowing that the best ideas come from unique perspectives. We’re all about creating a space where everyone feels valued, heard, and empowered to be their authentic selves. No matter your background, identity, or story, if you’re passionate about making a difference in education, we want you here.

Team

Engineering

Locations

Madrid HQ, Remote (international, outside Spain), Remote (Spain-based)

Remote status

Fully Remote

Employment type

Full-time

Contact Raül Arnau Talent Sourcer – People & Culture

Colleagues

----------

Paulo Alves Pereira

VP of Engineering

Tomás

Staff Site Reliability Engineer

Our culture

-----------

Playworking:

We work hard and playwork™ even harder! Learning through play and having fun is also the mantra of Lingokids behind the scenes.

Creative leadership:

Lingokids members work in interdisciplinary teams with a long-term goal designed to feel like a mini-startup where everyone must perform creative leadership. All members are responsible for proposing solutions to problems, and nobody is expected to come up with all the ideas while the rest are just developing them.

We know nothing:

At Lingokids we are humble and one of our mottos is “I know nothing”. But knowing is our passion so we are always finding creative ways to increase our wisdom. That continuous race for perfecting our knowledge brought us where we are. That and that we are fast getting that knowledge.

About Lingokids

---------------

We’re the playlearning™ app for more than 100 million families worldwide.

By learning through play, your child will develop skills like creativity, collaboration, critical thinking, and communication.

Join the adventure!

Apply for this job

Already working at Lingokids?

-----------------------------

Let’s recruit together and find your next colleague.

Log in

Set alerts for more jobs like Site Reliability Engineer
Set alerts for new jobs by Lingokids
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙