Junior Site Reliability Engineer

1 Month ago • 3 Years +
Devops

Job Description

The Junior Site Reliability Engineer assists in designing, building, and maintaining infrastructure and deployment systems for live environments. This collaborative role involves working with development teams and senior SREs to ensure reliable, scalable, and well-instrumented systems. Responsibilities include implementing robust, automated solutions, documenting work for knowledge-sharing, and contributing to infrastructure scalability, availability, and performance. The role also supports CI/CD pipelines, monitoring, alerting, and participates in on-call rotations.
Good To Have:
  • Experience with AWS ECS.
  • Familiarity with Argo CD.
  • Exposure to Prometheus and Grafana.
Must Have:
  • Assist in building and maintaining infrastructure using IaC tools.
  • Support provisioning and lifecycle management of critical environments.
  • Help implement shared infrastructure components (logging, metrics, service mesh, load balancing).
  • Support and extend CI/CD pipelines (GitHub Actions, Argo CD).
  • Assist in implementing and maintaining monitoring, alerting, and logging infrastructure.
  • Ensure all systems and processes are thoroughly documented.
  • Work closely with SRE Lead and other team members.
  • 3+ years in a DevOps, SRE, or related role.
  • Basic understanding of cloud computing concepts and AWS.
  • Familiarity with Docker and Kubernetes concepts.
  • Exposure to CI/CD principles and GitHub Actions.
  • Experience with Infrastructure as Code tools like Terraform or CloudFormation.
  • Proficiency in at least one scripting language (e.g., Bash, Python).
  • Basic understanding of monitoring and logging.

Add these skills to join the top 1% applicants for this job

communication
github
talent-acquisition
game-texts
aws
service-mesh
load-balancing
argo-cd
prometheus
terraform
grafana
ci-cd
docker
kubernetes
python
github-actions
bash

The Junior Site Reliability Engineer is responsible for assisting in the design, build, and maintenance of the infrastructure and deployment systems that underpin our live environments. This role is hands-on and highly collaborative, working closely with development teams and senior SREs to ensure our systems are reliable, scalable, and well-instrumented. Junior SREs are expected to learn and apply best practices in building robust, automated solutions, and to ensure their work is repeatable and understandable by others. Every contribution should be accompanied by documentation to support knowledge-sharing within the team and across engineering.

Core Responsibilities

  • Infrastructure Design & Maintenance
  • Assist in building and maintaining infrastructure using infrastructure-as-code (IaC) tools (e.g., Terraform, CloudFormation).
  • Support the provisioning and lifecycle management of production, staging, and other critical environments.
  • Help implement shared infrastructure components (e.g., logging, metrics, service mesh, load balancing).
  • Contribute to improving infrastructure scalability, availability, and performance under the guidance of senior engineers.
  • Collaborate with development teams to provide infrastructure support for their deployment needs.
  • Deployment Systems & CI/CD
  • Support and help extend CI/CD pipelines (GitHub Actions, Argo CD) to improve reliability and automation of deployments.
  • Help promote consistency and best practices across environments for deployment, rollback, and observability.
  • Work with developers to streamline testing and delivery of code to production.
  • Assist in reducing manual steps in the deployment and operations workflows.
  • Reliability, Observability & Tooling
  • Assist in the implementation and maintenance of our monitoring, alerting, and logging infrastructure (Kube-Prometheus-Grafana stack).
  • Help track SLOs/SLIs for core services in partnership with service owners.
  • Learn to identify and help eliminate single points of failure, performance bottlenecks, and sources of instability.
  • Participate in reliability reviews and post-incident analysis.
  • Documentation & Knowledge Sharing
  • Ensure that all systems and processes you work on are accompanied by thorough, up-to-date documentation.
  • Contribute to shared knowledge bases, runbooks, and developer-facing onboarding materials.
  • Participate in internal training sessions and pairings to learn from teammates.
  • Collaboration & Culture
  • Work closely with the SRE Lead and other team members to execute work aligned with team goals.
  • Engage constructively with other teams across engineering.
  • Participate in on-call rotations with strong support from senior members.
  • Embrace a culture of blameless learning, transparency, and continuous improvement.

Qualifications & Skills

  • Experience: 3+ years in a DevOps, SRE, or related role.
  • Cloud: Basic understanding of cloud computing concepts, with some hands-on experience in AWS.
  • Containers & Orchestration: Familiarity with Docker and a foundational understanding of Kubernetes concepts. Experience with AWS ECS is a plus.
  • CI/CD: Exposure to CI/CD principles and tools like GitHub Actions. Familiarity with Argo CD is a bonus.
  • IaC: Some experience with or exposure to Infrastructure as Code tools like Terraform or CloudFormation.
  • Scripting: Proficiency in at least one scripting language (e.g., Bash, Python).
  • Observability: A basic understanding of monitoring and logging. Exposure to Prometheus and Grafana is desirable.
  • Collaboration: Strong communication skills and a desire to learn and work within a team.
  • Problem Solving: An enthusiastic and curious approach to solving technical challenges.

Set alerts for more jobs like Junior Site Reliability Engineer
Set alerts for new jobs by Cubic corporation
Set alerts for new Devops jobs in India
Set alerts for new jobs in India
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙