SR Site Reliability Engineer

1 Month ago • 4 Years + • DevOps

About the job

Job Description

Experienced SRE needed to manage Kubernetes, CI/CD pipelines, Grafana/Prometheus, Microsoft SQL Server & VMware. Strong automation skills, DevOps background & passion for reliability essential.
Must have:
  • Kubernetes experience
  • CI/CD pipeline
  • Grafana/Prometheus
  • Microsoft SQL Server
Good to have:
  • VMware experience
  • Azure/AWS
  • Windows OS
  • Network tech
Perks:
  • Professional growth
  • Impactful work
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

About the job

Job Summary:

  • We are looking for an experienced Site Reliability Engineer (SRE) to join our growing team. This role demands a focus on enhancing the reliability, efficiency, and performance of our infrastructure and hosted applications. The role will collaborate with other SRE team members, applying your expertise in coding, algorithms, complexity analysis, and large-scale system design to tackle complex challenges. The ideal candidate will have hands-on experience in deploying and managing Kubernetes clusters, utilizing Grafana and Prometheus for monitoring, and supporting Microsoft SQL Server and VMware virtualization. Expertise in cloud deployments, Windows operating systems, and network technologies are a plus. Prior experience in DevOps, including CI/CD pipeline management and automation, is highly desirable. We emphasize self-directed, impactful work while providing the necessary support and mentorship for professional growth.

Responsibilities

Essential Responsibilities:

  • Deploy, manage, and scale Kubernetes clusters to support containerized applications.
  • Design, implement, and optimize continuous integration and continuous deployment (CI/CD) pipelines to streamline code deployment and enhance delivery efficiency.
  • Implement and maintain monitoring solutions with Grafana and Prometheus to ensure system reliability.
  • Drive efficiency by automating manual and repetitive tasks to reduce operational costs.
  • Assist in managing and enhancing the entire production stack to deliver reliable systems for a diverse range of internal and external customers.
  • Assist our internal Database teams to optimize and support Microsoft SQL Server databases, including performance tuning and maintenance.
  • Assist our internal Virtualization/Server teams to ensure high availability and efficient resource use in our technology stack.
  • Deploy and manage cloud resources on Azure or AWS, focusing on cost efficiency and performance optimization.
  • Support and troubleshoot Windows operating systems and network technologies.

Required Skills:

  • Demonstrated ability to automate processes and improve efficiency.
  • Strong focus on managing and optimizing the full production stack for a wide user base.
  • Proven experience as a DevOps Engineer, with a strong foundation in Kubernetes, Grafana, Prometheus, Microsoft SQL Server, and VMware
  • Experience in designing, implementing, and managing CI/CD pipelines using tools like Jenkins, GitHub, or similar.
  • Experience with Windows operating systems and network technologies

Qualifications

Qualifications:

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • 4+ years of relevant experience in a Site Reliability Engineer or similar role, with a background in DevOps and CI/CD pipeline management preferred.

View Full Job Description

About The Company

Karnataka, India (On-Site)

Karnataka, India (On-Site)

Karnataka, India (On-Site)

Karnataka, India (On-Site)

Karnataka, India (Hybrid)

Karnataka, India (Hybrid)

Karnataka, India (Hybrid)

View All Jobs

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug