Site Reliability Engineer

5 Months ago • 9-12 Years
Devops

Job Description

Capgemini Engineering is seeking a Site Reliability Engineer to design, implement, and maintain scalable and reliable compute infrastructure, focusing on Wintel, Linux, VMWare, and Redhat KVM environments. This role involves collaborating with development teams, automating tasks, monitoring system performance, and troubleshooting issues. The engineer will also develop and maintain tools for deployment, monitoring, and operations, ensuring high availability and implementing best practices for security and compliance. The candidate will be part of a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential.
Must Have:
  • Design and maintain compute infrastructure.
  • Collaborate with development teams.
  • Automate tasks to improve efficiency.
  • Monitor system performance and identify bottlenecks.
  • Develop and maintain deployment and monitoring tools.
  • Troubleshoot issues in different environments.
  • Implement security and compliance best practices.
  • Proficiency in scripting languages.
  • Experience with infrastructure tools.

Add these skills to join the top 1% applicants for this job

cpp
gitlab
postgresql
mysql
networking
linux
kvm
prometheus
ansible
terraform
grafana
elk
vmware
ci-cd
python
bash
jenkins
java

At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world’s most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and engineering services across all industries. Join us for a career full of opportunities. Where you can make a difference. Where no two days are the same.

Job Description

  • Design, implement, and maintain scalable and reliable compute infrastructure, with a focus on Wintel, Linux, VMWare, and Redhat KVM environments.
  • Collaborate with development teams to ensure applications are designed for reliability and performance across different operating systems and virtualization platforms.
  • Automate repetitive tasks to improve efficiency and reduce manual intervention, specifically within Wintel and Linux systems.
  • Monitor system performance, identify bottlenecks, and implement solutions to improve overall system reliability in VMWare and Redhat KVM environments.
  • Develop and maintain tools for deployment, monitoring, and operations tailored to Wintel, Linux, VMWare, and Redhat KVM.
  • Troubleshoot and resolve issues in development, test, and production environments, focusing on compute-related challenges.
  • Participate in on-call rotations and respond to incidents promptly, ensuring high availability of compute resources.
  • Implement best practices for security, compliance, and data protection within Wintel, Linux, VMWare, and Redhat KVM systems.
  • Document processes, procedures, and system configurations specific to the compute infrastructure.

Primary Skills

  • Site Reliability Engineer SRE
  • Compute Infrastructure
  • Wintel Administration
  • Linux Administration
  • VMWare Administration
  • Redhat
  • Proficiency in scripting languages Python, Java, C/C++, Bash
  • Infrastructure tools Terraform, Ansible
  • Experience with monitoring and logging tools Prometheus, Grafana, ELK stack
  • Solid understanding of networking, security, and system administration within Wintel and Linux environments.
  • Experience with CI/CD pipelines and tools Jenkins, GitLab CI
  • Knowledge of database management systems MySQL, PostgreSQL

Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world, while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong over 55-year heritage, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions leveraging strengths from strategy and design to engineering, all fueled by its market leading capabilities in AI, generative AI, cloud and data, combined with its deep industry expertise and partner ecosystem.

Set alerts for more jobs like Site Reliability Engineer
Set alerts for new jobs by Capgemini
Set alerts for new Devops jobs in India
Set alerts for new jobs in India
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙