Lead System Engineer - Unix/Linux, VMware & Ansible

10 Minutes ago • 8 Years +
System Design

Job Description

Blue Yonder is seeking a Lead System Engineer with a strong background in IT Infrastructure, focusing on Unix/Linux, VMware, and Ansible. This role involves managing, configuring, and optimizing systems, maintaining OS patching, and developing automation frameworks using tools like Ansible and Terraform. The engineer will also administer enterprise storage solutions, lead data protection strategies, and ensure effective monitoring and incident response for critical infrastructure, actively participating in CI/CD and DevOps processes.
Good To Have:
  • Bachelor’s degree in computer science, MIS or engineering related field or equivalent work experience.
  • Ability to interact with various levels of professionals.
  • Ability to work under pressure in a fast-paced environment and meet tight deadlines.
  • Ability to act independently to drive IT goals and changes.
  • Identify and escalate situations requiring urgent attention.
  • Proficiency in operating system and software.
  • Willingness to work under different technologies and take up new technology responsibilities outside core skills.
  • Demonstrable experience with Continuous Integration/Delivery principles (CI/CD) and implementation.
  • Solid understanding of Restful APIs.
  • Experience with the implementation and use of different Application (APM) and/or Infrastructure monitoring tools.
  • Ability to work cross-platform, with Windows and Linux.
  • Certification is preferable (RHCE or likewise).
  • Knowledge of ITIL processes (incident, problem, change management).
  • Knowledge of protocols: HTTP, SSL, SSH, WINRM, JMS, JDBC, REST API (ServiceNow and AWX/Tower).
  • Familiarity with observability and analysis solutions such as Elastic and Datadog.
  • Experience in automation of key functions, including back-up, continuous integration, provisioning.
Must Have:
  • Manage, configure, optimize, and administer Unix/Linux systems (RHEL, CentOS, Ubuntu, AIX, Solaris) or Windows servers.
  • Manage VMware virtualization environments (vSphere, ESXi, vCenter).
  • Maintain OS patching, upgrades, and compliance.
  • Develop and maintain automation frameworks using Ansible, Terraform, Python, Shell, PowerShell.
  • Administer and optimize NetApp ONTAP storage systems.
  • Lead data protection strategy using Commvault or similar backup solutions.
  • Own end-to-end patch management process for servers, virtualization, and storage.
  • Conduct root cause analysis (RCA) and implement preventive measures.
  • Ensure effective monitoring, alerting, and incident response.
  • Participate in on-call support rotation.
  • Actively engage in CI/CD, Agile, and DevOps processes.
  • 8+ years of combined related work experience.
  • 6+ years of experience in Unix/Linux system engineering.
  • 5+ years of experience with VMware technologies.
  • 3+ years working experience with Ansible.
  • Strong scripting and automation skills (Python, Bash, Shell, PowerShell, Ansible, Terraform).
  • Solid knowledge of storage, networking, backup, and security concepts.
  • Experience managing hybrid environments (on-premises + cloud, preferably Azure).
  • Experience with container platforms (Docker, Kubernetes).
  • Experience working with CI/CD tools and Git.
  • Intermediate knowledge of Networking (VLAN, subnetting, routing, switching).
  • Advanced troubleshooting methodology.
  • Fluent English and high oral and written communication.

Add these skills to join the top 1% applicants for this job

saas-business-models
problem-solving
github
game-texts
agile-development
gitlab
networking
incident-response
linux
unix
azure
ansible
terraform
vmware
powershell
jdbc
ci-cd
docker
kubernetes
git
python
shell
bash
machine-learning

Overview:

  • Blue Yonder is the proven leader in artificial intelligence and machine learning (AI/ML)-driven supply chain and retail solutions for 4,000 of the world’s leading retail, manufacturing, and logistics companies. Blue Yonder’s world-class client brands include 75 of the top 100 retailers, 77 of the top 100 consumer goods companies, and 8 of the top 10 global 3PLs. Running Blue Yonder, you can plan to deliver.
  • The Candidate should have prior background working in IT Infrastructure and should have a solid high-level understanding of the underlying IT principles (Systems, Storage, backup, IaC, SaaS, Virtualization, Kubernetes, Containers, CI/CD). Lead System Engineer act as an escalation point for critical issues, ensure systems are secure and compliant through proactive patching, and collaborate across teams to maintain reliable and resilient IT services.

Scope:

  • Manage, configure, Optimize and administer Unix/Linux systems (RHEL, CentOS, Ubuntu, AIX, or Solaris) or Windows servers and VMware virtualization environments (vSphere, ESXi, vCenter)
  • Maintain OS patching, upgrades, and compliance across environments.
  • Develop and maintain automation frameworks for system provisioning, configuration, and operations using tools such as Ansible, Terraform, or scripting (Python, Shell, PowerShell).
  • Implement self-service and automated workflows for routine operational tasks.
  • Drive continuous improvement by identifying opportunities to reduce manual work and enhance system efficiency.
  • Prioritize workload and resolve any technical issues/roadblocks
  • Solid skills in logical troubleshooting, communication, documentation and problem resolution
  • Create and update application run books & appropriate technical documentation
  • Ensure all release processes, policies and procedures are properly communicated and documented
  • Administer and optimize enterprise storage solutions, with a focus on NetApp ONTAP storage systems. This includes managing LUNs, volumes, SAN/NAS protocols, and performance tuning
  • Lead the strategy and operations for data protection using Commvault or similar backup solutions. This includes managing backup policies, performing data restores, and conducting regular disaster recovery testing.
  • Own the end-to-end patch management process for servers, virtualization, and storage.
  • Coordinate and execute patching schedules while minimizing downtime.
  • Conduct root cause analysis (RCA) and implement preventive measures.
  • Ensure effective monitoring, alerting, and incident response for critical infrastructure.
  • Participate in on-call support rotation for high-priority issues.
  • Actively engage in CI/CD, Agile and DevOps process, participate regularly in planning and releases
  • Assist in establishing and enforcing standards that will improve the ease of automating the build process and the development environments
  • Manage and maintain enterprise infrastructure tools as the primary subject matter expert
  • Automate, deploy and manage virtualization infrastructure
  • Provide support, and implementation of security policies, compliance, governance and best practices

Our current technical environment:

  • Operating System: Windows & Linux
  • Hyper converged Environment: VMWare
  • Programming languages: Python, PowerShell, and Shell scripting
  • Cloud Architecture: MS Azure (Terraform, ARM templates, AKS, Virtual Networks, Azure AD)
  • Configuration management tools: Ansible and Terraform
  • DevOps Tools: GIT, GitLab/GitHub and Docker
  • Storage: NetApp

What you’ll do:

  • Manage, configure, Optimize and administer Unix/Linux systems (RHEL, CentOS, Ubuntu, AIX, or Solaris) or Windows servers and VMware virtualization environments (vSphere, ESXi, vCenter)
  • Maintain OS patching, upgrades, and compliance across environments.
  • Develop and maintain automation frameworks for system provisioning, configuration, and operations using tools such as Ansible, Terraform, or scripting (Python, Shell, PowerShell).
  • Implement self-service and automated workflows for routine operational tasks.
  • Drive continuous improvement by identifying opportunities to reduce manual work and enhance system efficiency.
  • Prioritize workload and resolve any technical issues/roadblocks
  • Solid skills in logical troubleshooting, communication, documentation and problem resolution
  • Create and update application run books & appropriate technical documentation
  • Ensure all release processes, policies and procedures are properly communicated and documented
  • Administer and optimize enterprise storage solutions, with a focus on NetApp ONTAP storage systems. This includes managing LUNs, volumes, SAN/NAS protocols, and performance tuning
  • Lead the strategy and operations for data protection using Commvault or similar backup solutions. This includes managing backup policies, performing data restores, and conducting regular disaster recovery testing.
  • Own the end-to-end patch management process for servers, virtualization, and storage.
  • Coordinate and execute patching schedules while minimizing downtime.
  • Conduct root cause analysis (RCA) and implement preventive measures.
  • Ensure effective monitoring, alerting, and incident response for critical infrastructure.
  • Participate in on-call support rotation for high-priority issues.
  • Actively engage in CI/CD, Agile and DevOps process, participate regularly in planning and releases
  • Assist in establishing and enforcing standards that will improve the ease of automating the build process and the development environments
  • Manage and maintain enterprise infrastructure tools as the primary subject matter expert
  • Automate, deploy and manage virtualization infrastructure

What we are looking for:

  • Bachelor’s degree in computer science, MIS or engineering related field or equivalent work experience
  • 8+ years of combined related work experience
  • 6+ years of experience in Unix/Linux system engineering (RHEL, CentOS, Ubuntu, AIX, or Solaris) or Windows.
  • 5+ years of experience with VMware technologies (vSphere, ESXi, vCenter).
  • 3+ years working experience with Ansible configuration and orchestration
  • Strong scripting and automation skills (Python, Bash, Shell, PowerShell, Ansible or Terraform).
  • Solid knowledge of storage, networking, backup, and security concepts.
  • Experience managing hybrid environments (on-premises + cloud, preferably Azure).
  • Experience with container platforms (Docker, Kubernetes).
  • Experience in Cloud Technologies – Private, Public, Hybrid, IaaS+, PaaS, SaaS
  • Experience working with CI/CD tools and Git
  • Intermediate knowledge of Networking (VLAN, sub netting, routing and switching)
  • Ability to interact with various levels of professionals
  • Ability to work under pressure in a fast-paced environment and meet tight deadlines
  • Ability to act independently to drive IT goals and changes
  • Identify and escalate situations requiring urgent attention
  • Proficiency in operating system and software
  • Willing to work under different technologies and take up new technology responsibilities outside the core skills
  • Demonstrable experience with Continuous Integration/Delivery principles (ci/cd) and implementation
  • Strong Scripting experience like Python, Bash Shell, PowerShell, etc.
  • Solid understanding of Restful APIs
  • Advanced troubleshooting methodology
  • Ability to judge priorities and adjust their work accordingly
  • Experience with the implementation and use of different Application (APM) and/or Infrastructure monitoring tools.
  • Being able to work cross platform, with Windows and Linux. This helps understand hybrid platform environment and thus helps design considerations. Certification is preferable (RHCE or likewise).
  • Knowledge of ITIL processes (incident, problem, change management).
  • Knowledge of protocols: HTTP, SSL, SSH, WINRM, JMS, JDBC, REST API (ServiceNow and AWX/Tower), etc.
  • Familiarity with observability and analysis solutions such as Elastic and Datadog.
  • Experience in automation of key functions, including back-up, continuous integration, provisioning is a huge plus
  • Fluent English and high oral and written communication

Set alerts for more jobs like Lead System Engineer - Unix/Linux, VMware & Ansible
Set alerts for new jobs by Blue Yonder
Set alerts for new System Design jobs in India
Set alerts for new jobs in India
Set alerts for System Design (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙