Blue Yonder is seeking a Lead System Engineer with a strong background in IT Infrastructure, focusing on Unix/Linux, VMware, and Ansible. This role involves managing, configuring, and optimizing systems, maintaining OS patching, and developing automation frameworks using tools like Ansible and Terraform. The engineer will also administer enterprise storage solutions, lead data protection strategies, and ensure effective monitoring and incident response for critical infrastructure, actively participating in CI/CD and DevOps processes.
Good To Have:- Bachelor’s degree in computer science, MIS or engineering related field or equivalent work experience.
- Ability to interact with various levels of professionals.
- Ability to work under pressure in a fast-paced environment and meet tight deadlines.
- Ability to act independently to drive IT goals and changes.
- Identify and escalate situations requiring urgent attention.
- Proficiency in operating system and software.
- Willingness to work under different technologies and take up new technology responsibilities outside core skills.
- Demonstrable experience with Continuous Integration/Delivery principles (CI/CD) and implementation.
- Solid understanding of Restful APIs.
- Experience with the implementation and use of different Application (APM) and/or Infrastructure monitoring tools.
- Ability to work cross-platform, with Windows and Linux.
- Certification is preferable (RHCE or likewise).
- Knowledge of ITIL processes (incident, problem, change management).
- Knowledge of protocols: HTTP, SSL, SSH, WINRM, JMS, JDBC, REST API (ServiceNow and AWX/Tower).
- Familiarity with observability and analysis solutions such as Elastic and Datadog.
- Experience in automation of key functions, including back-up, continuous integration, provisioning.
Must Have:- Manage, configure, optimize, and administer Unix/Linux systems (RHEL, CentOS, Ubuntu, AIX, Solaris) or Windows servers.
- Manage VMware virtualization environments (vSphere, ESXi, vCenter).
- Maintain OS patching, upgrades, and compliance.
- Develop and maintain automation frameworks using Ansible, Terraform, Python, Shell, PowerShell.
- Administer and optimize NetApp ONTAP storage systems.
- Lead data protection strategy using Commvault or similar backup solutions.
- Own end-to-end patch management process for servers, virtualization, and storage.
- Conduct root cause analysis (RCA) and implement preventive measures.
- Ensure effective monitoring, alerting, and incident response.
- Participate in on-call support rotation.
- Actively engage in CI/CD, Agile, and DevOps processes.
- 8+ years of combined related work experience.
- 6+ years of experience in Unix/Linux system engineering.
- 5+ years of experience with VMware technologies.
- 3+ years working experience with Ansible.
- Strong scripting and automation skills (Python, Bash, Shell, PowerShell, Ansible, Terraform).
- Solid knowledge of storage, networking, backup, and security concepts.
- Experience managing hybrid environments (on-premises + cloud, preferably Azure).
- Experience with container platforms (Docker, Kubernetes).
- Experience working with CI/CD tools and Git.
- Intermediate knowledge of Networking (VLAN, subnetting, routing, switching).
- Advanced troubleshooting methodology.
- Fluent English and high oral and written communication.