Sr. Site Reliability Administrator
OpenText
Job Summary
As a Site Reliability Engineer at OpenText, you will ensure the reliability, scalability, and performance of our cloud infrastructure. This role requires a thorough understanding of application designs and hands-on technical experience. You will collaborate with development and operations teams to design, implement, and maintain resilient and efficient systems. Key responsibilities include deploying services, managing migrations and disaster recovery, troubleshooting performance issues, enhancing automated deployment strategies (CI/CD), and utilizing AWS services. You will also work with third-party vendors, prioritize project deliveries, and participate in on-call rotations for 24x7 support.
Must Have
- Ensure cloud infrastructure reliability, scalability, and performance.
- Learn and understand current application designs thoroughly.
- Work with development and operations teams on system design and maintenance.
- Collaborate with Product Development, Professional Services, and Customer Support.
- Expertise in application/service migration and disaster recovery.
- Troubleshoot and resolve infrastructure and application performance issues.
- Develop and enhance automated deployment strategies (CI/CD).
- Utilize AWS services for cloud infrastructure build and maintenance.
- Work with third-party vendors and their support organizations.
- Prioritize work for timely project deliveries.
- Available for weekend maintenance activities, upgrades, and deployments.
- Participate in on-call rotations for 24x7 critical system support.
Job Description
OPENTEXT - THE INFORMATION COMPANY
OpenText is a global leader in information management, where innovation, creativity, and collaboration are the key components of our corporate culture. As a member of our team, you will have the opportunity to partner with the most highly regarded companies in the world, tackle complex issues, and contribute to projects that shape the future of digital transformation.
AI-First. Future-Driven. Human-Centered.
At OpenText, AI is at the heart of everything we do—powering innovation, transforming work, and empowering digital knowledge workers. We're hiring talent that AI can't replace to help us shape the future of information management. Join us.
YOUR IMPACT
As a Site Reliability Engineer at OpenText, you will be responsible for ensuring our cloud infrastructure's reliability, scalability, and performance. You must be able to learn and understand current application designs thoroughly and have hands-on technical experience. You will work closely with development and operations teams to design, implement, and maintain systems that are resilient and efficient.
WHAT THE ROLE OFFERS
- Collaborate with Product Development, Professional Services, and Customer Support teams to deploy services and applications and make these services more stable and resilient.
- Be an expert in migrating applications and services and in planning and executing disaster recovery.
- Troubleshoot and resolve issues related to infrastructure and application performance and provide root cause analysis.
- Develop and enhance OpenText automated deployment strategies (CI/CD) to streamline development and deployment processes.
- Utilize AWS services to build and maintain cloud infrastructure.
- Work with third-party vendors and their support organizations on any issue that affects OpenText-hosted solutions.
- Prioritize work to ensure timely project deliveries.
- Available to work on weekends to perform scheduled maintenance activities, upgrades, and deployments.
- Participate in on-call rotations to provide 24x7 support for critical systems.
WHAT YOU NEED TO SUCCEED
- Minimum 5 years of experience in Cloud Applications and Platforms.
- Hands-on experience in cloud computing (AWS, Google Cloud, Azure).
- In-depth knowledge of Linux and Windows operating systems and system administration.
- Experience designing, implementing, and managing Kubernetes clusters to ensure high availability and scalability.
- Experience developing and maintaining Helm charts for deploying and managing container-based applications.
- Solid understanding of Terraform for infrastructure provisioning and management.
- Able to automate tasks with Python, Go, Bash, PowerShell, or other shell scripting.
- Experience with CI/CD tools and practices (Ansible, Chef, Puppet, GitLab, Concourse, etc.).
- Understanding of security and network protocols, such as SFTP, VPN, HTTPS, and SSH.
- Familiarity with load balancers, SMTP, DNS, and networking protocols.
- Strong analytical and problem-solving skills and the ability to work under pressure.
- Ability to analyze and debug complex systems and maintain detailed documentation.
- Strong communication and collaboration skills.
- A fast self-learner with the ability to adapt to changing applications and platforms.
OpenText's efforts to build an inclusive work environment go beyond simply complying with applicable laws. Our Employment Equity and Diversity Policy provides direction on maintaining a working environment that is inclusive of everyone, regardless of culture, national origin, race, color, gender, gender identification, sexual orientation, family status, age, veteran status, disability, religion, or other basis protected by applicable laws.
If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please contact us at hr@opentext.com. Our proactive approach fosters collaboration, innovation, and personal growth, enriching OpenText's vibrant workplace.