This position assists with the implementation, operation, monitoring, recovery, and performance tuning of infrastructure and application services at symplr. The DevOps team augments the symplr Development and IT teams by focusing on automation, CI/CD, and operating production systems using a software engineering approach.
DevOps goals include improving system performance, increasing operational observability, enhancing system stability, and reducing software delivery time.
Duties & Responsibilities
- Be a champion for department initiatives and values by ensuring all actions promote the department’s mission statement
- Collaborate with Engineering Managers, Architects, and Developers throughout product release cycles.
- Automate product deployments across environments by integrating continuous integration (CI) and continuous delivery (CD) tools, while following monitoring and change management practices.
- Identify and resolve performance and scalability issues, proactively addressing bottlenecks to enhance system stability and performance across CI/CD pipelines.
- Assess architectural risks such as single points of failure and propose resilient solutions to improve system reliability.
- Continuously explore automation opportunities to reduce operational workloads through scripting, new tools, and process improvements.
- Document standard operating procedures (SOPs) for common tasks such as deployments, configurations, and troubleshooting.
- Implement monitoring, alerting, notification, and metrics collection for:
  - Infrastructure and application performance
  - System uptime
  - Error rate
- Ticket handling and support:
  - Handle tickets with clear communication and involve the correct stakeholders.
  - Complete tickets within the SLA; any delay or improper ticket should be clearly communicated and documented.
  - Close tickets with proper comments, including resolution steps and screenshots.
  - Raise repetitive tickets in the standup call for brainstorming, with the aim of resolving them through automation where necessary.
Skills Required
- 3+ years of experience with any public cloud provider such as Microsoft Azure, Amazon Web Services (AWS), or Google Cloud Platform (GCP), as well as on-premises servers
- Good knowledge of CI/CD tools such as Azure DevOps (ADO), GitHub Actions, and Jenkins
- Good understanding of DevOps practices.
- Good understanding of web servers and databases
- [Optional] Good understanding of Docker and Kubernetes.
- Good scripting knowledge and understanding of software development life cycle models.
- Proficiency with IaC tools such as Terraform and configuration management tools such as Ansible
- Solid understanding of standard TCP/IP networking, load balancing, and common protocols such as DNS and HTTPS
- Knowledge of the fundamental aspects of release automation (packaging, dependencies, promotion, deployment, compliance)
- Monitoring and logging: experience with any application monitoring and logging tools (e.g. Datadog, New Relic, AppDynamics, Application Insights, ELK, Prometheus).
- A passion for collecting, evaluating, and improving performance metrics.
- Excellent time management, resource organization, and prioritization skills, and the ability to multitask in a fast-paced environment
- Ability to work quickly and efficiently with minimal supervision
- Excellent written and verbal communication skills
Qualifications
- Have HEART. To work here, you must be:
  - Humble – self-aware and respectful
  - Effective – measurably move the needle & immeasurably add value
  - Adaptable – innately curious and constantly changing
  - Remarkable – stand out in some way
  - Transparent – openly and honestly sharing knowledge
- 3+ years of Systems Engineering experience in the following areas:
  - Cloud platforms (Azure, AWS) and on-premises servers
  - Windows and Linux servers
  - Application monitoring tools (Datadog, New Relic, AppDynamics, Application Insights)
  - Log aggregation tools (Datadog, ELK, etc.)
  - PowerShell, Bash, or Python scripting
  - CI/CD tools (Azure Pipelines, GitHub Actions, Jenkins, Octopus, etc.)
  - Infrastructure management tools (Terraform, Ansible, etc.)
  - Application hosting (IIS, Apache, Tomcat)
- Bachelor’s degree or equivalent experience
- https://www.symplr.com/careers/india