Job Description
Location: Blumenau SC;
In this role, you will be part of the Digital Innovation Platform (DIP), responsible for driving the next generation of connected health applications.
The HSDP Tasy Operations team is growing and needs a Cloud Operations Analyst with a focus on monitoring and observability, to strengthen the infrastructure that supports our SaaS solutions in an AWS environment.
You will be responsible for creating, maintaining, and evolving monitoring platforms, ensuring that our cloud environments operate stably, securely, and efficiently, in addition to collaborating with technical teams in diagnosing and resolving incidents.
Responsibilities:
- Design, install, configure, and maintain Zabbix, Prometheus, and Grafana platforms, including architecture, templates, auto-discovery, and monitoring via agent and SNMP.
- Develop scripts and APIs to automate monitoring workflows, such as host, template, and trigger creation.
- Create and customize dashboards, metrics, and alerts for performance monitoring, critical events, and availability.
- Define and manage severity levels, alarms, and notifications, ensuring quick responses to incidents.
- Monitor servers, networks, and applications in cloud environments, correlating infrastructure and application metrics.
- Utilize scripts (Bash, Shell, Python) for task automation and report generation.
- Provide support for databases used by monitoring tools (MySQL, PostgreSQL, etc.).
- Support the analysis of performance and stability of monitored systems and infrastructure, collaborating with operations, architecture, and DevOps teams.
What we expect from you:
- Degree in Information Systems and/or Computer Science or related field.
- Experience in cloud environments - AWS.
- Experience with SaaS solutions operations and technical support for critical applications.
- Proficiency in Zabbix, Prometheus, and Grafana (configuration, customization, and automation).
- Knowledge of Linux, Windows Server, databases (Oracle is a differential), and networks.
- Knowledge of automation tools (Lambda, Python, Terraform).
- Familiarity with network protocols and services (TCP/IP, DNS, HTTP).
- Ability to diagnose and resolve complex infrastructure and performance problems.
- Experience with ServiceNow / Jira is a differential.
Differentials:
- Zabbix Certifications (Certified Specialist, Professional or similar).
- Grafana Certification or proven observability experience.
- AWS Certification (Cloud Practitioner, SysOps or equivalent).
- Knowledge of Terraform, Kubernetes (EKS) and DevOps practices.
- Intermediate English and/or Spanish for collaboration with international teams.
How we work together
We believe that we generate a greater impact when we are together than apart. For our office team members, this means working in person at least three days a week, to collaborate and connect with others.
On-site positions require the employee to be present at the company's facilities at all times. For employees in a field position, their duties are performed more effectively outside the company's facilities, usually at client or supplier facilities.
About Philips
We are a health technology company. We built our entire company on the belief that all human beings are important and we will not stop until everyone, everywhere, has access to the quality healthcare we all deserve. Do the work of your life to help improve the lives of others.
If you are interested in this role and have many, but not all, of the required experiences, we encourage you to apply. You may still be the right candidate for this or other opportunities at Philips. Learn more about our commitment to diversity and inclusion here.