About us

CareStack is a complete cloud-based dental software solution for the scheduling, clinical, billing, patient engagement, and reporting needs of dental offices of any size - whether it's a single location or a large multi-site DSO with hundreds of locations. The company was founded in 2015 and launched commercially in early 2018. Since then, more than 1,000 offices have chosen CareStack as their single source of truth. This is the fastest growth to date in the dental practice management software market, a market dominated by 100-year-old distribution companies.

Why You Should Join the SRE Team

On our SRE team, we don't just manage systems; we architect and maintain the backbone of digital experiences. We are the guardians of reliability, scalability, and performance. By joining us, you'll be at the heart of innovation, working with cutting-edge technologies to keep our systems running smoothly. You'll learn from some of the best minds in the industry, collaborate with diverse teams, and have a direct impact on the user experience. We embrace a culture of continuous learning, where challenges are opportunities for growth.

What an SRE would do here

1. Manage and maintain day-to-day BAU operations, including monitoring system performance, troubleshooting issues, and ensuring high availability.
2. Build infrastructure as code (IaC) patterns that meet security and engineering standards.
3. Build CI/CD pipelines using Octopus, GitLab CI, and cloud-native toolchains like Argo CD.
4. Build and maintain automation scripts and tools to streamline operational processes.
5. Ensure system uptime is observable, and triage issues with the respective service teams and stakeholders.
6. Manage the observability setup, including metrics and logging, and extend its capabilities using PromQL queries.
7. Build comprehensive, detailed runbooks for detecting issues, remediating them, and restoring services.
8. Collaborate with engineering teams to resolve incidents faster during firefighting and help improve the overall process.
9. Support the operations team in managing BAU by monitoring and analyzing system logs and performance metrics to identify areas for improvement and take proactive measures.
10. Stay up to date with industry trends and best practices in SRE, observability, alerting, and infrastructure automation.
11. Actively participate in rotational shift/on-call duties to ensure continuous operational support.
12. Communicate effectively with technical peers and team members in both written and verbal formats.

What we are looking for in a new hire

1. At least 2 years of experience as an SRE, with strong knowledge of cloud computing platforms, preferably Azure.
2. Cross-functional knowledge of Linux systems, storage, networking, security, and databases.
3. Experience with container orchestration tools like Kubernetes.
4. Proficiency in a language such as Python or Go.
5. Ability to develop and maintain software written in any programming language.
6. Experience working with continuous integration and continuous delivery tooling and practices (e.g., GitLab, Argo CD, Octopus).
7. Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives are met.
8. Excellent communication and collaboration skills.

Join us online for future opportunities

Website: https://carestack.com/
Instagram: https://www.instagram.com/carestack.people
LinkedIn: https://www.linkedin.com/company/carestack/mycompany/

Note: As part of our interview process, we conduct an initial shortlisting to identify candidates who closely match our requirements. While we strive to notify all applicants about their status, if you do not receive a response from us, please understand that your profile has not been shortlisted at this time.