Site Reliability Engineer

2 Weeks ago • 2-10 Years

About the job

Summary by Outscal

CareStack seeks an SRE with 2+ years of experience in cloud computing (Azure preferred), Linux systems, storage, networking, security, databases, and container orchestration tools like Kubernetes. Proficiency in Python, Go, CI/CD tooling (GitLab, Argo CD, Octopus), and infrastructure monitoring is crucial.

About us

CareStack is a complete cloud-based dental software solution for the scheduling, clinical, billing, patient engagement, and reporting needs of dental offices of any size, whether a single location or a large multi-site DSO with hundreds of locations. The company was founded in 2015 and launched commercially in early 2018. Since then, more than 1000 offices have chosen CareStack as their single source of truth. This is the fastest growth to date in the dental practice management software market, which is dominated by 100-year-old distribution companies.

  • Rated by independent B2B software reviews and research analysts as the most modern, innovative, and customer experience-focused company in the space with the fastest growth in the segment.
  • Important strategic go-to-market partnerships with dental industry leaders like Delta Dental, Darby Dental, and several others.
  • Venture-backed with over $135M raised from leading financial and strategic investors.
  • HQ'd in Orlando, FL with offices in Minnesota, Bangalore, Trivandrum, and Cochin.


Why You Should Join the SRE Team

On our SRE team, we don't just manage systems; we architect and maintain the backbone of digital experiences. We are the guardians of reliability, scalability, and performance. By joining us, you'll be at the heart of innovation, working with cutting-edge technologies to keep our systems running smoothly. You'll learn from some of the best minds in the industry, collaborate with diverse teams, and have a direct impact on the user experience. We embrace a culture of continuous learning, where challenges are opportunities for growth.


What an SRE does here

1. Manage and maintain day-to-day BAU operations, including monitoring system performance, troubleshooting issues, and ensuring high availability.

2. Build infrastructure as code (IaC) patterns that meet security and engineering standards.

3. Build CI/CD pipelines using Octopus, GitLab-CI and cloud-native toolchains like Argo CD.

4. Build and maintain automation scripts and tools to streamline operational processes.

5. Ensure system uptime is observable, and take the necessary actions to triage issues with the respective service teams and stakeholders.

6. Manage the observability setup, including metrics and logging, and extend its capabilities with proficient use of PromQL queries.

7. Build comprehensive, detailed runbooks for detecting issues, remediating them, and restoring services.

8. Collaborate with engineering teams to resolve issues faster during firefighting and help improve the overall process.

9. Support the operations team in managing BAU by monitoring and analyzing system logs and performance metrics to identify areas for improvement and take proactive measures.

10. Stay up to date with industry trends and best practices in SRE, observability, alerting and infrastructure automation.

11. Actively participate in rotational shift/on-call duties to ensure continuous operational support.

12. Communicate effectively with technical peers and team members in both written and verbal formats.

What we are looking for in a new hire

1. At least 2 years of experience as an SRE, with strong knowledge of cloud computing platforms, preferably Azure.

2. Cross-functional knowledge in Linux systems, storage, networking, security, and databases.

3. Experience in container orchestration tools like Kubernetes.

4. Proficiency in languages such as Python and Go.

5. Ability to develop and maintain software written in any programming language.

6. Experience working with continuous integration and continuous delivery tooling and practices (e.g., GitLab, Argo CD, Octopus).

7. Experience in monitoring infrastructure and application uptime and availability to ensure functional and performance objectives are met.

8. Excellent communication and collaboration skills.


Join us online for future opportunities

Website: https://carestack.com/

Instagram: https://www.instagram.com/carestack.people

LinkedIn: https://www.linkedin.com/company/carestack/mycompany/


Note: As part of our interview process, we conduct an initial shortlisting to identify candidates who closely match our requirements. While we strive to notify all applicants about their status, if you do not receive a response from us, please understand that your profile has not been shortlisted at this time.
