This role demands extensive hands-on experience working as an SRE engineer for large-scale, customer-facing Cloud applications. The candidate should have a good understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts. The candidate should possess excellent troubleshooting and problem-solving skills. They will be expected to represent the SRE organization in design reviews and operational readiness exercises for new and existing services. They will also be required to collaborate with technical and non-technical teams and analyze statistics to come up with a clear picture of the current state of our system. Good working knowledge of Oracle and Cassandra databases will be beneficial. The candidate should be passionate about automating manual operations and improving them through repeated iteration. They should have a good understanding of networking and load balancing concepts and should be able to lead a small team and come up with innovative solutions. The candidate should be self-motivated, capable of making business-critical decisions, and comfortable working in a dynamic, ever-changing environment. They should be proactive in dealing with critical production issues and take them to closure while working with required partners. Participate in an on-call rotation, providing hands-on technical expertise during service impacting events.