Senior Platform Support Engineer

3 Months ago • 6-8 Years
Devops

Job Description

As a Cloud Operations Engineer in the Cloud Operations Center, you will be instrumental in ensuring the 24x7x365 smooth operation of Saviynt’s Enterprise Identity Cloud. This role emphasizes maintaining platform stability, performance, and reliability, with a strong focus on application layer support and operational ownership. You will collaborate with other operations team members, development, and engineering to resolve issues, implement improvements, and provide exceptional support. This is an opportunity for problem-solvers who enjoy operational challenges in a dynamic cloud environment and want to see their work through to completion.
Good To Have:
  • Experience with Grafana systems and dashboards
  • Experience with automation tools and scripting languages (Python, Bash)
  • Experience working in a SaaS environment
Must Have:
  • 6-8 years of experience in IT/Cloud operations and application support (Java)
  • Strong pod-level troubleshooting skills in AKS/EKS
  • Analyze application and DB performance issues (Java, Grails, Hibernate)
  • Oversee monitoring of SaaS applications and infrastructure (Kubernetes on AWS/Azure, VPN, Elastic Search, MySQL)
  • Understanding of DNS, IP addressing, Networking, and LDAP
  • Participate in on-call escalations and provide technical guidance during incidents
  • Communicate proactively with customers on technical issues
  • Guide junior engineers technically
  • Manage full lifecycle of alerts, incidents, and service requests
  • Develop and maintain operational procedures, runbooks, and knowledge base articles
  • Drive continuous improvement initiatives for operational efficiency
  • Collaborate with backend engineering and development teams
  • Ensure adherence to defined SLAs and KPIs
  • Maintain operational documentation
  • Ensure compliance with security and compliance policies
  • Plan and coordinate scheduled maintenance activities
  • Bachelor's degree in Computer Science, Information Technology, Engineering, or related field
  • Strong understanding of cloud computing concepts, architectures, and services (AWS and Azure)
  • Working knowledge of containerization and orchestration technologies (Kubernetes)
  • End-to-end technical accountability and operational ownership
  • Willingness to work in a 24/7 operating model
  • Experience managing and troubleshooting network connectivity, including VPNs
  • Familiarity with monitoring tools and practices
  • Hands-on experience with log management and analysis tools (Elastic Search)
  • Working knowledge of database systems (MySQL), including L2 troubleshooting and performance monitoring
  • Experience with ITSM systems (FreshService)
  • Excellent problem-solving, analytical, and troubleshooting skills
  • Strong communication (written and verbal), interpersonal, and presentation skills
  • Ability to work effectively under pressure and manage multiple priorities
  • Experience in developing and documenting operational procedures and runbooks
Perks:
  • Competitive total rewards package
  • Learning and growth opportunities
  • Challenging yet rewarding work
  • Welcoming and positive work environment

Add these skills to join the top 1% applicants for this job

saas-business-models
account-management
problem-solving
performance-analysis
talent-acquisition
mysql
networking
dns
ldap
incident-response
aws
azure
grafana
hibernate
kubernetes
python
bash
java


Senior  Engineer - Cloud Operations (Platform Support)As a Cloud Operations Engineer in our Cloud Operations Center, you will be a key player in ensuring the 24x7x365 smooth operation of Saviynt’s Enterprise Identity Cloud. This role focuses on maintaining the stability, performance, and reliability of our platform with a strong emphasis on application layer support and operational ownership. You will be working closely with other operations team members, development, and engineering to resolve issues, implement improvements, and provide exceptional support. This is an opportunity for someone who enjoys operational challenges and problem-solving in a dynamic cloud environment and wants to see their work through to completion.
WHAT YOU WILL BE DOING
·       Strong pod-level troubleshooting skills in AKS/EKS (not just restarting pods).
·       Analyze application and DB (RDS, MySQL) performance issues.Deeply investigate and analyze application performance issues (Java, Grails, Hibernate), identifying root causes and implementing solutions.
·       Oversee the monitoring of our SaaS applications and underlying infrastructure (Kubernetes on AWS and Azure, VPN connections, customer applications, Elastic Search, MySQL) for alerts and performance issues.
·       Strong understanding of basic computing concepts like DNS, IP addressing, Networking, and LDAP.
·       Effectively participate and contribute in on-call escalations with a strong operational mindset and provide technical guidance during critical incidents.
·       Proactively communicate with customers on technical issues when required.
·       Ability to guide junior engineers when needed technically.
·       Manage the full lifecycle of alerts, incidents, and service requests reported through FreshService, ensuring timely and accurate logging, prioritization, resolution, and escalation.
·       Develop, implement, and maintain operational procedures, runbooks, and knowledge base articles to standardize incident resolution and service request fulfillment.
·       Drive continuous improvement initiatives to optimize operational efficiency, reduce incident rates, and improve service request turnaround times.
·       Collaborate with backend engineering and development teams to troubleshoot complex issues, identify root causes, and implement preventative measures.
·       Ensure adherence to defined SLAs (Service Level Agreements) and KPIs (Key Performance Indicators) for operational performance.Maintain operational documentation, including system diagrams, contact lists, and escalation paths.
·       Ensure compliance with relevant security and compliance policies.
·       Plan and coordinate scheduled maintenance activities with minimal impact to service availability.
 
WHAT YOU BRING
·       Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
·       Minimum of 6-8 years of experience in IT/Cloud operations and application support (specifically Java apps), with knowledge of cloud infrastructure (AWS and Azure).
·       Strong experience with application support (Java, Grails, Hibernate) and performance analysis in a production environment, able to pinpoint a performance degradation through analysis.
·       Strong understanding of cloud computing concepts, architectures, and services on both AWS and Azure platforms.
·       Working knowledge of containerization and orchestration technologies, specifically Kubernetes.End-to-end technical accountability and operational ownership.Willingness to work in a 24/7 operating model.
·       Experience managing and troubleshooting network connectivity, including VPNs and connections to external networks.
·       Familiarity with monitoring tools and practices, with experience in setting up and responding to alerts.
·       Hands-on experience with log management and analysis tools, preferably Elastic Search.
·       Working knowledge of database systems, preferably MySQL, including L2 troubleshooting and performance monitoring.
·       Experience with ITSM (IT Service Management) systems, preferably FreshService, including incident, problem, and service request management processes.
·       Excellent problem-solving, analytical, and troubleshooting skills with a data-driven approach.Experience with Grafana systems and dashboards is a plus.
·       Strong communication (written and verbal), interpersonal, and presentation skills.
·       Ability to work effectively under pressure and manage multiple priorities in a fast-paced environment.
·       Experience in developing and documenting operational procedures and runbooks.
·       Experience with automation tools and scripting languages (e.g., Python, Bash) is a plus.
·       Experience working in a SaaS environment is highly desirable.
·       Working knowledge of database systems, preferably MySQL, including L2 troubleshooting and performance monitoring.
·       Experience with ITSM (IT Service Management) systems, preferably FreshService, including incident, problem, and service request management processes.
·       Excellent problem-solving, analytical, and troubleshooting skills with a data-driven approach.Experience with Grafana systems and dashboards is a plus.
·       Strong communication (written and verbal), interpersonal, and presentation skills.
·       Ability to work effectively under pressure and manage multiple priorities in a fast-paced environment.
·       Experience in developing and documenting operational procedures and runbooks.
·       Experience with automation tools and scripting languages (e.g., Python, Bash) is a plus.
·       Experience working in a SaaS environment is highly desirable.
 
We offer you a competitive total rewards package, learning and tremendous opportunities to grow and advance in your career. At Saviynt, it is not typical for an individual to be hired at or near the top of the range for their role and final compensation decisions are dependent on many factors including, but are not limited to location; skill sets; experience and training; licensure and certifications; and other relevant business and organizational needs. A reasonable estimate of the current range is $Min,000 - $Max,000 annually.
You may also be eligible to participate in a Saviynt discretionary bonus plan, subject to the rules governing the program, whereby an award, if any, depends on various factors, including, without limitation, individual and organizational performance.If required for this role, you will:Complete security & privacy literacy and awareness training during onboarding and annually thereafterReview (initially and annually thereafter), understand, and adhere to Information Security/Privacy Policies and Procedures such as (but not limited to):
> Data Classification, Retention & Handling Policy> Incident Response Policy/Procedures> Business Continuity/Disaster Recovery Policy/Procedures> Mobile Device Policy> Account Management Policy> Access Control Policy> Personnel Security Policy> Privacy Policy
Saviynt is an amazing place to work. We are a high-growth, Platform as a Service company focused on Identity Authority to power and protect the world at work. You will experience tremendous growth and learning opportunities through challenging yet rewarding work that directly impacts our customers, all within a welcoming and positive work environment. If you're resilient and enjoy working in a dynamic environment you belong with us!
Saviynt is an equal opportunity employer and we welcome everyone to our team.  All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.

Set alerts for more jobs like Senior Platform Support Engineer
Set alerts for new jobs by Saviynt
Set alerts for new Devops jobs in India
Set alerts for new jobs in India
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙