Sr. Cloud DevOps Engineer
Synechron
Job Summary
Synechron is seeking a highly experienced Sr. Engineer specializing in Site Reliability Engineering (SRE) and AWS cloud infrastructure to lead the development and management of scalable, reliable, and secure systems. The successful candidate will oversee the deployment, automation, and operational health of enterprise applications, ensuring high availability and resilience while supporting continuous improvement initiatives. This role offers the opportunity to drive innovation, optimize operational processes, and influence the technical architecture for mission-critical workloads.
Must Have
- Extensive experience with AWS services such as EC2, Lambda, S3, RDS, CloudFormation, and CloudWatch
- Proficiency in scripting and automation using Bash, Python, or PowerShell
- Strong understanding of Infrastructure as Code (IaC) using Terraform or CloudFormation
- Hands-on experience with containerization (Docker) and orchestration platforms (Kubernetes, Helm)
- Knowledge of configuration management tools such as Ansible or Chef
- Experience with monitoring, logging, and observability tools like Prometheus, Grafana, ELK stack, Splunk, or CloudWatch
- Fundamentals of network security, firewalls, VPNs, and security best practices
- 7+ years of hands-on experience in cloud operations, infrastructure automation, and Site Reliability Engineering
- Proven track record managing high-availability, large-scale cloud environments in enterprise or regulated industries
- Demonstrated expertise in automation, scripting, and CI/CD pipeline development
- Experience performing incident and problem management, including root cause analysis and remediation
- Strong background with security standards, vulnerability mitigation, and compliance practices
- Bachelor’s degree in Computer Science, Information Technology, or related field
Good to Have
- Familiarity with multi-cloud or hybrid cloud architectures (GCP, Azure)
- Experience with automation of deployment pipelines and CI/CD tools such as Jenkins, GitLab CI, or Azure DevOps
- Knowledge of cloud security standards (e.g., CIS benchmarks, ISO 27001)
- Exposure to serverless technologies like AWS Lambda or Fargate
- Experience with cross-stack automation and compliance automation tools
- Familiarity with Istio or other service mesh solutions
- Experience with automation frameworks and scripting for security and compliance tasks
- Knowledge of SIEM solutions, vulnerability management tools, and security benchmarks
- Prior experience with financial services infrastructure or banking environments
- Certifications such as AWS Certified Solutions Architect – Professional, AWS Certified DevOps Engineer, or equivalent
Job Description
Software Requirements
Required Skills:
- Extensive experience with AWS services such as EC2, Lambda, S3, RDS, CloudFormation, and CloudWatch
- Proficiency in scripting and automation using Bash, Python, PowerShell, or similar languages
- Strong understanding of infrastructure as code (IaC) using Terraform, CloudFormation, or similar tools
- Hands-on experience with containerization (Docker) and orchestration platforms (Kubernetes)
- Knowledge of configuration management tools such as Ansible or Chef
- Experience with monitoring, logging, and observability tools like Prometheus, Grafana, ELK stack, or Splunk
- Fundamentals of network security, firewalls, VPNs, and security best practices
Preferred Skills:
- Familiarity with multi-cloud or hybrid cloud architectures
- Experience with automation of deployment pipelines and continuous integration/continuous deployment (CI/CD) tools such as Jenkins, GitLab CI, or Azure DevOps
- Knowledge of cloud security standards (e.g., CIS benchmarks, ISO 27001)
- Exposure to serverless technologies like AWS Lambda or Fargate
Overall Responsibilities
- Design, implement, and operate scalable, resilient, and secure cloud infrastructure supporting enterprise applications
- Automate deployment, configuration, and operational workflows to enable continuous delivery and operation
- Monitor system health, troubleshoot operational issues, and optimize performance across distributed environments
- Lead initiatives in system upgrades, migrations, and capacity planning to meet evolving business needs
- Collaborate with development, security, and operations teams to embed best practices and improve system observability
- Conduct vulnerability assessments, manage security controls, and coordinate regular penetration testing and audits
- Drive incident response, root cause analysis, and problem management activities
- Develop and maintain infrastructure documentation, runbooks, and operational standards
- Lead efforts to adopt industry best practices for reliability, security, and automation
Technical Skills (By Category)
Cloud Technologies & Infrastructure:
- Essential: AWS services (EC2, Lambda, S3, RDS, CloudFormation, CloudWatch)
- Preferred: GCP, Azure, multi-cloud management
Automation & Infrastructure as Code:
- Essential: Terraform, CloudFormation, Ansible, Chef
- Preferred: Cross-stack automation, compliance automation tools
Containerization & Orchestration:
- Essential: Docker, Kubernetes (k8s), Helm
- Preferred: Istio or other service mesh solutions
Scripting & Programming:
- Essential: Bash, Python, PowerShell
- Preferred: Automation frameworks, scripting for security and compliance tasks
Monitoring & Security:
- Essential: Prometheus, Grafana, ELK stack, CloudWatch
- Preferred: SIEM solutions, vulnerability management tools, security benchmarks
Experience Requirements
- 7+ years of hands-on experience in cloud operations, infrastructure automation, and site reliability engineering
- Proven track record managing high-availability, large-scale cloud environments in enterprise or regulated industries
- Demonstrated expertise in automation, scripting, and CI/CD pipeline development
- Experience performing incident and problem management, including root cause analysis and remediation
- Strong background with security standards, vulnerability mitigation, and compliance practices
- Prior experience with financial services infrastructure or banking environments is a plus; equivalent experience in other sectors is acceptable
Day-to-Day Activities
- Manage deployment and operational health of cloud infrastructure to ensure high availability and performance
- Automate onboarding, scaling, and configuration management for cloud resources and applications
- Monitor system performance, conduct incident response, and execute troubleshooting for outages or performance degradation
- Lead infrastructure buildouts, migrations, and upgrades with minimal disruption
- Implement security controls, manage vulnerabilities, and ensure GDPR, PCI DSS, or other compliance standards are met
- Collaborate closely with development teams on infrastructure requirements, security best practices, and automation strategies
- Maintain comprehensive documentation, runbooks, and operational procedures
- Identify opportunities for process automation and system reliability improvements
- Conduct regular reviews of system metrics, logs, and health dashboards, and adjust operations accordingly
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or related field
- Certifications such as AWS Certified Solutions Architect – Professional, AWS Certified DevOps Engineer, or equivalent are strongly preferred
- Proven experience in cloud infrastructure, automation, security, and high-availability system management
Professional Competencies
- Strong analytical, troubleshooting, and incident management skills
- Excellent collaboration and stakeholder communication abilities
- Demonstrated leadership in implementing automation and reliability best practices
- Ability to prioritize tasks effectively and work under tight deadlines
- Continuous learner with a growth mindset and adaptability to evolving technologies
- Proactive in adopting emerging tools and industry standards to improve system resilience and operational efficiency
SYNECHRON’S DIVERSITY & INCLUSION STATEMENT
Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture – promoting equality, diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger, successful businesses as a global company. We encourage applicants from diverse backgrounds, regardless of race, ethnicity, religion, age, marital status, gender, sexual orientation, or disability, to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more.
All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant’s gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law.
About Us
At Synechron, we believe in the power of digital to transform businesses for the better. Our global consulting firm combines creativity and innovative technology to deliver industry-leading digital solutions. Synechron’s progressive technologies and optimization strategies span end-to-end Artificial Intelligence, Consulting, Digital, Cloud & DevOps, Data, and Software Engineering, servicing an array of noteworthy financial services and technology firms. Through research and development initiatives in our FinLabs we develop solutions for modernization, from Artificial Intelligence and Blockchain to Data Science models, Digital Underwriting, mobile-first applications and more.
Over the last 20+ years, our company has been honored with multiple employer awards, recognizing our commitment to our talented teams. With top clients to boast about, Synechron has a global workforce of 14,500+, and has 58 offices in 21 countries within key global markets.
For more information on the company, please visit our website or LinkedIn community.
Sustainability and Health Safety Commitment
At Synechron, we are committed to integrating sustainability into our business strategy, ensuring responsible growth while minimizing environmental impact. Employees play a key role in driving our sustainability initiatives, from reducing our carbon footprint to fostering ethical and sustainable business practices across global operations. All positions are required to adhere to our Sustainability and Health Safety standards, demonstrating a commitment to environmental stewardship, workplace safety, and sustainable practices.