Senior Site Reliability Engineer

2 Months ago β€’ Upto 10 Years β€’ DevOps

About the job

Job Description

We're seeking a Senior Site Reliability Engineer to design, implement, and optimize AWS systems for high availability and scalability. Must have experience with AWS, Terraform, monitoring, and incident response. Strong understanding of cloud-native development practices and DevOps principles is a must.
Must have:
  • AWS Experience
  • Terraform IaC
  • Monitoring Tools
  • Incident Response
Good to have:
  • Database Design
  • Data Integration
  • Jenkins CI/CD
  • Splunk/ELK
Perks:
  • Remote Work
  • Part-time Option
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

About the job

Are you a Site Reliability Engineer working at a Large Financial Institution and being told by your leadership that you are too hands on or detail oriented or think and work like a start-up?


Imagine working at Intellibus to engineer platforms that impact billions of lives around the world. With your passion and focus we will accomplish great things together!


We are looking forward to you joining our Platform Engineering Team.


Our Platform Engineering Team is working to solve the Multiplicity Problem. We are trusted by some of the most reputable and established FinTech Firms. Recently, our team has spearheaded the Conversion & Go Live of apps which support the backbone of the Financial Trading Industry.


We are looking for Engineers who can

  • Design, implement, and optimize systems on AWS to ensure high availability, scalability, and fault tolerance.
  • Monitor, analyze, and respond to performance and reliability issues to maintain system health and performance.
  • Develop and maintain infrastructure as code (IaC) templates using Terraform to ensure consistent and repeatable deployments across different environments.
  • Collaborate with development teams to establish and implement best practices for monitoring, logging, and observability in a cloud-native environment.
  • Participate in the on-call rotation to provide prompt response to incidents, perform root cause analysis, and contribute to post-incident reviews.
  • Work closely with security teams to implement and maintain security best practices and compliance requirements in the AWS environment.
  • Automate operational processes and routine tasks to improve efficiency, reduce manual intervention, and minimize the risk of errors.
  • Contribute to capacity planning and performance optimization efforts to accommodate growing workloads and user demands.
  • Stay with industry trends and emerging technologies, and provide recommendations for adopting new tools and practices to improve operations.
  • Collaborate with cross-functional teams to improve the end-to-end software development and delivery pipeline.
  • Document work thoroughly, create a roadmap with milestones, and prioritize tasks in Jira.


We work closely with

  • AWS S3
  • Database Design
  • Data Integration
  • Jenkins
  • Splunk / ELK
  • Amazon VPC
  • Datadog New Relic / Wavefront
  • PCF
  • CI/CD
  • ECS
  • Lambda


Our Process

  • Schedule a 15 min Video Call with someone from our Team
  • 4 Proctored GQ Tests (< 2 hours)
  • 30-45 min Final Video Interview
  • Receive Job Offer


If you are interested in reaching out to us, please apply and our team will contact you within the hour.



View Full Job Description

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug