About the job
Acoustic is seeking a skilled and seasoned Senior Site Reliability Engineer to join our SRE team. We believe that the ideal candidate will bring innovative ideas and implement preventative measures to minimize downtime. This position is perfect for someone enthusiastic about technology and eager to contribute to the growth and success of our organization.
Key Responsibilities
- Lead major incident calls and provide solutions to the team.
- Collaborate with our SRE teams to provide early detection and response.
- Provide automated solutions for our application problems.
- Collaborate with our Engineering team to understand our products and features.
- Participate in team on-call rotation.
Requirements
- 5-8 years experience
- Strong Communication Skills
- Coding proficiency in one or more of the following languages with the ability to quickly learn new languages:
- Strong automation experience in AWS (preferred) or other Cloud Providers
- Worked with at least one of the automation tools such as
- uppet, Chef, Ansible, or Terraform
- Strong Java application debugging and troubleshooting skills such as looking at thread and heap dumps, performance tuning.
- Experience working with distributed systems and at least one of the following databases:
- racle, DB2, MySQL, PostgresDB, MSSQL
- In-depth knowledge of monitoring & log aggregation and o11y tools such as:
- ataDog, New Relic, LGTM, OpenTelemetry and other open source tools.
- Experience with deploying and managing
- ubernetes, Kafka, Open Search, Hbase, MongoDB, or their variants.
- Experience with queueing stack such as:
- ctiveMQ, RabbitMQ, or their variants
- Experience with CICD pipelines and work with at least one of the following tools:
- rtifactory, GitHub, Jenkins, CloudBees, Octopus, and other tools
- Ability to document work for the benefit of the team
Nice to have
- Experience with Snowflake and Looker