The K8S team runs the container orchestration infrastructure and platform services for the company. In addition to the K8s clusters, the team maintains the edge infrastructure of proxies between the internet feed and the cluster ingresses.
As a Senior DevOps Engineer, you will be working on both Kubernetes-based solutions and the current stack of nginx based, providing support for production-grade environments. You will participate in the evolution of the K8s environment as well as in the design and implementation of the new edge solution that will be based on up-to-date industry standards using an envoy-based product.
Responsibilities:
- You will be implementing features in the areas described above as part of ongoing development or in response to our user community requests.
- You will participate in supporting the team’s managed infrastructure and code base, including troubleshooting of business-critical systems.
The ideal candidate combines deep knowledge of deploying K8s workloads in an AWS EKS environment with the use of proxies for complex request routing and manipulation applications.
Must Haves (Required Skills):
- Hands-on experience with microservices architectures and specifically discovery, service-to-service communication patterns, and high availability deployments on Kubernetes clusters.
- Mid to high-level experience with at least one programming/scripting language, such as Python or Bash, in an enterprise environment.
- Experience with web proxies such as nginx, envoy, and OpenResty Lua for advanced request routing and manipulation.
- Hands-on experience with AWS cloud infrastructure and ability to work with components such as VPC, IAM, EC2, ELB, Route53, KMS, Cloudwatch, S3, RDS, DynamoDB.
- Hands-on experience with a Container orchestration platform in AWS, primarily EKS.
- Hands-on experience with infrastructure provisioning and continuous deployment with tools such as Terraform and ArgoCD.
- Working knowledge of DevOps/GitOps and best practices for modern DevOps operations and software development.
- Experience with security best practices related to infrastructure and platform design.
Nice-to-have Skills:
- Experience with ECS.
- Prometheus Stack for monitoring and alerting.
- ELK (Elasticsearch, Logstash, and Kibana) for logging and backing stores.
- Grafana for visualizing Prometheus metrics.