Senior Site Reliability Engineer
Regrello
Job Summary
Regrello is a fast-growing startup focused on automating supply chains, a market valued at $13 trillion annually. The company is building a global operating network and workflow engine for supply chain companies. The Senior Site Reliability Engineer role involves shaping the developer platform, working with customers, and architecting solutions for security and reliability. Projects include cloud-neutral infrastructure, GPU management for AI, and architecture enhancements. The company fosters a hybrid/remote culture and offers competitive compensation packages, including equity and health benefits. They are open to candidates across the US, Canada, and Mexico and provide US work authorization sponsorship.
Must Have
- Bachelor's degree in Computer Science or related field
- 4-8 years of experience in SRE, software engineering
- Strong understanding of SDLC and Agile methodologies
- Experience with CI/CD tools
- Proficiency in scripting languages
- Fluency with cloud platforms (AWS, Azure, GCP)
- Experience with Kubernetes
- Experience with feature flags
- Experience with modern backend technologies
- Experience with Go is strongly preferred
- Excellent problem-solving and communication skills
- Attention to detail
- Ability to work effectively in a remote team
Good to Have
- Ability to quickly learn new technologies
Perks & Benefits
- Equity
- Health insurance (medical, dental, vision)
- Generous paid time off
Job Description
- Bachelor’s degree in Computer Science or a related field.
- 4-8 years of experience in site reliability, software engineering, or a related role.
- Strong understanding of software development lifecycle (SDLC) and Agile methodologies.
- Experience with CI/CD tools such as Github Actions, GitLab CI, or CircleCI.
- Proficiency in scripting languages for automation tasks.
- Fluency with cloud platforms (AWS, Azure, GCP), kubernetes, feature flags, and modern backend technologies (experience with Go is strongly preferred, with the ability to quickly learn new technologies as needed).
- A builder's spirit (you have a track record of building projects for fun, staying updated with open-source developments, etc.)
- Excellent problem-solving and communications skills, and attention to detail, with the ability to work effectively in a remote team environment.
- Contribute to SRE-owned portions of application codebases related to infrastructure clients, SaaS clients, observability, and reliability patterns.
- Contribute to the developer platform interfaces to enable a growing number of engineers, microservices, and environments (helm charts, CI platform, and deploy processes).
- Advocate for new tools and processes that will help Regrello grow.
- Take part in on-call rotations.
- Collaborate with cross-functional teams, including Development, QA, Product Management, to ensure successful releases.
- GCP: GKE, CloudRun, Memorystore, CloudSQL, BigQuery
- Kubernetes: helm, helmfile
- Automation: Terraform, shell
- Queue: Temporal, Machinery, Celery
- Launchdarkly
- Otel / Prometheus / Grafana / Splunk
If you are passionate about improving software release processes and want to be part of a team that is transforming the supply chain industry, we would love to hear from you.