Staff Software Engineer, Site Reliability Engineer (SRE)

9 Minutes ago • 10 Years + • Devops • $250,000 PA - $290,000 PA

Job Summary

Job Description

As a Staff Software Engineer on the Site Reliability team at Harvey, you will ensure the reliability, scalability, and performance of our legal AI platform. This high-leverage role involves owning systems that keep the platform fast, secure, and always on. Your work will focus on scaling across 50+ regions and automating mission-critical operations to ensure Harvey remains resilient as it grows. The ideal candidate is passionate about building robust systems and reducing complexity through automation.
Must have:
  • Design, implement, and manage monitoring, alerting, and infrastructure resources across 50+ global regions.
  • Lead incident management processes, including postmortems, root cause analyses, and driving actionable improvements.
  • Automate operational tasks and workflows for capacity planning, graceful rollouts, and safe data access.
  • Establish best practices for security, compliance, and reliability across teams.
  • Optimize infrastructure costs through strategic capacity planning and build-versus-buy decisions.
  • Provide technical mentorship and leadership, promoting best practices and fostering team growth.
Perks:
  • Equity plan
  • Comprehensive health coverage
  • Dental coverage
  • Vision coverage
  • Retirement benefits (401k match up to 4%)
  • Flexible PTO
  • Relocation assistance

Job Details

Why Harvey

Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customized and developed by our expert team of lawyers, engineers and research scientists. We’ve found product market fit and are scaling our team very quickly. Some reasons to join Harvey are:

  • Exceptional product market fit: We have partnered with the largest law firms and professional service providers in the world, including Paul Weiss

, A&O Shearman

, Ashurst

, O'Melveny & Myers, PwC

, KKR, and many others.

from strategic investors including Sequoia, Google Ventures, Kleiner Perkins, and OpenAI.

  • World-class team: Harvey is hiring the best talent

from DeepMind, Google Brain, Stripe, FAIR, Tesla Autopilot, Glean, Superhuman, Figma, and more.

  • Partnerships: Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.
  • Performance: 4x ARR in 2024.
  • Competitive compensation.

Role Overview

As a Staff Software Engineer on the Site Reliability team at Harvey, you will ensure the reliability, scalability, and performance of our legal AI platform. You’ll join a high-leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50+ regions to automating mission-critical operations, your work will ensure that Harvey remains resilient as we grow. If you’re passionate about building robust systems and reducing complexity through automation, we’d love to work with you.

This role is based in San Francisco, CA. We use an in-person work model and offer relocation assistance to new employees.

What You’ll Do

  • Design, implement, and manage monitoring, alerting, and infrastructure resources (compute, storage, networking) across 50+ global regions
  • Lead incident management processes, including postmortems, root cause analyses, and driving actionable improvements
  • Automate operational tasks and workflows, building tools and processes for capacity planning, graceful rollouts, and safe data access to maintain high reliability and reduce manual intervention
  • Establish best practices for security, compliance, and reliability and collaborate across teams to drive these principles throughout the software lifecycle
  • Optimize infrastructure costs through strategic capacity planning and build-versus-buy decisions while maintaining system performance, reliability, and functionality
  • Provide technical mentorship and leadership, promoting best practices and fostering team growth

What You Have

  • 10+ years of experience in Site Reliability Engineering or similar roles supporting production environments, with proven ability to mentor and guide technical teams
  • Expertise in infrastructure as code(IaC) tools (Pulumi, Terraform, CloudFormation, etc.)
  • Deep familiarity with observability tools (Datadog, Sentry, etc.) and incident response practices (PagerDuty, IncidentIO, etc.)
  • Proficiency with cloud infrastructure platforms (Azure, GCP, AWS, etc.)
  • Strong programming skills (Python, Bash, Go, or similar languages)
  • Proven track record of diagnosing complex system problems and implementing durable solutions
  • Solid understanding of CI/CD, Kubernetes, containerization, networking, databases, and cloud security principles
  • Excellent problem-solving skills, meticulous attention to detail, and a commitment to operational excellence

Please find our CA applicant privacy notice here

.

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing interview-help@harvey.ai

.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in San Francisco, California, United States

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

San Francisco, California, United States (On-Site)

Sydney, New South Wales, Australia (Hybrid)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Harvey

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug