Staff Site Reliability Engineer

2 Months ago • 8-10 Years • DevOps

About the job

Job Description

Moveworks seeks a Staff Site Reliability Engineer to architect and manage its AI cloud infrastructure. The role calls for 8+ years of experience in software engineering or SRE with significant Python/Go work, experience managing AWS workloads, and familiarity with Jenkins, Terraform, Ansible, and Helm.
Must have:
  • Python / Go
  • AWS workloads
  • Jenkins/Terraform
  • Ansible/Helm
Good to have:
  • Azure experience
  • Distributed systems
  • Blue/green deployments
  • Canary environments
Perks:
  • Award-winning product
  • Innovative team

Who We Are

Moveworks is the universal AI copilot for search and automation across all your business applications. We give employees one place to go to find information and get support while reducing costs for your business. The Moveworks Copilot is powered by an industry-leading Reasoning Engine that uses a combination of public and proprietary language models to understand employee queries, then build and execute multi-step plans that achieve them. It does this by linking into systems (like ITSM, HRIS, ERP, identity management, and more) with native and custom-built integrations that turn natural language into powerful automations for employees.

The world’s most innovative brands like Databricks, Broadcom, Hearst, and Palo Alto Networks trust Moveworks to eliminate repetitive support issues, deliver instant knowledge, and empower employees to work faster across applications.

Founded in 2016, Moveworks has raised $315 million in funding, at a valuation of $2.1 billion, thanks to our award-winning product and team. In 2023, we were included in the Forbes Cloud 100 list as well as the Forbes AI 50 for the fifth consecutive year. We were also recognized by the 2023 Edison Awards for AI Optimized Productivity, and were included on Fast Company's Most Innovative Companies list for 2024!

Moveworks has over 500 employees in six offices around the world, and is backed by some of the world's most prominent investors, including Kleiner Perkins, Lightspeed, Bain Capital Ventures, Sapphire Ventures, Iconiq, and more.

Come join one of the most innovative teams on the planet!


About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, distributed, fault-tolerant systems.


We are looking for a passionate Staff Site Reliability Engineer to join our team. As the founding member of the SRE team at Moveworks in Bengaluru, you will be responsible for architecting and managing Moveworks' AI cloud infrastructure and strategy. As Moveworks grows, this team will design and operate resilient, secure cloud infrastructure that lets our products run reliably and our engineering teams build and release customer-facing features rapidly.


You will work closely with platform, infrastructure, machine learning, search, data, DevOps, and frontend teams, and build systems that allow these teams to ship high-quality software at a rapid pace. This might include building or improving CI/CD pipelines, supporting blue/green deployments, creating and managing canary environments, and minimizing the likelihood of bad code reaching production.


Responsibilities

  • Improve observability and reliability of Moveworks systems by managing/building monitoring and alerting infrastructure.
  • Improve debuggability: build and manage systems that help debug issues in production and analyze performance.
  • Architect, design, and execute projects to improve the reliability of our applications and systems.
  • Be a technical lead for the adjacent teams in Bengaluru.


Minimum qualifications:

  • Bachelor’s degree in Computer Science or a related field.
  • 8+ years of experience in software engineering / SRE with significant experience in Python / Go.
  • 2+ years of experience leading projects and designing, analyzing, and troubleshooting distributed systems.
  • Experience in managing/building infrastructure systems for deployment and management of workloads in AWS. Familiarity with Jenkins, Terraform, Ansible, Helm, etc.
  • Experience with Azure would be a plus.


About The Company

Bengaluru, Karnataka, India (On-Site)

