Site Reliability Engineer, GovCloud 24x7

1 Month ago • 5 Years + • Devops • $143,300 PA - $197,000 PA

Job Summary

Job Description

Salesforce is looking for a Site Reliability Engineer to join their GovCloud team in Denver, Colorado. This role is crucial for maintaining 99.99% uptime for customer-facing services and ensuring data security. The engineer will be part of the GovCloud Incident Response team, handling alert responses, smart hands support, and incident management. Responsibilities include proactive monitoring, participating in incident reviews and root cause analyses, collaborating with technical staff, automating issue detection and resolution, and improving operational processes. The position requires shift work, including night shifts, as part of a 24/7 support team. This role demands a proactive approach to problem-solving and continuous improvement in a fast-paced environment.
Must have:
  • 5+ years systems engineering experience
  • Expertise in TCP/IP technologies
  • Expertise in Unix CLI support (Linux/Solaris)
  • Strong understanding of monitoring security
  • Experience with AWS/C2S infrastructure
  • Proficiency in scripting (Python, Go)
  • Strong communication skills
  • Incident Management experience
  • Ability to participate in 24/7 on-call rotation
Good to have:
  • Prior experience with Chef/Puppet
  • Prior experience with Jenkins/Bamboo/Spinnaker
  • Experience supporting monitoring/alert systems
  • Experience supporting Java applications
  • Hands-on experience with AWS CLI/SDKs
  • Linux+, Red Hat, AWS certifications
  • Experience with Kubernetes
  • Familiarity with Agile/DevOps
  • Experience with blameless retrospectives
  • Knowledge of resilience engineering
  • Experience with AI/ML operational tools
Perks:
  • Wellbeing reimbursement
  • Generous parental leave
  • Adoption assistance
  • Fertility benefits
  • Medical, dental, vision, mental health support
  • Life and disability insurance
  • 401(k)
  • Employee stock purchasing program

Job Details

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category

Software Engineering

About Salesforce

We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good – you’ve come to the right place.

This candidate must be a U.S. citizen (U.S. born or naturalized) operating on U.S. soil, must not hold dual citizenship, and must be able to meet the customer and government screening standards applicable to this role.

Applications will be accepted until 08/27/2025.

This position requires onsite presence in the Denver, Colorado office.

Are you passionate about ensuring the reliability and performance of mission-critical cloud services? Salesforce is seeking a talented Site Reliability Engineer to join our dynamic team in our Denver, CO, location, supporting our GovCloud environment. As a key member of our Site Reliability organization, you'll play a vital role in maintaining 99.99% uptime for customer-facing services, proactively addressing issues, and ensuring the security of our data. We foster a collaborative and innovative culture, where you’ll work alongside skilled engineers to solve complex problems and drive continuous improvement.

Please Note: This position requires a successful background investigation and the ability to obtain and maintain a specific level of U.S. government background clearance. Details will be provided during the interview process.
 

Shift Requirements: This role involves shift work, including night shifts, as part of a 24/7 support team. We provide a rotating schedule and ensure adequate compensation for shift differentials.

About the Role:

The Site Reliability team at Salesforce is the backbone of our cloud operations, working around the clock to keep our services available and our customers protected. You will be a crucial part of the GovCloud Incident Response (GIR) team, which maintains the current infrastructure through day-to-day alert response, smart hands support, and comprehensive incident management, including retrospectives and long-term remediation.

Your Responsibilities:

  • Ensure 99.99% uptime for customer-facing services by proactively monitoring and maintaining the health of supporting systems, contributing directly to customer satisfaction and trust.

  • Act in key support roles during major incidents (e.g., Sev0, Sev1) and participate in technical incident reviews for problem management.

  • Contribute to Problem Management by populating and participating in Root Cause Analyses (RCAs) and handing them off to the Global Solutions team.

  • Ensure all work carried out by the Site Reliability team aligns with the company’s internal compliance policies and directives.

  • Collaborate with technical staff to solve complex technical issues and customer concerns.

  • Lead and mentor other team members in staying abreast of industry innovations and technologies, and support the team’s development and growth.

  • Thrive in a fast-paced environment, solving sophisticated issues quickly and successfully balancing multiple priorities.

  • Automate the detection and resolution of recurring issues in the production environment.

  • Help create and improve current processes to reduce operational and engineering toil, including the implementation of AI-driven automation for routine tasks.

Requirements:

  • A related technical degree is required.

  • 5+ years of systems engineering experience in an enterprise-scale internet service engineering or support role.

Required Technical Skills:

  • Expertise in TCP/IP related technologies (networking protocols, network programming, etc.).

  • Expertise in CLI enterprise support of Unix variants (Linux/Solaris/BSD), with significant exposure to Red Hat Enterprise Linux and Solaris.

  • Strong understanding of monitoring security systems and administration.

  • Experience provisioning, operating, and running AWS/C2S based infrastructure and systems.

  • Proficiency in scripting with Python, Go, or other languages.

  • Communication: Strong written and oral communication skills.

  • Incident Management: Past experience in Incident Management and a good understanding of ITIL service operations.

  • Availability: Ability to participate in a 24/7 on-call rotation supporting large data center operations and be available for shift work.

Preferred Qualifications:

  • Prior experience with Chef/Puppet or automated deployment. (This helps streamline our infrastructure management.)

  • Prior experience with Jenkins/Bamboo/Spinnaker pipeline execution. (This aids in our continuous integration and deployment processes.)

  • Experience supporting and maintaining monitoring and alert systems. (Ensures proactive issue detection.)

  • Experience supporting and maintaining Java applications. (Supports our application stack.)

  • Hands-on experience configuring and running AWS (Amazon Web Services) using the CLI/SDKs. (Essential for our cloud infrastructure.)

  • Certifications in Linux+, Red Hat, and AWS. (Validates technical expertise.)

  • Experience supporting and leading Kubernetes-based applications and services. (Supports our containerized environment.)

  • Familiarity with Agile Process and DevOps practices. (Enables efficient workflow and collaboration.)

  • Experience participating in blameless retrospectives, learning from incidents, and conducting post-incident investigations, with an interest in how AI can assist in root cause analysis and pattern identification. (Promotes a culture of continuous improvement.)

  • Working knowledge of and interest in resilience engineering, including concepts such as Safety II and proactive problem prevention, leveraging AI for proactive risk identification and system optimization. (Enhances system reliability.)

  • Experience with AI/ML concepts and tools for operational insights, predictive maintenance, or intelligent automation.

  • Familiarity with data analysis and visualization tools to interpret AI-generated insights.

This candidate must be a U.S. citizen (U.S. born or naturalized) who does not hold dual citizenship and agrees to complete a U.S. federal government Minimum Background Investigation (MBI) for a Moderate Public Trust position. Due to the citizenship requirements for this role, which supports U.S. federal, state, and/or local government customers, citizenship will be verified through two of the following REAL ID Act documents: U.S. Passport, Passport Card, REAL ID Driver’s License, Global Entry Card, U.S. Government CAC/PIV. You also agree to gain other clearances as deemed appropriate for the role.

Benefits & Perks
Check out our benefits site, which explains our various benefits, including wellbeing reimbursement, generous parental leave, adoption assistance, fertility benefits, and more.

Salesforce Information
Check out our Salesforce Engineering Site.

Accommodations

If you require assistance applying for open positions due to a disability, please submit a request via the Accommodations Request Form.

Posting Statement

Salesforce is an equal opportunity employer and maintains a policy of non-discrimination with all employees and applicants for employment. What does that mean exactly? It means that at Salesforce, we believe in equality for all. And we believe we can lead the path to equality in part by creating a workplace that’s inclusive, and free from discrimination. Know your rights: workplace discrimination is illegal. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications – without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.

In the United States, compensation offered will be determined by factors such as location, job level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock purchasing program. More details about company benefits can be found at the following link: https://www.salesforcebenefits.com.

For Colorado-based roles, the base salary hiring range for this position is $143,300 to $197,000.
