Senior Site Reliability Engineer (Senior SRE)

26 Minutes ago • 14 Years +

Job Summary

Job Description

As a Senior Site Reliability Engineer (SRE) at Wind River, you will be responsible for deploying, managing, and scaling highly available, secure, and resilient software services across multi-cloud (AWS, Azure, GCP) and on-premises environments. You will collaborate with developers, architects, and operations teams to improve system reliability, automation, security, and platform performance. Responsibilities include managing Kubernetes clusters, cloud infrastructure, monitoring, CI/CD pipelines, security, compliance, and cost optimization, and you will communicate complex infrastructure concepts and strategies.
Must have:
  • Extensive experience in Kubernetes and container orchestration.
  • Hands-on experience with cloud platforms (AWS, Azure, GCP).
  • Proficiency in scripting and automation languages (Python, Bash, Go).
  • Strong knowledge of CI/CD tools and pipeline design.
  • 14+ years of experience as a Site Reliability Engineer
Good to have:
  • Certifications in Kubernetes (CKA/CKAD/CKS), AWS, Azure, or GCP.
  • Familiarity with multi-cloud management tools and strategies.
  • Background in software development or software infrastructure management.

Job Details

Description

Position at Wind River

Job Title: Senior Site Reliability Engineer (Senior SRE) 
ABOUT WIND RIVER Wind River is a global leader delivering software for mission-critical intelligent systems. For over four decades, Wind River has powered billions of systems requiring the highest levels of security, safety, and reliability. Our software supports groundbreaking NASA missions such as Artemis I, the James Webb Space Telescope, multiple Mars rovers, and pioneering 5G initiatives. 
ABOUT THE OPPORTUNITY Wind River Systems is seeking a Senior Site Reliability Engineer (SRE) experienced in deploying, managing, and scaling highly available, secure, and resilient software services across multi-cloud (AWS, Azure, GCP) and on-premises environments. You will collaborate closely with developers, architects, and operations teams to enhance system reliability, automation, security, and overall platform performance. 
RESPONSIBILITIES 
Kubernetes and Container Orchestration: 
  • Deploy, manage, optimize, and troubleshoot large-scale Kubernetes clusters in multi-cloud (AWS, Azure, GCP) and hybrid environments (OpenStack, VMware vSphere). 
  • Implement cluster autoscaling and resource management strategies with tools such as Karpenter. 
Cloud and Hybrid Infrastructure Management: 
  • Architect, implement, and manage infrastructure in multi-cloud (AWS, GCP, Azure) and hybrid environments. 
  • Optimize cloud resource usage leveraging AWS Cost Explorer, Savings Plans, and similar tools on other cloud providers. 
Monitoring, Observability, and Reliability: 
  • Develop and maintain comprehensive monitoring, logging, tracing, and alerting solutions using Prometheus, Grafana, CloudWatch, Datadog, or similar tools. 
  • Conduct root cause analysis (RCA) and implement proactive improvements to maximize system uptime, reliability, and performance. 
CI/CD Pipelines and Automation: 
  • Design, implement, and maintain robust CI/CD pipelines using Jenkins, GitLab CI/CD, GitHub Actions, or Tekton. 
  • Promote and implement DevSecOps best practices across teams to automate testing, security scanning, and deployments. 
Security, Compliance, and Governance: 
  • Integrate comprehensive security practices throughout the software lifecycle (DevSecOps), including vulnerability scanning and secure coding practices. 
  • Manage secrets securely using Vault, AWS Secrets Manager, Azure Key Vault, or similar tools. 
  • Ensure adherence to compliance standards and regulatory requirements. 
Cost Optimization and Efficiency: 
  • Implement and enforce governance policies and frameworks to optimize infrastructure usage, reduce costs, and enhance operational efficiency. 
  • Regularly review and optimize cloud expenditure, performance, and scaling strategies. 
Collaboration and Communication: 
  • Collaborate closely with architects, developers, QA, product teams, and management stakeholders. 
  • Clearly communicate complex infrastructure concepts and strategies to diverse stakeholders. 
QUALIFICATIONS 
  • Bachelor's degree in Computer Science, Information Technology, or related technical discipline (Master’s preferred). 
  • 14+ years of experience as a Site Reliability Engineer, DevOps Engineer, Platform Engineer, or similar role. 
  • Extensive expertise in Kubernetes, container orchestration, and related ecosystem. 
  • Hands-on experience with cloud platforms (AWS, Azure, GCP), OpenStack, VMware vSphere, and hybrid environments. 
  • Proficiency in scripting and automation languages (Python, Bash, Go, or similar). 
  • Solid experience with infrastructure as code (Terraform, CloudFormation, Pulumi). 
  • Strong knowledge of CI/CD tools and pipeline design (Jenkins, GitLab CI/CD, GitHub Actions, Tekton). 
  • Exceptional troubleshooting and problem-solving skills, coupled with a proactive and continuous learning mindset. 
PREFERRED QUALIFICATIONS 
  • Certifications in Kubernetes (CKA/CKAD/CKS), AWS (Solutions Architect, DevOps Engineer), Azure, or GCP. 
  • Familiarity with multi-cloud management tools and strategies. 
  • Background in software development or software infrastructure management. 
Join our team at Wind River, contribute to building highly reliable, secure, and innovative software systems, and help shape the future of software-defined environments! 
  

Similar Jobs

CloudLinux - Database Administrator (ClickHouse)

CloudLinux

Tbilisi, Tbilisi, Georgia (Remote)
1 Month ago
Aspire - Senior Security Operations Center (SOC) Engineer

Aspire

Gurugram, India (Hybrid)
2 Weeks ago
WaveApps - Technical Project Manager

WaveApps

(Remote)
1 Month ago
The Walt Disney Company - Manager, Database Reliability Engineering

The Walt Disney Company

Washington, United States (On-Site)
1 Month ago
Nightfall - Staff Software Engineer

Nightfall

San Francisco, California, United States (Hybrid)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

London stock Exchange - Full Stack Software Engineer- Typescript

London stock Exchange

Bucharest, Bucharest, Romania (On-Site)
1 Day ago
Sony Interactive Entertainment - Senior Cloud Security Engineer

Sony Interactive Entertainment

Tokyo, Japan (On-Site)
5 Months ago
Shyft Labs - Data Engineer

Shyft Labs

Noida, Uttar Pradesh, India (Hybrid)
1 Month ago
Riot Games - Manager, Software Engineering - Infrastructure / Cloud Foundations

Riot Games

Los Angeles, California, United States (On-Site)
4 Months ago
Experian - Lead of Engineering

Experian

Cyberjaya, Selangor, Malaysia (On-Site)
1 Month ago
Aryaka - Senior Sales Engineer

Aryaka

(Remote)
2 Months ago
Enverus - Staff Software Engineer

Enverus

Calgary, Alberta, Canada (On-Site)
2 Weeks ago
BigID - Service Delivery Engineer

BigID

(Remote)
2 Weeks ago
neural concept - ML Application Infrastructure Engineer

neural concept

Warsaw, Masovian Voivodeship, Poland (Hybrid)
1 Week ago
Western Digital - Intern 3, Non-Engineering

Western Digital

Bengaluru, Karnataka, India (On-Site)
6 Days ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

Zscaler - Staff Software Development Engineer - DevOps

Zscaler

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Entrata - Software Engineer

Entrata

Pune, Maharashtra, India (Hybrid)
7 Months ago
Studio Image Works - Motion Graphic Artist

Studio Image Works

Gurugram, Haryana, India (On-Site)
1 Year ago
PwC - Senior Associate-SAP FICO-TC

PwC

Kolkata, West Bengal, India (On-Site)
7 Months ago
Qualcomm - Engineer, Staff -Linux

Qualcomm

Hyderabad, Telangana, India (On-Site)
1 Week ago
Nagarro - Senior Engineer

Nagarro

Hyderabad, Telangana, India (On-Site)
6 Months ago
Cyara - Senior Software Development Engineer in Test (SDET)

Cyara

Hyderabad, Telangana, India (Hybrid)
8 Months ago
Capgemini - Power Platform Architect

Capgemini

Mumbai, Maharashtra, India (On-Site)
1 Week ago
Zscaler - Assistant Manager, Finance Transformation (FP&A)

Zscaler

Sahibzada Ajit Singh Nagar, Punjab, India (Hybrid)
2 Weeks ago
Wind River Systems - Member of Technical Staff

Wind River Systems

Bengaluru, Karnataka, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Wind River is a global leader in delivering software for mission-critical intelligent systems. For 40 years, the company has been an innovator and pioneer, powering billions of devices and systems that require the highest levels of security, safety, and reliability. Wind River software and expertise are accelerating digital transformation across industries, including automotive, aerospace, defense, industrial, medical, and telecommunications. The company offers a comprehensive portfolio supported by world-class professional services and support and a broad partner ecosystem. To learn more, visit Wind River at www.windriver.com.

Ottawa, Ontario, Canada (Hybrid)

Bengaluru, Karnataka, India (On-Site)

Ottawa, Ontario, Canada (On-Site)

Tokyo, Japan (On-Site)

San José Province, Costa Rica (On-Site)

Troy, Michigan, United States (On-Site)

Alameda, California, United States (Hybrid)

San José Province, Costa Rica (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Wind River

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug