Site Reliability Engineer (SRE) - Kubernetes

2 Months ago • 5 Years + • DevOps

Job Summary

Job Description

This Site Reliability Engineer (SRE) role focuses on Kubernetes and requires 5+ years of experience. Responsibilities include designing, deploying, and managing Kubernetes clusters; automating infrastructure using Terraform, Helm, and Ansible; ensuring high availability and security of distributed systems; implementing CI/CD pipelines; developing observability solutions with Prometheus, Grafana, etc.; improving incident management; optimizing cloud resource utilization; collaborating with development teams; and ensuring security and compliance within Kubernetes. The ideal candidate will have strong expertise in Kubernetes administration, containerization, IaC, scripting, monitoring tools, networking, cloud infrastructure (AWS, GCP, or Azure), CI/CD tools, and security best practices.
Must have:
  • Kubernetes expertise
  • Infrastructure automation
  • Cloud experience (AWS, GCP, Azure)
  • CI/CD pipeline implementation
  • Observability solutions
  • Incident management
  • Security and compliance
Good to have:
  • Multi-cluster Kubernetes deployments
  • Serverless architectures
  • FinTech/Healthcare/SaaS experience
  • Database administration
  • Relevant certifications (CKA, AWS DevOps, GCP DevOps)
Perks:
  • Competitive salary and benefits
  • Cutting-edge technologies
  • Collaborative work environment
  • Career growth opportunities
  • Flexible work schedule (with remote/hybrid options implied)

Job Details

Job Title: Site Reliability Engineer (SRE) - Kubernetes
Location: Austin, Texas (Onsite)
Experience: 5+ years

Job Summary:
We are seeking a highly skilled Site Reliability Engineer (SRE) with expertise in Kubernetes to join our team in Austin, Texas. The ideal candidate will be responsible for maintaining and improving the reliability, scalability, and performance of our cloud infrastructure. You will collaborate with development and operations teams to implement best practices for automation, monitoring, and incident response.



Key Responsibilities:
  • Design, deploy, and manage Kubernetes clusters in production environments.
  • Automate infrastructure provisioning, deployment, and monitoring using tools like Terraform, Helm, and Ansible.
  • Ensure high availability, security, and reliability of distributed systems running on AWS, GCP, or Azure.
  • Implement and maintain CI/CD pipelines to streamline deployments and reduce manual intervention.
  • Develop observability solutions using Prometheus, Grafana, ELK Stack, or Datadog.
  • Improve incident management processes, conduct post-mortems, and implement corrective actions to prevent recurrence.
  • Optimize cloud resource utilization and implement cost management strategies.
  • Work closely with development teams to enhance application performance and enable DevOps culture.
  • Ensure security and compliance by implementing RBAC, network policies, and secrets management in Kubernetes.
  • ​Troubleshoot and resolve performance bottlenecks, network issues, and infrastructure failures.


Requirements

  • 5+ years of experience as an SRE, DevOps Engineer, or Kubernetes Engineer.
  • Hands-on experience with Kubernetes administration, deployment strategies (Helm, Kustomize), and service meshes (Istio, Linkerd).
  • Strong knowledge of containerization technologies like Docker.
  • Experience with Infrastructure as Code (IaC) using Terraform, Pulumi, or CloudFormation.
  • Proficiency in scripting and automation (Python, Bash, Go, or PowerShell).
  • Experience with observability & monitoring tools (Prometheus, Grafana, ELK Stack, Datadog, New Relic).
  • Strong understanding of networking, DNS, Load Balancers (NGINX, Traefik, HAProxy).
  • Experience managing cloud infrastructure in AWS, GCP, or Azure.
  • Working knowledge of CI/CD tools such as Jenkins, GitHub Actions, ArgoCD, or Flux.
  • Familiarity with security best practices, including RBAC, IAM, SSO, and certificate management.
  • Experience with disaster recovery and incident response processes.


Preferred Skills:

  • Experience with multi-cluster Kubernetes deployments.
  • Familiarity with serverless architectures (AWS Lambda, Google Cloud Functions).
  • Experience working in FinTech, Healthcare, or SaaS environments.
  • Knowledge of database administration for PostgreSQL, MySQL, or NoSQL databases.
  • Certifications such as CKA (Certified Kubernetes Administrator), AWS Certified DevOps Engineer, or Google Professional Cloud DevOps Engineer.

Technical Skills:

Kubernetes, Docker, Terraform, Helm, Ansible, AWS, GCP, Azure, Prometheus, Grafana, ELK Stack, Datadog, Jenkins, GitHub Actions, ArgoCD, Flux, Python, Bash, Go, PowerShell, Istio, Linkerd, NGINX, Traefik, HAProxy, RBAC, IAM, SSO, CloudFormation, Pulumi, PostgreSQL, MySQL, NoSQL, serverless architectures.





Benefits

Why Join Us?

  • Competitive salary and benefits package.
  • Opportunity to work with cutting-edge cloud-native technologies.
  • Collaborative and dynamic work environment.
  • Career growth and learning opportunities with access to training and certifications.
  • ​Flexible work schedule with remote/hybrid options.

How to Apply:

If you are passionate about Kubernetes, cloud infrastructure, and automation, and looking for an exciting opportunity in Austin, Texas, apply now by sending your resume to neetu.machhaar@ajmerainfotech.com


Note: Sponsorship is not available for this role. Candidates must be authorized to work in the U.S. without employer sponsorship.


Similar Jobs

Velotio Technologies - Senior DevOps Engineer (AWS)

Velotio Technologies

Pune, Maharashtra, India (Remote)
5 Days ago
Plummy games - Full Stack Lead/Architect

Plummy games

Chișinău, Chisinau, Moldova (Remote)
5 Days ago
Egnyte - Jr. Software Engineer - Node.js, Python

Egnyte

India (Remote)
1 Month ago
PlayStation Global - Staff Software Engineer (Cloud Services / Distributed Systems)

PlayStation Global

Aliso Viejo, California, United States (On-Site)
4 Months ago
PwC - IN-Senior Associate_ML Engineer_Data and Analytics_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Activision - Software Development Intern

Activision

Shanghai, Shanghai, China (On-Site)
2 Weeks ago
Avathon - DevOps Engineer

Avathon

Bengaluru, Karnataka, India (On-Site)
5 Months ago
NVIDIA - Senior Site Reliability Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
6 Days ago
Quizizz - Platform Engineer

Quizizz

Bengaluru, Karnataka, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ARHS - Fullstack Developer

ARHS

Liège, Wallonia, Belgium (On-Site)
5 Months ago
Ness Digital - Lead .Net Full-stack Engineer

Ness Digital

Timișoara, Timiș, Romania (Remote)
6 Days ago
Ubisoft - Senior Software Engineer - AI Applications

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
2 Weeks ago
prizepicks - Engineering Manager — Data Science Engineering

prizepicks

Atlanta, Georgia, United States (Remote)
1 Week ago
Canva - Staff Software Engineer - Data Platform

Canva

Brisbane, Queensland, Australia (Remote)
1 Week ago
ByteDance - Software Engineer (Applied Machine Learning - Enterprise)

ByteDance

San Jose, California, United States (On-Site)
6 Days ago
Zones - Cloud Engineer

Zones

Mumbai, Maharashtra, India (On-Site)
3 Months ago
Epic Games - Security Programmer - Backend (Asset Integrity)

Epic Games

Montreal, Quebec, Canada (On-Site)
1 Week ago
PwC - Manager_ Cloud Architecture _ Advisory corporate _ Advisory _ Hyderabad

PwC

Hyderabad, Telangana, India (On-Site)
4 Months ago
NVIDIA - Senior HPC AI Cluster Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Austin, Texas, United States

Onward Search - Producer III

Onward Search

Los Angeles, California, United States (Hybrid)
1 Month ago
Interactive Brokers - Senior Software Engineer

Interactive Brokers

Greenwich, Connecticut, United States (On-Site)
5 Months ago
Nintendo - Experiential Marketing Specialist

Nintendo

Redmond, Washington, United States (Hybrid)
3 Months ago
ByteDance - Cloud Site Reliability Engineer

ByteDance

Seattle, Washington, United States (On-Site)
1 Week ago
Universal Music - Senior Financial Analyst, Global Finance

Universal Music

Santa Monica, California, United States (On-Site)
1 Month ago
ZeniMax Media - Senior Gameplay AI Engineer

ZeniMax Media

Cockeysville, Maryland, United States (Remote)
6 Months ago
Revolgy - L2 Cloud Operations Engineer

Revolgy

Georgia, United States (Remote)
5 Days ago
Meetelise - AI Operations Specialist - Housing

Meetelise

New York, New York, United States (On-Site)
5 Months ago
Riot Games - VFX Artist II - VALORANT, Premium Content

Riot Games

United States (On-Site)
1 Month ago
Electronic Arts - Technical Director - Dynamic Experiences

Electronic Arts

Redwood City, California, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Razer - Software Engineer (DevOps)

Razer

Shah Alam, Selangor, Malaysia (On-Site)
6 Months ago
Larian Studios - Senior Automation Engineer

Larian Studios

Guildford, England, United Kingdom (On-Site)
1 Month ago
The Walt Disney Company - Sr. System Reliability Engineer

The Walt Disney Company

Burbank, California, United States (On-Site)
1 Week ago
NVIDIA - Senior Software and Cloud Architect

NVIDIA

Ra'anana, Center District, Israel (On-Site)
2 Months ago
Luxoft - Senior Software Support Engineer

Luxoft

(Remote)
4 Months ago
EXUSIA - Google Cloud Platform - Data Architect / Engineer

EXUSIA

United States (Remote)
1 Month ago
Luxoft - Senior ETL Developer

Luxoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
4 Months ago
CloudHire - AWS Cloud Engineer

CloudHire

India (Remote)
1 Week ago
Toppan Merrill - Site Reliability Engineer

Toppan Merrill

Chennai, Tamil Nadu, India (On-Site)
6 Months ago
NVIDIA - Senior Storage and Data Production Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
3 Days ago

Get notifed when new similar jobs are uploaded

About The Company

Ahmedabad, Gujarat, India (On-Site)

Hyderabad, Telangana, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Hyderabad, Telangana, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Ahmedabad, Gujarat, India (On-Site)

Ahmedabad, Gujarat, India (On-Site)

Austin, Texas, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Texas, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Ajmera Infotech

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug