Lead site reliability engineer

3 Months ago • All levels • Devops

Job Summary

Job Description

The Site Reliability Engineer Lead will oversee support operations and site reliability engineering tasks, ensuring the effective functioning of systems and applications. Key responsibilities include managing a team, monitoring system performance, collaborating with cross-functional teams, developing incident response procedures, conducting audits, leading automation implementation, and providing technical guidance. This role focuses on enhancing system performance, availability, and resiliency. The candidate should have experience with monitoring tools, containerization technologies, and strong project management skills. This role may also be eligible for performance-based bonuses subject to company policies. In addition, this role is eligible for the following benefits subject to company policies: medical, dental, vision, pharmacy, life, accidental death & dismemberment, and disability insurance; employee assistance program; 401(k) retirement plan; 10 days of paid time off per year (some positions are eligible for need-based leave with no designated number of leave days per year); and 10 paid holidays per year.
Must have:
  • Proficiency in site reliability engineering (SRE) principles and practices.
  • Strong background in system administration, networking, and cloud computing.
  • Experience with monitoring tools such as Prometheus, Grafana, and ELK stack.
  • Knowledge of containerization technologies like Docker and Kubernetes.
  • Ability to troubleshoot complex technical issues and perform root cause analysis.
  • Excellent communication skills and ability to work collaboratively in a team environment.
  • Strong project management and leadership skills to drive initiatives efficiently.
Good to have:
  • Certifications in relevant areas such as AWS Certified DevOps Engineer or Google Professional Cloud DevOps Engineer are a plus.
Perks:
  • Medical, dental, vision, pharmacy, life, accidental death & dismemberment, and disability insurance
  • Employee assistance program
  • 401(k) retirement plan
  • 10 days of paid time off per year
  • 10 paid holidays per year

Job Details

Job description:

About HCLTech
HCLTech is a global technology company, spread across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. We re powered by our people a global, diverse, multi-generational talent - representing 161 nationalities whose unique spark, perspective and boundless passion drive our culture of proactive value creation and problem-solving.
Our purpose is to bring together the best of technology and our people to supercharge progress for everyone, everywhere our clients, partners, their stakeholders, communities, and the planet. As a company, we are deeply focused on accelerating our ESG agenda. We are also creating technology-enabled sustainable solutions with and for our clients and partners. We embed ESG imperatives into every aspect of our business and ensure that the progress we supercharge is responsible, inclusive and beneficial to all our stakeholders in the long term. We have committed to achieving net zero by 2040.

To learn more about how we can supercharge progress for you, visit www.hcltech.com

Site Reliability Engineer Lead

Job Summary
The Support Lead (SRE) is responsible for overseeing the support operations and site reliability engineering tasks, ensuring the effective functioning of systems and applications. The primary goal is to enhance system performance, availability, and resiliency.

  • Key Responsibilities
    1. Manage a team of support engineers and sres to provide technical support and address system issues promptly.
    2. Monitor system performance and reliability metrics, identifying areas for improvement and implementing solutions.
    3. Collaborate with cross functional teams to optimize application performance and enhance system reliability.
    4. Develop and maintain incident response procedures and protocols to minimize system downtime.
    5. Conduct regular audits and assessments to ensure compliance with industry standards and best practices.
    6. Lead the implementation of automation tools and processes to streamline support operations and enhance efficiency.
    7. Provide technical expertise and guidance to team members, promoting a culture of continuous learning and development.

    Skill Requirements
    1. Proficiency in site reliability engineering (sre) principles and practices.
    2. Strong background in system administration, networking, and cloud computing.
    3. Experience with monitoring tools such as prometheus, grafana, and elk stack.
    4. Knowledge of containerization technologies like docker and kubernetes.
    5. Ability to troubleshoot complex technical issues and perform root cause analysis.
    6. Excellent communication skills and ability to work collaboratively in a team environment.
    7. Strong project management and leadership skills to drive initiatives and deliver results efficiently.
    8. Certifications in relevant areas such as aws certified devops engineer or google professional cloud devops engineer are a plus.

Similar Jobs

Wrike - Enablement Business Partner – Customer Solutions & Innovation

Wrike

Prague, Prague, Czechia (Hybrid)
1 Month ago
Haptic  - Senior VFX Artist

Haptic

Paris, Île-de-France, France (Remote)
7 Months ago
bytedance - Insurance Product Manager - Global Payment

bytedance

Singapore (On-Site)
9 Months ago
Everlaw - Manager, Product Lead

Everlaw

Oakland, California, United States (Hybrid)
1 Month ago
Kyruus Health - Director, DevOps & Infrastructure

Kyruus Health

United States (Remote)
2 Months ago
ShyftLabs - Salesforce Marketing Cloud Architect

ShyftLabs

Noida, Uttar Pradesh, India (Hybrid)
5 Months ago
Nagarro - Associate Staff Engineer, DevOps

Nagarro

(On-Site)
9 Months ago
Brillio - Full Stack/Architect (Python, React, Strapi, AWS, Terraform)

Brillio

New York, United States (Remote)
1 Month ago
Brillio - .NET Azure Architect - R01525011

Brillio

Pune, Maharashtra, India (Hybrid)
10 Months ago
Lambda - Technical Solutions Enablement Engineer

Lambda

San Francisco, California, United States (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Square - Senior Software Engineer

Square

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Zuru - Influencer Research Executive

Zuru

Ahmedabad, Gujarat, India (On-Site)
1 Year ago
Abridge - Deal Desk Manager

Abridge

Chicago, Illinois, United States (Remote)
1 Month ago
Postman - Senior Product Designer, Integrations & Ecosystem

Postman

San Francisco, California, United States (Hybrid)
1 Month ago
Toast - Technical Zuora Revenue Analyst

Toast

United States (Remote)
2 Months ago
WebTech Corporation - Environmental, Health & Safety Site Leader

WebTech Corporation

Duncan, South Carolina, United States (On-Site)
3 Months ago
Lilt - Thai Medical Translators

Lilt

Bangkok, Thailand (Remote)
7 Months ago
Toast - Territory Account Executive

Toast

New York, United States (On-Site)
2 Months ago
JDA - Senior Specialist Employee Engagement & Internal Communications

JDA

Las Vegas, Nevada, United States (On-Site)
2 Months ago
OKX - Senior Audit Manager, FinCrime (EMEA)

OKX

Dubai, Dubai, United Arab Emirates (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Illinois, United States

Giant Sparrow - Gameplay Programmer

Giant Sparrow

Los Angeles, California, United States (Remote)
1 Month ago
Playstation - Concept Artist (Environment - Contract)

Playstation

Los Angeles, California, United States (On-Site)
1 Month ago
Greenworks Sunrise Global Marketing - Territory Sales Manager

Greenworks Sunrise Global Marketing

Sacramento, California, United States (Remote)
3 Months ago
Marvell - Principal Optical Engineer

Marvell

Santa Clara, California, United States (On-Site)
2 Months ago
Perplexity - Site Reliability Engineer

Perplexity

San Francisco, California, United States (On-Site)
3 Months ago
Glean - Software Engineer, Backend

Glean

Palo Alto, California, United States (Hybrid)
1 Month ago
MiQ - Associate Account Manager

MiQ

New York, New York, United States (Hybrid)
2 Months ago
Privy - Forward Deployed Engineer

Privy

New York, United States (Remote)
8 Months ago
Apple - Health Sensing - Sensing Hardware Prototyping Engineer

Apple

Cupertino, California, United States (On-Site)
2 Months ago
storm flag games - C++ Game Engineer / Senior C++ Game Engineer

storm flag games

Dedham, Massachusetts, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

NVIDIA - Senior Solutions Architect, Omniverse Platform

NVIDIA

Beijing, Beijing, China (On-Site)
5 Months ago
Interface AI - Software Development Engineer II - Backend (Core Platform)

Interface AI

India (Remote)
1 Month ago
C3 IoT - Solution Engineer

C3 IoT

New York, United States (On-Site)
1 Month ago
gitlab - Solutions Architect

gitlab

Canada (Remote)
3 Months ago
extreme network - Senior/Staff Systems Software Engineer – Linux Platform & Virtualization

extreme network

Ontario, Canada (Hybrid)
5 Months ago
Qualcomm - Senior Engineer- Python automation framework Machine learning

Qualcomm

Hyderabad, Telangana, India (On-Site)
1 Month ago
BigID - Solutions Engineer

BigID

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
1 Month ago
Adtran - Sr. DevOps Software Engineer

Adtran

Huntsville, Alabama, United States (On-Site)
2 Months ago
Xsolla - Solutions Engineer

Xsolla

(Remote)
4 Months ago
Daybreak Game Company LLC - Senior Software Engineer, Platform

Daybreak Game Company LLC

San Diego, California, United States (Remote)
9 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Pune, Maharashtra, India (On-Site)

Noida, Uttar Pradesh, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Noida, Uttar Pradesh, India (On-Site)

Pune, Maharashtra, India (On-Site)

Chennai, Tamil Nadu, India (On-Site)

California, United States (On-Site)

Noida, Uttar Pradesh, India (On-Site)

Jersey City, New Jersey, United States (On-Site)

Texas, United States (On-Site)

View All Jobs

Get notified when new jobs are added by HCL Tech

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug