Site Reliability Engineer

10 Hours ago • 2-4 Years

Job Summary

Job Description

The Site Reliability Engineer team is responsible for designing, implementing, and owning the infrastructure platform and services that protect Trellix Security’s consumers. This role involves supporting Cloud service measurement, monitoring, reporting, deployments, and security. The engineer will also contribute to improving operational quality through established practices and by collaborating with Engineering, QA, and product DevOps teams. The position requires supporting efforts to enhance Operational Excellence and Availability of Trellix Production environments. The selected individual will have access to cutting-edge tools and technologies, offering a great opportunity to build a career with the world’s cybersecurity leader and gain experience working with high-performance Cloud systems.
Must have:
  • 2 to 4 years of hands-on experience in supporting large-scale cloud services.
  • Strong production support background and experience in-depth troubleshooting.
  • Experience working with solutions in both Linux and Windows environments.
  • Experience using modern Monitoring and Alerting tools (Prometheus, Grafana, PagerDuty, etc.).
  • Excellent written and verbal communication skills.
  • Experience with Python or other scripting languages.
  • Ability to work independently in deploying, testing, and troubleshooting systems.
  • Experience supporting high availability systems and scalable solutions hosted on AWS or GCP.
  • Familiarity with security tools & practices (Wiz, Tenable).
  • Familiarity with Containerization and associated management tools (Docker, Kubernetes).
  • Significant experience of developing and maintaining relationships with a wide range of customers at all levels.
  • Understanding of Incident, Change, Problem and Vulnerability Management processes.
Good to have:
  • Awareness of ITIL best practices
  • AWS Certification and/or Kubernetes Certification
  • Experience with SnowFlake
  • Automation/CI/CD experience, Jenkins, Ansible, Github Actions, Argo CD.
Perks:
  • Retirement Plans
  • Medical, Dental and Vision Coverage
  • Paid Time Off
  • Paid Parental Leave
  • Support for Community Involvement

Job Details

Job Title:

Site Reliability Engineer

About Trellix:

Trellix, the trusted CISO ally, is redefining the future of cybersecurity and soulful work. Our comprehensive, GenAI-powered platform helps organizations confronted by today’s most advanced threats gain confidence in the protection and resilience of their operations. Along with an extensive partner ecosystem, we accelerate technology innovation through artificial intelligence, automation, and analytics to empower over 53,000 customers with responsibly architected security solutions.
We also recognize the importance of closing the 4-million-person cybersecurity talent gap. We aim to create a home for anyone seeking a meaningful future in cybersecurity and look for candidates across industries to join us in soulful work. More at https://www.trellix.com/.

Role Overview:

The Site Reliability Engineer team is responsible for design, implementation and end to end ownership of the infrastructure platform and services that protect the Trellix Security’s Consumer. The services provide continuous protection to our customers with a very strong focus on quality and an extendible services platform to internal partners & product teams.

This role is a Site Reliability Engineer for commercial cloud-native solutions, deployed and managed in public cloud environments like AWS, GCP.

You will be part of a team that is responsible for Trellix Cloud Services that enable protection at the endpoint products on a continuous basis.

Responsibilities of this role include supporting Cloud service measurement, monitoring, and reporting, deployments and security. You will input into improving overall operational quality through common practices and by working with the Engineering, QA, and product DevOps teams.

You will also be responsible for supporting efforts that improve Operational Excellence and Availability of Trellix Production environments.

You will have access to the latest tools and technology, and an incredible career path with the world’s cyber security leader. You will have the opportunity to immerse yourself within complex and demanding deployment architectures and see the “big picture” all while helping to drive continuous improvement in all aspects of a dynamic and high-performing engineering organization.

If you are passionate about running and continuously improving as a world class Site Reliability Engineer Team, we are offering you a unique and great opportunity to build your career with us and gain experience working with high-performance Cloud systems.

About Role:

  • Being part of a global 24x7x365 team providing the operational coverage  including event response and recovery efforts of critical services. 

  • Periodic deployment of features, patches and hotfixes to maintain the Security posture of our Cloud Services. 

  • Ability to work in shifts on a rotational basis and participate in On-Call duties

  • Have ownership and responsibility for high availability of Production environments

  • Input into the monitoring of systems applications and supporting data

  • Report on system uptime and availability

  • Collaborate with other team members on best practices

  • Assist with creating and updating runbooks & SOPs

  • Build a strong relationship with the Cloud DevOps, Dev & QA teams and become a domain expert for the cloud services in your remit.

  • Provided the required support for growth and development in this role.

About you:

  • 2 to 4 years of hands-on working experience in supporting production of large-scale cloud services.

  • Strong production support background and experience of in-depth troubleshooting

  • Experience working with solutions in both Linux and Windows environments

  • Experience using modern Monitoring and Alerting tools (Prometheus, Grafana, PagerDuty, etc.)

  • Excellent written and verbal communication skills.

  • Experience with Python or other scripting languages

  • Proven ability to work independently in deploying, testing, and troubleshooting systems.

  • Experience supporting high availability systems and scalable solutions hosted on AWS or GCP.

  • Familiarity with security tools & practices (Wiz, Tenable)

  • Familiarity with Containerization and associated management tools (Docker, Kubernetes)

  • Significant experience of developing and maintaining relationships with a wide range of customers at all levels

  • Understanding of Incident, Change, Problem and Vulnerability Management processes.

Desired: 

  • Awareness of ITIL best practices 

  • AWS Certification and/or Kubernetes Certification

  • Experience with SnowFlake

  • Automation/CI/CD experience, Jenkins, Ansible, Github Actions,  Argo CD.

Company Benefits and Perks:

We believe that the best solutions are developed by teams who embrace each other's unique experiences, skills, and abilities. We work hard to create a dynamic workforce where we encourage everyone to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees.

  • Retirement Plans

  • Medical, Dental and Vision Coverage

  • Paid Time Off

  • Paid Parental Leave

  • Support for Community Involvement

We're serious about our commitment to a workplace where everyone can thrive and contribute to our industry-leading products and customer support, which is why we prohibit discrimination and harassment based on race, color, religion, gender, national origin, age, disability, veteran status, marital status, pregnancy, gender expression or identity, sexual orientation or any other legally protected status.

Similar Jobs

Blizzard Entertainment - Sr. Systems Engineer I

Blizzard Entertainment

Shanghai, Shanghai, China (On-Site)
2 Months ago
Coda - Senior/Staff Software Engineer

Coda

Manila, Metro Manila, Philippines (Hybrid)
3 Years ago
KBG Blockchain Game Studios - DevOps (Blockchain Gaming)

KBG Blockchain Game Studios

Thành Phố Hồ Chí Minh, Vietnam (On-Site)
9 Months ago
Addepar - Sr. Software Data Engineer

Addepar

Pune, Maharashtra, India (Hybrid)
1 Day ago
Trend Micro - (Sr.) Backend Engineer

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Easygo - Data Scientist

Easygo

Melbourne, Victoria, Australia (On-Site)
3 Months ago
ByteDance - Data Engineer, Cloud and System

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
sound cloud - Senior Backend Engineer - Media Streaming

sound cloud

London, England, United Kingdom (Hybrid)
1 Day ago
Sumo Logic - Senior Software Engineer I - ML Engineer

Sumo Logic

(Remote)
1 Day ago
Nagarro - Senior Staff Engineer - Python Full Stack

Nagarro

Colombia (Remote)
2 Months ago
Applike Group - Manual Tester

Applike Group

Hamburg, Hamburg, Germany (Hybrid)
4 Months ago
KBG Blockchain Game Studios - Back-End Developer (NodeJS)

KBG Blockchain Game Studios

Thành Phố Hồ Chí Minh, Vietnam (On-Site)
9 Months ago
Microsoft - Technical Program Manager, Data Insights & Governance

Microsoft

Redmond, Washington, United States (On-Site)
2 Weeks ago
WinZO - DevOps Engineer

WinZO

(Remote)
1 Day ago
Netflix - Engineer Manager - Intelligence and Experience Engineering

Netflix

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

Enphase Energy - Associate Manager/Manager - Web Projects (Design)

Enphase Energy

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Dialpad AI - Sr. SDET

Dialpad AI

Bengaluru, Karnataka, India (Hybrid)
21 Hours ago
commerce iq - Data Scientist II

commerce iq

Bengaluru, Karnataka, India (On-Site)
19 Hours ago
Ajmera Infotech - Senior React Developer

Ajmera Infotech

Hyderabad, Telangana, India (On-Site)
9 Months ago
Tekion Corp - Lead Product Manager

Tekion Corp

Bengaluru, Karnataka, India (On-Site)
1 Day ago
Insight  Software - Lead Software Engineer

Insight Software

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Google - Web Solutions Engineer, University Graduate, 2025

Google

Hyderabad, Telangana, India (On-Site)
4 Months ago
Workato - Senior Development and Demo Applications Administrator

Workato

Chennai, Tamil Nadu, India (On-Site)
8 Hours ago
PwC - Associate|Oracle fusion Finance| Oracle|Advisory|Mumbai

PwC

Mumbai, Maharashtra, India (On-Site)
6 Months ago
Nagarro - Associate Principal Engineer, Frontend Angular2x

Nagarro

India (Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Trellix is a global company redefining the future of cybersecurity and soulful work. The company’s comprehensive, open and native cybersecurity platform helps organizations confronted by today’s most advanced threats gain confidence in the protection and resilience of their operations. Trellix, along with an extensive partner ecosystem, accelerates technology innovation through artificial intelligence, automation, and analytics to empower over 50,000 business and government customers with responsibly architected security.

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Guangzhou, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Aylesbury, England, United Kingdom (Hybrid)

View All Jobs

Get notified when new jobs are added by Treelix

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug