Site Reliability Engineer

1 Month ago • 2-4 Years

Job Summary

Job Description

The Site Reliability Engineer team is responsible for designing, implementing, and owning the infrastructure platform and services that protect Trellix Security’s consumers. This role involves supporting Cloud service measurement, monitoring, reporting, deployments, and security. The engineer will also contribute to improving operational quality through established practices and by collaborating with Engineering, QA, and product DevOps teams. The position requires supporting efforts to enhance Operational Excellence and Availability of Trellix Production environments. The selected individual will have access to cutting-edge tools and technologies, offering a great opportunity to build a career with the world’s cybersecurity leader and gain experience working with high-performance Cloud systems.
Must have:
  • 2 to 4 years of hands-on experience in supporting large-scale cloud services.
  • Strong production support background and experience with in-depth troubleshooting.
  • Experience working with solutions in both Linux and Windows environments.
  • Experience using modern Monitoring and Alerting tools (Prometheus, Grafana, PagerDuty, etc.).
  • Excellent written and verbal communication skills.
  • Experience with Python or other scripting languages.
  • Ability to work independently in deploying, testing, and troubleshooting systems.
  • Experience supporting high availability systems and scalable solutions hosted on AWS or GCP.
  • Familiarity with security tools & practices (Wiz, Tenable).
  • Familiarity with Containerization and associated management tools (Docker, Kubernetes).
  • Significant experience in developing and maintaining relationships with a wide range of customers at all levels.
  • Understanding of Incident, Change, Problem and Vulnerability Management processes.
Good to have:
  • Awareness of ITIL best practices
  • AWS Certification and/or Kubernetes Certification
  • Experience with Snowflake
  • Automation/CI/CD experience (Jenkins, Ansible, GitHub Actions, Argo CD)
Perks:
  • Retirement Plans
  • Medical, Dental and Vision Coverage
  • Paid Time Off
  • Paid Parental Leave
  • Support for Community Involvement

Job Details

Job Title:

Site Reliability Engineer

About Trellix:

Trellix, the trusted CISO ally, is redefining the future of cybersecurity and soulful work. Our comprehensive, GenAI-powered platform helps organizations confronted by today’s most advanced threats gain confidence in the protection and resilience of their operations. Along with an extensive partner ecosystem, we accelerate technology innovation through artificial intelligence, automation, and analytics to empower over 53,000 customers with responsibly architected security solutions.
We also recognize the importance of closing the 4-million-person cybersecurity talent gap. We aim to create a home for anyone seeking a meaningful future in cybersecurity and look for candidates across industries to join us in soulful work. More at https://www.trellix.com/.

Role Overview:

The Site Reliability Engineer team is responsible for the design, implementation, and end-to-end ownership of the infrastructure platform and services that protect Trellix Security’s consumers. The services provide continuous protection to our customers, with a very strong focus on quality, and an extensible services platform for internal partners and product teams.

This role is a Site Reliability Engineer for commercial cloud-native solutions, deployed and managed in public cloud environments such as AWS and GCP.

You will be part of a team that is responsible for the Trellix Cloud Services that enable continuous protection in the endpoint products.

Responsibilities of this role include supporting Cloud service measurement, monitoring, reporting, deployments, and security. You will contribute to improving overall operational quality through common practices and by working with the Engineering, QA, and product DevOps teams.

You will also be responsible for supporting efforts that improve Operational Excellence and Availability of Trellix Production environments.

You will have access to the latest tools and technology, and an incredible career path with the world’s cybersecurity leader. You will have the opportunity to immerse yourself in complex and demanding deployment architectures and see the “big picture”, all while helping to drive continuous improvement in all aspects of a dynamic and high-performing engineering organization.

If you are passionate about running and continuously improving a world-class Site Reliability Engineering team, we are offering you a unique opportunity to build your career with us and gain experience working with high-performance Cloud systems.

About Role:

  • Be part of a global 24x7x365 team providing operational coverage, including event response and recovery efforts for critical services.

  • Periodically deploy features, patches, and hotfixes to maintain the Security posture of our Cloud Services.

  • Work in shifts on a rotational basis and participate in on-call duties

  • Take ownership of and responsibility for the high availability of Production environments

  • Contribute to the monitoring of systems, applications, and supporting data

  • Report on system uptime and availability

  • Collaborate with other team members on best practices

  • Assist with creating and updating runbooks & SOPs

  • Build a strong relationship with the Cloud DevOps, Dev & QA teams and become a domain expert for the cloud services in your remit.

  • You will be provided the required support for growth and development in this role.

About you:

  • 2 to 4 years of hands-on experience supporting large-scale cloud services in production.

  • Strong production support background and experience with in-depth troubleshooting

  • Experience working with solutions in both Linux and Windows environments

  • Experience using modern Monitoring and Alerting tools (Prometheus, Grafana, PagerDuty, etc.)

  • Excellent written and verbal communication skills.

  • Experience with Python or other scripting languages

  • Proven ability to work independently in deploying, testing, and troubleshooting systems.

  • Experience supporting high availability systems and scalable solutions hosted on AWS or GCP.

  • Familiarity with security tools & practices (Wiz, Tenable)

  • Familiarity with Containerization and associated management tools (Docker, Kubernetes)

  • Significant experience in developing and maintaining relationships with a wide range of customers at all levels

  • Understanding of Incident, Change, Problem and Vulnerability Management processes.

Desired: 

  • Awareness of ITIL best practices 

  • AWS Certification and/or Kubernetes Certification

  • Experience with Snowflake

  • Automation/CI/CD experience (Jenkins, Ansible, GitHub Actions, Argo CD)

Company Benefits and Perks:

We believe that the best solutions are developed by teams who embrace each other's unique experiences, skills, and abilities. We work hard to create a dynamic workforce where we encourage everyone to bring their authentic selves to work every day. We offer a variety of social programs, flexible work hours and family-friendly benefits to all of our employees.

  • Retirement Plans

  • Medical, Dental and Vision Coverage

  • Paid Time Off

  • Paid Parental Leave

  • Support for Community Involvement

We're serious about our commitment to a workplace where everyone can thrive and contribute to our industry-leading products and customer support, which is why we prohibit discrimination and harassment based on race, color, religion, gender, national origin, age, disability, veteran status, marital status, pregnancy, gender expression or identity, sexual orientation or any other legally protected status.

About The Company

Trellix is a global company redefining the future of cybersecurity and soulful work. The company’s comprehensive, open and native cybersecurity platform helps organizations confronted by today’s most advanced threats gain confidence in the protection and resilience of their operations. Trellix, along with an extensive partner ecosystem, accelerates technology innovation through artificial intelligence, automation, and analytics to empower over 50,000 business and government customers with responsibly architected security.
