Senior Staff Site Reliability Engineer (Cortex Observability)

Palo Alto Networks

California, United States (On-site)

2 Months ago • 5+ Years • DevOps • $126,000 PA - $203,500 PA

Job Summary

Job Description

The Cortex team at Palo Alto Networks is looking for a Senior Staff Site Reliability Engineer to operate and maintain a large-scale GCP environment. This role involves designing, implementing, and enhancing observability systems using modern tools and practices. You will manage high cardinality metrics, implement tracing, and operationalize large-scale logging solutions. Collaboration with engineering teams to provide clear insights into system performance and health is key. Responsibilities include cloud expertise in GCP, improving monitoring processes and alerts, incident management, automating complex tasks, continuous improvement, providing on-call coverage, and influencing service operability for reliability and availability.

Must have:

5+ years experience as DevOps/SRE
High proficiency with observability tools (Thanos, Prometheus, Grafana, OpenTelemetry)
Incident and alerts management (PagerDuty, Prometheus Alertmanager)
High proficiency in GCP or AWS
High proficiency with Kubernetes and Docker
High proficiency in Python and Linux Shell
Experience with Ansible and Terraform
Effective communication and interpersonal skills
Ability to troubleshoot complex problems
Ability to operate independently and take responsibility

Good to have:

Passion for technology and high reliability
Experience with XDR, XSIAM, XSOAR, and XPANSE platforms

Perks:

FLEXBenefits wellbeing spending account
Mental and financial health resources
Personalized learning opportunities
Choice in how you are supported

13 skills required

communication

problem-solving

linux

prometheus

ansible

terraform

grafana

google-cloud-platform

amazon-web-services

docker

kubernetes

python

shell

Job Details

Company Description

Our Mission

At Palo Alto Networks® everything starts and ends with our mission:

Being the cybersecurity partner of choice, protecting our digital way of life.
Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are

We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers and we believe that the unique ideas of every member of our team contribute to our collective success. Our values were crowdsourced by employees and are brought to life through each of us every day, from disruptive innovation and collaboration to execution, and from showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few!

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work full time from our office with flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. Our goal is to create an environment where we all win with precision.

Job Description

Your Career

The Cortex team builds and delivers the industry’s most advanced SecOps platform, consisting of XDR, XSIAM, XSOAR, and XPANSE. As a member of the Cortex DevOps team, your role involves operating and maintaining a large-scale GCP environment, including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides, you will have a deep knowledge of modern observability and monitoring tools and practices, having managed high cardinality metrics, implemented tracing, and operationalized large-scale logging solutions. As part of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear and actionable insights into our systems’ performance and health.

Your Impact

As a Senior Staff SRE with the Cortex Observability team, you will:

Cloud Expertise: Utilize your expertise in monitoring cloud platforms, particularly GCP, to optimize our infrastructure, leveraging cloud-native technologies
Monitoring Expertise: Improve monitoring processes, alerts, and metrics. Work with development teams to ensure that all of our services have the right monitoring and metrics in place so that we detect problems before our customers do
Incident Management: Leverage incident management processes to ensure efficient resolution of system issues and minimal impact on services
Automation: Automate complex monitoring and alerting tasks by building tools for cloud operations, such as automated remediation of known issues and auto-scaling
Continuously Improve: Stay up-to-date with cutting-edge technologies, evaluate their potential impact on our operations, and implement them when appropriate
On-Call: Provide follow-the-sun operational coverage for our production Observability infrastructure
Collaborate: Work with our Engineering team to influence the operability of the product and ensure the reliability and availability of our services

Qualifications

Your Experience

DevOps/SRE Expertise: 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong motivation for high reliability at the service level
Observability Tools: High proficiency with Thanos, Prometheus, Grafana, OpenTelemetry, and other monitoring tools
Incident and Alerts Management: Clear understanding of incident and alerts management using tools like PagerDuty and Prometheus Alertmanager
Cloud Proficiency: High proficiency in either Google Cloud Platform or Amazon Web Services
Kubernetes and Docker: High proficiency with Kubernetes and Docker for container orchestration
Scripting and Automation: High proficiency in Python programming and Linux Shell commands. Experience with Ansible and Terraform for infrastructure as code
Communication Skills: Effective communication and interpersonal skills, with the ability to work and coordinate between multiple teams in different time zones
Troubleshooting: Ability to effectively troubleshoot and address emerging and complex problems
Independence: Ability to operate independently, make decisions, take action, and take responsibility

Additional Information

The Team

We’re trailblazers who dream big, take risks, and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating together.

Compensation Disclosure

The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/commissioned roles) is expected to be between $126,000/YR and $203,500/YR. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

Our Commitment

We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

About The Company

Palo Alto Networks

Our enterprise security platform detects and prevents known and unknown threats while safely enabling an increasingly complex and rapidly growing number of applications. Come be part of the team that redefined the firewall industry and is now the fastest-growing security company in history. Palo Alto Networks, the global cybersecurity leader, is shaping the cloud-centric future with technology that is transforming the way people and organizations operate. Our mission is to be the cybersecurity partner of choice, protecting our digital way of life. We help address the world's greatest security challenges with continuous innovation that seizes the latest breakthroughs in artificial intelligence, analytics, automation, and orchestration. By delivering an integrated platform and empowering a growing ecosystem of partners, we are at the forefront of protecting tens of thousands of organizations across clouds, networks, and mobile devices. Our vision is a world where each day is safer and more secure than the one before.
