Cloud Site Reliability Engineer

4 Hours ago • 2 Years + • Devops

Job Summary

Job Description

NiCE Public Safety is seeking a Cloud Site Reliability Engineer to ensure their cloud platforms are observable, measurable, reliable, scalable, and maintainable. This hands-on role involves acting as a gatekeeper for production, managing work backlogs, and developing reliability improvements. You will lead investigations into outages, performance, and cost issues, and drive automation of low-value tasks. The role requires developing and configuring monitoring dashboards and alerts using tools like Grafana and Azure Monitor, as well as installing and configuring observability platforms. You will also develop bicep modules for monitoring infrastructure.
Must have:
  • 2+ years of experience in Site Reliability Engineering
  • Excellent technical, analytical, and troubleshooting skills
  • In-depth knowledge of databases and data handling (MS-SQL, Elasticsearch, YML, JSON, XML)
  • Significant experience in programming or advanced scripting (C#, PowerShell etc.)
  • Experience with infrastructure/configuration as code and version control (ARM, BICEP, Git)
  • Experience managing monitoring, alerting, and dashboarding platforms (Azure Monitor, Prometheus, Grafana, Elasticsearch)
  • Demonstrable experience of supporting live cloud services and platforms
  • Production experience with Kubernetes and containerization
  • Implementation and support of service level objectives (SLOs)
  • Exposure to commercial cloud providers (Ideally Azure, others considered)
  • Efficient, effective, and respectful communication skills
Good to have:
  • Exposure to Azure DevOps pipelines (CI/CD)
  • Exposure to test frameworks (NUnit, Jasmine, Selenium)
Perks:
  • NiCE-FLEX hybrid model (2 days in office, 3 days remote)
  • Opportunity for learning and growth
  • Internal career opportunities

Job Details

At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.

So, what’s the role all about?

Here at NiCE Public Safety, we provide state of the art solutions for the Public Safety & Justice market, providing software as a service for multi-media evidence management and Emergency Contact Centers to a worldwide customer base.

This is a very hands-on role. You will be involved in ensuring our cloud platforms are observable, measurable, reliable, scalable, and maintainable. It’s likely that the successful candidate will have significant experience in a DevOps, SRE, Cloud Engineer, or Cloud Development role.

How will you make an impact?

  • Act as part of a team of SRE’s that act as the ‘gatekeepers’ of production, and actively manage the work backlog and develop reliability improvements.
  • Lead investigations into root cause outages, performance, and cost issues.
  • Lead initiatives to develop the automation of low-value tasks balanced against project delivery demands.
  • You will provide technical leadership and to wider Cloud Operations and Support teams along with providing oversight to the products and services they support.
  • Develop and configure monitoring dashboards and alerts in tools like Grafana and Azure Monitor.
  • Installation and configuration of Observability Platform including tools like Grafana, Prometheus, Azure Monitor, Open telemetry etc.
  • Developing bicep modules for monitoring infrastructure and deploy it.

Have you got what it takes?

  • Must have 2+ years of experience in Site Reliability Engineering
  • Excellent technical, analytical and troubleshooting skills
  • Experience and in-depth knowledge of databases and data handling (MS-SQL, Elasticsearch, YML, JSON, XML)
  • Significant experience in programming or advanced scripting (C#, PowerShell etc.)
  • Experience with infrastructure/configuration as code and version control (ARM, BICEP, Git)
  • Experience managing monitoring, alerting and dashboarding platforms (Azure Monitor, Prometheus, Grafana, Elasticsearch)
  • Demonstrable experience of supporting live cloud services and platforms
  • Production experience with Kubernetes and containerization
  • Implementation and support of service level objectives (SLOs)
  • Exposure to commercial cloud providers (Ideally Azure, others considered)
  • Exposure to Azure DevOps pipelines is desirable (CI/CD)
  • Exposure to test frameworks is desirable (NUnit, Jasmine, Selenium)
  • Efficient, effective, and respectful communication skills both with customers and within internal departments. Including,
    • Good listener, able to identify and validate assumptions.
    • Able to use effective questioning to confirm understanding of a customer problem and then provide help to solve it.
    • Methodical troubleshooting, technical skill and attention to detail used in diagnosing problems and reproducing issues in a local environment.
    • Multi-tasking and time-management to priorities and switch between varied tasks.

You must

  • Be flexible with working hours when needed to address critical or urgent matters.
  • Be able to provide on-call services from time to time as needed.

What’s in it for you?

Join an ever-growing, market-disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NiCE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NiCEr!

 

Enjoy NiCE-FLEX!

At NiCE, we work according to the NiCE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere.

Requisition ID: 7737
Reporting into: Manager
Role Type: Individual Contributor 

About NiCE

NICE Ltd. (NASDAQ: NICE) software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences, fight financial crime and ensure public safety. Every day, NiCE software manages more than 120 million customer interactions and monitors 3+ billion financial transactions.

Known as an innovation powerhouse that excels in AI, cloud and digital, NiCE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries.

NiCE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.

 

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Pune, Maharashtra, India

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Pune, Maharashtra, India (Hybrid)

Pune, Maharashtra, India (Hybrid)

Pune, Maharashtra, India (Hybrid)

Pune, Maharashtra, India (Hybrid)

Pune, Maharashtra, India (Hybrid)

Richardson, Texas, United States (On-Site)

Atlanta, Georgia, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Nice