Senior Specialist Cloud SRE

undefined ago • 10 Years + • Devops

Job Summary

Job Description

NICE is seeking an experienced Cloud Site Reliability Engineer to support large, complex enterprise software clients. The role involves delivering real-time insights from massive scale data, requiring excellent problem-solving skills. Candidates will run and improve production environments, build systems for infrastructure and applications, and optimize performance. This position demands a holistic view of system health, strong collaboration with cross-functional teams, and a commitment to developing innovative solutions and positive user experiences. The ideal candidate brings fresh ideas and a unique viewpoint to a fast-paced, collaborative setting.
Must have:
  • Run production environment, monitor availability, and ensure system health.
  • Build software and systems for platform infrastructure and applications.
  • Improve reliability, quality, and time-to-market of software solutions.
  • Optimize system performance, innovate, and anticipate customer needs.
  • Provide operational support for large distributed software applications.
  • Analyze metrics from OS and applications for performance tuning and fault finding.
  • Partner with development teams to improve services via testing and release procedures.
  • Participate in system design, platform management, and capacity planning.
  • Create sustainable systems and services through automation and uplifts.
  • Balance feature development speed and reliability with service level objectives.
  • 10+ years programming/scripting experience (Go, Python, .Net (C#), Node).
  • 8 years experience in systems engineering, automation, and reliability.
  • Proficiency in Python, Go, Java, C#, Bash, PowerShell.
  • Deep understanding of AWS cloud platforms and services (EC2, ECS, Lambda, DynamoDB).
  • Experience with CloudFormation, Terraform for infrastructure as code.
  • Deep understanding of CI/CD concepts and tools (Jenkins, GitLab CI/CD, CircleCI).
  • Strong knowledge of Docker, Kubernetes, and microservices architecture.
  • Experience with monitoring tools (Prometheus, Grafana, ELK stack, Cloudwatch).
  • Excellent problem-solving and troubleshooting skills for distributed systems.
  • Experience in incident management and blameless postmortems.
Good to have:
  • Kubernetes certification
  • Grafana
  • AWS
  • Azure
  • DevOps experience
Perks:
  • Opportunity to learn and grow in a market-leading global company.
  • Endless internal career opportunities across multiple roles and disciplines.
  • NICE-FLEX hybrid model: 2 days office, 3 days remote work per week.

Job Details

At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.

So, what’s the role all about?

NICE is looking for a Cloud Site Reliability Engineer. Candidates will work supporting large complex enterprise software clients including applications, servers, SQL, network and must have excellent problem-solving skills. As we expand our customer deployments, we are currently seeking an experienced SRE to deliver insights from massive scale data in real time. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

How will you make an impact?

  • Run the production environment by monitoring availability and taking a holistic view of system health
  • Build software and systems to manage platform infrastructure and applications
  • Improve reliability, quality, and time-to-market of our suite of software solutions
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
  • Provide primary operational support and engineering for multiple large distributed software applications
  • Gather and analyze metrics from both operating systems and applications to assist in performance tuning and fault finding
  • Partner with development teams to improve services through rigorous testing and release procedures
  • Participate in system design consulting, platform management, and capacity planning
  • Create sustainable systems and services through automation and uplifts
  • Balance feature development speed and reliability with well-defined service level objectives

Have you got what it takes?

  • 10+ years programming/scripting experience with any of the following: (Go, Python, .Net (C#), Node)
  • Bachelor’s degree in computer science, Engineering, or related field (or equivalent experience).
  • 8 years of working experience in a similar role, with a focus on systems engineering, automation, and reliability.
  • Proficiency in at least one programming language (e.g., Python, Go, Java, C#) and experience with scripting languages (e.g., Bash, PowerShell).
  • Deep understanding of cloud computing platforms (e.g., AWS), the working and reliability constraints of some of the prominent services (e.g., EC2, ECS, Lambda, DynamoDB etc)
  • Experience with infrastructure as code tools such as CloudFormation, Terraform.
  • Deep understanding of CI/CD concepts and experience with CI/CD tools such as Jenkins, GitLab CI/CD, or CircleCI.
  • Strong knowledge of containerization technologies (e.g., Docker, Kubernetes) and microservices architecture.
  • Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK stack, Cloudwatch).
  • Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems.
  • Experience of Incident management and blameless postmortems that includes driving the incident response efforts during outages and other critical incidents, resolution, and communication in a cross-functional team setup.

You will have an advantage if you also have:

Kubernetes + certification, Grafana, AWS, Azure, DevOps experience.

What’s in it for you?

Join an ever-growing, market disrupting, global company where the teams – comprised of the best of the best – work in a fast-paced, collaborative, and creative environment! As the market leader, every day at NICE is a chance to learn and grow, and there are endless internal career opportunities across multiple roles, disciplines, domains, and locations. If you are passionate, innovative, and excited to constantly raise the bar, you may just be our next NICEr!

Enjoy NICE-FLEX!

At NICE, we work according to the NICE-FLEX hybrid model, which enables maximum flexibility: 2 days working from the office and 3 days of remote work, each week. Naturally, office days focus on face-to-face meetings, where teamwork and collaborative thinking generate innovation, new ideas, and a vibrant, interactive atmosphere.

Requisition ID: 8355

Reporting into: Tech Manager

Role Type: Individual Contributor

About NiCE

NICE Ltd. (NASDAQ: NICE) software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences, fight financial crime and ensure public safety. Every day, NiCE software manages more than 120 million customer interactions and monitors 3+ billion financial transactions.

Known as an innovation powerhouse that excels in AI, cloud and digital, NiCE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries.

NiCE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Pune, Maharashtra, India

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Hoboken, New Jersey, United States (Hybrid)

Pune, Maharashtra, India (Hybrid)

Pune, Maharashtra, India (Hybrid)

Pune, Maharashtra, India (Hybrid)

Sandy, Utah, United States (Hybrid)

Ra'anana, Center District, Israel (Hybrid)

Pune, Maharashtra, India (Hybrid)

View All Jobs

Get notified when new jobs are added by Nice

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug