Network Operations Center – Level 2 Support Engineer

1 Month ago • 2 Years +

Job Summary

Job Description

The NOC Level 2 Support Engineer will monitor, diagnose, troubleshoot, track, and document multiple environments, as well as provide day-to-day customer support. The engineer will monitor dashboards, proactively detect incidents, investigate and resolve issues, and escalate when needed. Responsibilities include working with incident managers, performing root cause analysis, documenting runbooks, implementing and improving processes, and developing dashboards. This role requires strong communication skills, adaptability, and the ability to work in shifts to provide 24/7 coverage.
Must have:
  • 2 years of experience in a 24/7 NOC support team.
  • Excellent troubleshooting and problem-solving abilities.
  • Background in Linux administration.
  • Good networking understanding (TCP/IP, DNS, etc.).
  • Understanding of technologies like Apache, Nginx, Databases, etc.
  • Experience with supporting Cloud-based applications (AWS).
  • Experience with monitoring systems (Datadog).
  • Experience coordinating with multiple teams (Helpdesk & SRE).
  • Excellent communication and interpersonal skills.
  • Ability to offer flexibility during peak times.
  • Experience in creating technical documentation and reports.
  • Readiness to offer training to colleagues.
Good to have:
  • Experience with Datadog Monitoring and Incident Management.
  • Experience maintaining CI/CD pipelines (Jenkins, CircleCI).
  • Scripting/programming knowledge (Unix shell scripting).
  • Background in Windows Administration.
  • Experience with Postgres.
  • Experience with Kubernetes.

Job Details

Purpose of the Role

We are seeking a NOC Level 2 Support Engineer to work in our Cape Town office, who can apply their technical skills in a fast-paced and complex environment. The Level 2 NOC Engineer’s duties will include monitoring, diagnosing, troubleshooting, tracking, and documenting the multiple environments and day-to-day customer support and interactions.   


Strong communication skills are a particularly important requirement for this role. NOC Level 2 Support Engineers should enjoy working in a fast-paced environment where adaptability and flexibility will be key to their success. The successful candidate will be able to work independently and within groups working in shifts to cover 24/7 coverage.

Responsibilities:

  • Monitor Dashboards of key metrics, proactively detecting any possible incidents before they occur.
  • Proactive monitoring of Slack channels for issues raised both internally and externally.
  • Investigate, diagnose, troubleshoot, and resolve incidents where possible.
  • Escalate incidents that require additional expertise with SRE, DBA’s, Dev Support, etc, and work with them until the incident is resolved.
  • Working with Incident managers and other teams in war rooms for P1/2 issues to restore operations as quickly as possible.
  • Be involved in the root cause analysis of incidents and help with incident reports.
  • Adding and updating documentation for runbooks used to help troubleshoot and resolve incidents and share knowledge with the rest of the team.
  • Implement and improve processes for monitoring/alerting, systems maintenance, and escalation.
  • Helping and guiding the development of tooling used to troubleshoot and resolve issues to make NOC work more effectively.
  • Develop key dashboards for transparency of reporting uptime and other metrics as identified.

Requirements:

  • A minimum of 2 years’ experience working in a NOC team offering 24/7 critical support. 
  • Excellent troubleshooting and creative problem-solving abilities.
  • Background in Linux administration.
  • Good networking understanding (TCP/IP, DNS, routing, firewalls, etc.).
  • Good understanding of technologies such as Apache, Nginx, Databases, DNS servers, etc.
  • Experience with supporting Cloud-based applications – we use Amazon Web Services (AWS).
  • Experience in using monitoring systems and investigating issues at a log level – we use Datadog.
  • Experience coordinating and collaborating with multiple teams such as Helpdesk & SRE.
  • Excellent communication and interpersonal skills.
  • Ability to offer flexibility during peak times and critical projects for changing shift patterns.
  • Experience in creating technical documentation and reports. 
  • Readiness to offer training to colleagues when needed.

 

Advantageous:

  • Experience with Datadog Monitoring and Incident Management is a plus.
  • Experience maintaining continuous integration and delivery pipelines with tools such as Jenkins and CircleCI.
  • Scripting/programming knowledge of at least Unix shell scripting.
  • Background in Windows Administration.
  • Experience with Postgres.
  • Experience with Kubernetes.

 

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Cape Town, Western Cape, South Africa

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

White Hat Gaming is a state-of-the-art iGaming platform providing a secure, scalable and flexible modular Casino and Sportsbook Player Account Management solution.

View All Jobs

Get notified when new jobs are added by White Hat Gaming

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug