Network Operations Center – Level 2 Support Engineer

2 Hours ago • 2 Years + • Administrative

Job Summary

Job Description

The Network Operations Center (NOC) Level 2 Support Engineer will monitor, diagnose, troubleshoot, and resolve incidents across multiple environments. Responsibilities include proactive monitoring of dashboards and Slack channels, escalating issues to appropriate teams (SRE, DBAs, Dev Support), participating in war rooms for critical incidents, performing root cause analysis, updating documentation, and improving monitoring/alerting processes. The role requires strong communication, problem-solving skills, and experience with Linux administration, networking, AWS, Datadog, and collaborating with various teams. The position requires working in shifts to provide 24/7 coverage and involves working both independently and as part of a team. Experience with Datadog, Jenkins, CircleCI, and scripting is advantageous.
Must have:
  • 2+ years NOC experience
  • Excellent troubleshooting skills
  • Linux administration background
  • Networking understanding (TCP/IP, DNS)
  • AWS experience
  • Datadog monitoring experience
  • Strong communication skills
Good to have:
  • Datadog & Incident Management
  • Jenkins/CircleCI experience
  • Unix shell scripting
  • Windows administration
  • Postgres experience
  • Kubernetes experience

Job Details

Purpose of the Role

We are seeking a NOC Level 2 Support Engineer to work in our Cape Town office, who can apply their technical skills in a fast-paced and complex environment. The Level 2 NOC Engineer’s duties will include monitoring, diagnosing, troubleshooting, tracking, and documenting the multiple environments and day-to-day customer support and interactions.   


Strong communication skills are a particularly important requirement for this role. NOC Level 2 Support Engineers should enjoy working in a fast-paced environment where adaptability and flexibility will be key to their success. The successful candidate will be able to work independently and within groups working in shifts to cover 24/7 coverage.

Responsibilities:

  • Monitor Dashboards of key metrics, proactively detecting any possible incidents before they occur.
  • Proactive monitoring of Slack channels for issues raised both internally and externally.
  • Investigate, diagnose, troubleshoot, and resolve incidents where possible.
  • Escalate incidents that require additional expertise with SRE, DBA’s, Dev Support, etc, and work with them until the incident is resolved.
  • Working with Incident managers and other teams in war rooms for P1/2 issues to restore operations as quickly as possible.
  • Be involved in the root cause analysis of incidents and help with incident reports.
  • Adding and updating documentation for runbooks used to help troubleshoot and resolve incidents and share knowledge with the rest of the team.
  • Implement and improve processes for monitoring/alerting, systems maintenance, and escalation.
  • Helping and guiding the development of tooling used to troubleshoot and resolve issues to make NOC work more effectively.
  • Develop key dashboards for transparency of reporting uptime and other metrics as identified.

Requirements:

  • A minimum of 2 years’ experience working in a NOC team offering 24/7 critical support. 
  • Excellent troubleshooting and creative problem-solving abilities.
  • Background in Linux administration.
  • Good networking understanding (TCP/IP, DNS, routing, firewalls, etc.).
  • Good understanding of technologies such as Apache, Nginx, Databases, DNS servers, etc.
  • Experience with supporting Cloud-based applications – we use Amazon Web Services (AWS).
  • Experience in using monitoring systems and investigating issues at a log level – we use Datadog.
  • Experience coordinating and collaborating with multiple teams such as Helpdesk & SRE.
  • Excellent communication and interpersonal skills.
  • Ability to offer flexibility during peak times and critical projects for changing shift patterns.
  • Experience in creating technical documentation and reports. 
  • Readiness to offer training to colleagues when needed.

 

Advantageous:

  • Experience with Datadog Monitoring and Incident Management is a plus.
  • Experience maintaining continuous integration and delivery pipelines with tools such as Jenkins and CircleCI.
  • Scripting/programming knowledge of at least Unix shell scripting.
  • Background in Windows Administration.
  • Experience with Postgres.
  • Experience with Kubernetes.

 

Similar Jobs

ByteDance - Senior Site Reliability Engineer, ML System - Foundation Model

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
Rapt Studio - Senior Designer (Interior Design/Architecture)

Rapt Studio

Los Angeles, California, United States (Hybrid)
5 Months ago
NVIDIA - System Software Engineer Intern - Autonomous Vehicles - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
The Embassy - Pipeline TD

The Embassy

Vancouver, British Columbia, Canada (Hybrid)
1 Month ago
Luxoft - Orchestrade - Azure infrastructure cloud Regular engineer

Luxoft

Poland, Ohio, United States (Remote)
4 Months ago
Tesla - Customer Support Specialist

Tesla

Mumbai, Maharashtra, India (On-Site)
1 Month ago
Ubisoft - Application Specialist

Ubisoft

Bucharest, Bucharest, Romania (Hybrid)
1 Week ago
Gaming Innovation Group  - Infrastructure Engineer

Gaming Innovation Group

Sliema, Malta (Hybrid)
2 Weeks ago
Nintendo - Student Help Internal Communications (m/f/d)

Nintendo

Frankfurt, Hessen, Germany (On-Site)
4 Months ago
Next Level Business Services - SAP C4C Functional consultant

Next Level Business Services

Charlotte, North Carolina, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Cold Symmetry - Character Animator

Cold Symmetry

(Remote)
2 Months ago
ByteDance - Senior Machine Learning Ops Engineer, ML System - Foundation Model

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Patreon - Site Reliability Engineer

Patreon

United States (Remote)
1 Week ago
PwC - IN_Manager_Data Migration Lead_Data & Analytics_Advisory_PAN India

PwC

Kolkata, West Bengal, India (On-Site)
4 Months ago
Tencent - Senior Backend Engineer for Global Realistic 3A Action Game

Tencent

Shenzhen, Guangdong Province, China (On-Site)
3 Months ago
Qatar Airways - DevOps Engineer

Qatar Airways

Ahmedabad, Gujarat, India (On-Site)
6 Months ago
Interactive Brokers - Technical Operations Specialist (TOPS)

Interactive Brokers

Greenwich, Connecticut, United States (Hybrid)
5 Months ago
NVIDIA - Senior Package Layout Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago
Zeta - Senior Site Reliability Engineer

Zeta

Hyderabad, Telangana, India (On-Site)
5 Months ago
PwC - AWS Data Engineer|Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Western Cape, South Africa

Nagarro - Associate Staff Engineer, BI Reporting

Nagarro

South Africa (On-Site)
5 Months ago
White Hat Gaming  - Games Coordinator

White Hat Gaming

Western Cape, South Africa (Hybrid)
6 Days ago
White Hat Gaming  - Fraud Analyst

White Hat Gaming

Western Cape, South Africa (Hybrid)
18 Hours ago
PwC - Workivia Implementation Specialist

PwC

Johannesburg, Gauteng, South Africa (On-Site)
5 Months ago
WebFX - Remote Copywriter: Legal

WebFX

South Africa (Remote)
5 Months ago
Nagarro - Senior Staff Engineer (Project Manager/Scrum Master)

Nagarro

Johannesburg, Gauteng, South Africa (Remote)
5 Months ago
Collective Ace Group - Experienced Community Manager (Remote - 1 year contract)

Collective Ace Group

South Africa (Remote)
2 Months ago
Sporty Group - SA Customer Success Associate

Sporty Group

Mpumalanga, South Africa (On-Site)
23 Hours ago
WebFX - Copywriter (Digital Marketing & B2B) (South Africa)

WebFX

South Africa (Remote)
5 Months ago
WebFX - Technical Digital Marketer (MARTECH Implementation) (Cape Town)

WebFX

Cape Town, Western Cape, South Africa (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Administrative Jobs

Evolution - Purchasing Coordinator

Evolution

Tbilisi, Tbilisi, Georgia (On-Site)
2 Weeks ago
Click Therapeutics - Senior IT Systems Administrator

Click Therapeutics

New York, New York, United States (On-Site)
4 Months ago
ION - DBA Administrator

ION

Italy (Hybrid)
5 Months ago
Milk Visual Effects - Systems Administrator

Milk Visual Effects

(On-Site)
4 Months ago
Tesla - Procurement Specialist

Tesla

Rhineland-Palatinate, Germany (Hybrid)
1 Month ago
Rackspace Technology - Support Technician II (Help Desk)

Rackspace Technology

India (Remote)
2 Days ago
IGT - Lottery Service Technician I

IGT

Virginia, United States (On-Site)
4 Months ago
Nintendo - Sr Bilingual Communications Coordinator - Japanese

Nintendo

Redmond, Washington, United States (Hybrid)
4 Months ago
Next Level Business Services - SAP AII / OER Lead

Next Level Business Services

Raritan, New Jersey, United States (On-Site)
5 Months ago
The Walt Disney Company - Sr IAM Platform Engineer

The Walt Disney Company

Orlando, Florida, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

White Hat Gaming is a state-of-the-art iGaming platform providing a secure, scalable and flexible modular Casino and Sportsbook Player Account Management solution.

Western Cape, South Africa (Hybrid)

(Remote)

Western Cape, South Africa (Hybrid)

Mosta, Malta (Hybrid)

View All Jobs

Get notified when new jobs are added by White Hat Gaming

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug