Network Operations Center – Level 2 Support Engineer

1 Month ago • 2 Years + • Administrative

Job Summary

Job Description

The Network Operations Center (NOC) Level 2 Support Engineer will monitor, diagnose, troubleshoot, and resolve incidents across multiple environments. Responsibilities include proactive monitoring of dashboards and Slack channels, escalating complex issues to SRE, DBAs, and Dev Support teams, participating in root cause analysis, and improving monitoring/alerting processes. The role requires strong communication and problem-solving skills, experience with Linux administration, networking technologies (TCP/IP, DNS, routing, firewalls), and cloud-based applications (AWS). Experience with Datadog and creating technical documentation is also essential. The successful candidate will work in shifts to provide 24/7 coverage and will collaborate with multiple teams to ensure smooth operations.
Must have:
  • 2+ years NOC experience
  • Excellent troubleshooting skills
  • Linux administration background
  • Networking understanding (TCP/IP, DNS)
  • AWS experience
  • Datadog experience
  • Excellent communication skills
Good to have:
  • Experience with Jenkins and CircleCI
  • Unix shell scripting
  • Windows administration
  • Postgres experience
  • Kubernetes experience

Job Details

Purpose of the Role

We are seeking a NOC Level 2 Support Engineer to work in our Cape Town office, who can apply their technical skills in a fast-paced and complex environment. The Level 2 NOC Engineer’s duties will include monitoring, diagnosing, troubleshooting, tracking, and documenting the multiple environments and day-to-day customer support and interactions.   


Strong communication skills are a particularly important requirement for this role. NOC Level 2 Support Engineers should enjoy working in a fast-paced environment where adaptability and flexibility will be key to their success. The successful candidate will be able to work independently and within groups working in shifts to cover 24/7 coverage.

Responsibilities:

  • Monitor Dashboards of key metrics, proactively detecting any possible incidents before they occur.
  • Proactive monitoring of Slack channels for issues raised both internally and externally.
  • Investigate, diagnose, troubleshoot, and resolve incidents where possible.
  • Escalate incidents that require additional expertise with SRE, DBA’s, Dev Support, etc, and work with them until the incident is resolved.
  • Working with Incident managers and other teams in war rooms for P1/2 issues to restore operations as quickly as possible.
  • Be involved in the root cause analysis of incidents and help with incident reports.
  • Adding and updating documentation for runbooks used to help troubleshoot and resolve incidents and share knowledge with the rest of the team.
  • Implement and improve processes for monitoring/alerting, systems maintenance, and escalation.
  • Helping and guiding the development of tooling used to troubleshoot and resolve issues to make NOC work more effectively.
  • Develop key dashboards for transparency of reporting uptime and other metrics as identified.

Requirements:

  • A minimum of 2 years’ experience working in a NOC team offering 24/7 critical support. 
  • Excellent troubleshooting and creative problem-solving abilities.
  • Background in Linux administration.
  • Good networking understanding (TCP/IP, DNS, routing, firewalls, etc.).
  • Good understanding of technologies such as Apache, Nginx, Databases, DNS servers, etc.
  • Experience with supporting Cloud-based applications – we use Amazon Web Services (AWS).
  • Experience in using monitoring systems and investigating issues at a log level – we use Datadog.
  • Experience coordinating and collaborating with multiple teams such as Helpdesk & SRE.
  • Excellent communication and interpersonal skills.
  • Ability to offer flexibility during peak times and critical projects for changing shift patterns.
  • Experience in creating technical documentation and reports. 
  • Readiness to offer training to colleagues when needed.

 

Advantageous:

  • Experience with Datadog Monitoring and Incident Management is a plus.
  • Experience maintaining continuous integration and delivery pipelines with tools such as Jenkins and CircleCI.
  • Scripting/programming knowledge of at least Unix shell scripting.
  • Background in Windows Administration.
  • Experience with Postgres.
  • Experience with Kubernetes.

 

Similar Jobs

Normalyze - Lead DevOps Engineer - Enterprise Cybersecurity - SaaS - Bay Area, CA

Normalyze

California, United States (Remote)
6 Months ago
ION - Technical Support Analyst, Toronto - 4363

ION

Toronto, Ontario, Canada (On-Site)
6 Months ago
Interactive Brokers - Software Developer - C++

Interactive Brokers

Greenwich, Connecticut, United States (On-Site)
6 Months ago
NVIDIA - Senior SRAM Engineer, Circuit Design

NVIDIA

Santa Clara, California, United States (Hybrid)
2 Months ago
ByteDance - Cloud Site Reliability Engineer

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
1920 - Production Coordinator for Commercials

1920

London, England, United Kingdom (Hybrid)
4 Months ago
Nintendo - Intern - IT Security

Nintendo

Redmond, Washington, United States (On-Site)
5 Months ago
Notion - Enterprise Technical Support, German, EMEA

Notion

Dublin, County Dublin, Ireland (On-Site)
6 Months ago
CloudHire - Salesforce Developer L5/6 (Vlocity)

CloudHire

India (Remote)
1 Month ago
The Walt Disney Company - Agent(e) de Sécurité F/H/NB - CDI

The Walt Disney Company

Île-de-France, France (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

OMP - Quality Assurance Test Engineer - Senior

OMP

Maharashtra, India (Hybrid)
7 Months ago
Epic Games - Senior DevOps Programmer

Epic Games

Cary, North Carolina, United States (On-Site)
2 Months ago
CloudLinux - Senior Python Developer with Security Expertise

CloudLinux

Sofia City Province, Bulgaria (Remote)
1 Month ago
prizepicks - Senior Back End Engineer

prizepicks

(Remote)
1 Month ago
Anavation - Systems Administrator (SME)

Anavation

Clarksburg, West Virginia, United States (Remote)
4 Weeks ago
Power Integrations - IT Support Manager (APAC)

Power Integrations

Penang, Malaysia (On-Site)
6 Months ago
Turtle Rock Studios - Senior Tools Engineer

Turtle Rock Studios

California, United States (Remote)
1 Month ago
ION - Senior Linux Systems Administrator - Trumbull, CT

ION

Trumbull, Connecticut, United States (Hybrid)
6 Months ago
Werplay - QA Engineer

Werplay

Islamabad, Islamabad Capital Territory, Pakistan (On-Site)
4 Months ago
Epic Games - Senior DevOps Programmer

Epic Games

London, England, United Kingdom (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Western Cape, South Africa

WebFX - Remote Copywriter: Legal

WebFX

South Africa (Remote)
6 Months ago
Nagarro - Senior Engineer, Frontend Angular2x

Nagarro

South Africa (On-Site)
6 Months ago
PwC - Workivia Implementation Specialist

PwC

Johannesburg, Gauteng, South Africa (On-Site)
6 Months ago
Collective Ace Group - Experienced Community Manager (Remote - 1 year contract)

Collective Ace Group

South Africa (Remote)
3 Months ago
Tesla - Employee Advisor

Tesla

Kokstad, KwaZulu-Natal, South Africa (On-Site)
2 Months ago
WebFX - Remote Copywriter: Technology & SaaS

WebFX

South Africa (Remote)
6 Months ago
WebFX - Remote Copywriter: Health

WebFX

South Africa (Remote)
6 Months ago
WebFX - Remote Copywriter: Finance/Investment/Money/Business

WebFX

South Africa (Remote)
6 Months ago
Nagarro - Associate Staff Engineer ,Fastapp developer

Nagarro

South Africa (On-Site)
6 Months ago
Nagarro - Senior Engineer, Mobile iOS

Nagarro

South Africa (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Administrative Jobs

PwC - Manager

PwC

Kolkata, West Bengal, India (On-Site)
6 Months ago
Canva - People Systems Engineer

Canva

Makati, Metro Manila, Philippines (Remote)
1 Month ago
Onward Search - Office Manager

Onward Search

Austin, Texas, United States (Hybrid)
1 Month ago
NVIDIA - Lab Manager - System Level Test Team

NVIDIA

Canada (On-Site)
1 Month ago
PlayerUnknown Productions - IT Manager (Part-Time)

PlayerUnknown Productions

Amsterdam, North Holland, Netherlands (Hybrid)
6 Months ago
Aristocrat Gaming - DB Developer

Aristocrat Gaming

Warsaw, Masovian Voivodeship, Poland (Hybrid)
1 Month ago
Funko - Order Entry Coordinator

Funko

London, England, United Kingdom (On-Site)
6 Months ago
DNEG - Tech Junior

DNEG

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
Tesla - Field Service Technician (Electrician) Industrial Storage / Supercharging

Tesla

Zagreb County, Croatia (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

White Hat Gaming is a state-of-the-art iGaming platform providing a secure, scalable and flexible modular Casino and Sportsbook Player Account Management solution.

View All Jobs

Get notified when new jobs are added by White Hat Gaming

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug