Level 2 Engineer

1 Year ago • 5 Years +

Job Summary

Job Description

As a Level 2 Engineer at Thales, you will lead and coordinate level 2 support operations for mission-critical applications and infrastructure. Your responsibilities include providing troubleshooting and diagnostics for incidents, ensuring adherence to service level agreements (SLAs), and managing incidents as an incident manager for P1/P2 issues. You will also perform root cause analysis, recommend fixes, and manage change and patch management processes. Additionally, you will be responsible for documentation, compliance, configuration management, testing, verification, and knowledge management related to the supported systems. The role requires strong technical skills, leadership, and excellent communication.
Must have:
  • Lead and coordinate level 2 support operations
  • Troubleshoot and diagnose incidents
  • Manage incidents and perform root cause analysis
  • Perform operational impact assessment
  • Perform patch management readiness
  • Maintain readiness of operational documentation
  • Ensure operational readiness testing
Good to have:
  • Experience with RHEL, Windows Server, and Kubernetes
  • Knowledge of networking fundamentals
  • Experience with middleware and infrastructure (Nginx, Kubernetes)
  • Knowledge of message queues like IBM MQ and Kafka
  • Experience with databases such as SQL Server and PostgreSQL
  • Knowledge of ITIL/ITSM Process
  • Security Awareness

Job Details

Location: Singapore, Singapore

Thales people architect solutions at the heart of the defence-security continuum. Interoperable and secure information and telecommunications systems for defence, security, and civil operators, are based upon innovative use of radiocommunications, networks, and cybersecurity. We are ground breaking new digital technologies such as 4G mobile communications, cryptography, cloud computing and big data for use in physical protection systems, and critical information systems.

Thales established its presence in Singapore in 1973 to support the expansion of aerospace-related activities in the Asia-Pacific region. Throughout the last four decades, the company grew from strength to strength and is today involved in the primary businesses of Aerospace (including Air Traffic Management), Defence & Security, Ground Transportation and Digital Identity & Security. Thales today employs over 2,100 people in Singapore across all its business areas.

KEY ACTIVITIES AND RESPONSIBILITIES

As a Level 2 Engineer, you are accountable for:

Operational Support

  • Lead and coordinate level 2 support operations for mission-critical applications and infrastructure
  • Provide troubleshooting and diagnostics for incidents escalated from level 1
  • Ensure adherence to SLA, system availability

Incident & Problem Management

  • Act as incident manager for P1/P2 issues
  • Coordinate resolution and communications
  • Perform root cause analysis and recommend permanent fixes
  • Escalate unresolved issues that required software coding to Level 3 or engineering teams

Change Management

  • Perform operational impact assessment
  • Part of the CAB to review and approve change
  • Pre-Change Preparation such as review Change Request and Release Plan
  • Supervise post-change production verification
  • Documentation update and knowledge transfer
  • Post change review and feedback

Patch Management

  • Perform patch management readiness
  • Stakeholder coordination and team coordination
  • System Readiness and Post-Patch Validation
  • Documentation update and knowledge transfer
  • Compliance and audit readiness

Documentation and Compliance

  • Operational documentation. SOPs, Incident response checklist, RCA, PIR, monitoring and alert guidebook
  • Configuration & Infrastructure Documentation. System configuration baseline, application dependency maps, environment inventories such as hosts, services, accounts
  • Knowledge Base Articles for level 2 enablement and faster resolution e.g. Known Errors and Fixes, Frequent How-To Guides, Script Repositories, Lessons Learned
  • Knowledge Management

Configuration Management

  • Perform validation and accuracy of configurations
  • Maintain readiness of operational documentation
  • Perform audit to confirm compliance of configurations
  • CMDB asset verification
  • Change-linked configuration tracking
  • Ensure environment consistency between DEV – IVVQ – ISO-PROD – UAT and PROD

Testing and Verification

  • Ensure operational readiness testing before production deployment rollout
  • Ensure post-change verification coordination
  • Perform regression and sanity test following patching or upgrades, in UAT and PROD
  • Participation in user acceptance testing

Knowledge Management

  • Documentation of resolution
  • Knowledge Base Contribution
  • Validation of knowledge
  • Subject Matter Expertise Sharing

Root Cause Analysis

  • Gather logs, system metrics at the time of failure
  • Reproduction of issues in a controlled environment to understand the conditions under which it occurs
  • Determine the scope and severity in terms of the systems affected, downtime duration and business impact
  • Narrow down the possible sources of causing the failure
  • Use of diagnostic tools such to analyse the application behaviour
  • Correlation of events to sequence the chain of events leading up to the failure and identify the dependencies

KAST (Kubernetes Analytics Stack)

  • THALES proprietary Kubernetes-based platform that provides a foundational digital infrastructure across Thales business domain

Kubernetes

  • Kubernetes is an open-source platform developed by Google for automating the deployment, scaling, and management of containerized applications (typically Docker containers).

Docker

  • Docker Compose is a tool for defining and running multi-container Docker applications using a single configuration file (docker-compose.yml). It allows you to define, manage, and run multiple interconnected Docker containers as a single service stack.

Kafka

  • Apache Kafka is a high-performance distributed streaming platform used for building real-time data pipelines, stream processing, and event-driven architectures.

EMQX

  • EMQX is an MQTT broker that acts as a message middleware between publishers (e.g., sensors, devices) and subscribers (e.g., apps, dashboards, databases) using the MQTT protocol, which is a lightweight publish-subscribe messaging protocol ideal for low-bandwidth, high-latency, or constrained devices.

Elasticsearch

  • Elasticsearch is a distributed, open-source search and analytics engine built on top of Apache Lucene. It is widely used for full-text search, log and event data analysis, and real-time data exploration.

MinIO

  • MinIO is a high-performance, distributed object storage system that stores data as objects (like files, images, videos, backups) in buckets

Zookeeper

  • Apache ZooKeeper is an open-source coordination service for distributed applications. It provides a highly reliable, consistent, and available mechanism to store metadata, configuration, and state information. It complements Apache Kafka by acting as a metadata management and coordination layer in Kafka’s traditional architecture. ZooKeeper ensures reliability, consistency, and fault-tolerance in Kafka’s distributed setup.

Sparks

  • Apache Spark is an open-source, distributed computing system designed for fast, large-scale data processing. It was built for performance, especially for iterative algorithms in data science and machine learning.

RHEL

  • RHEL is a certified Linux operating system optimized for reliability, scalability, and security in business and production environments.

Ansible

  • Ansible is an open-source IT automation tool developed by Red Hat that simplifies the management of servers, applications, and infrastructure. It allows DevOps and system administrators to automate tasks such as configuration management, software deployment, and orchestration. It uses simple, human-readable YAML files (called playbooks) and SSH

Prometheus

  • Open-source monitoring and alerting toolkit that is used to collect, store and query metrics, for the monitoring of infrastructure, services, containers and microservices

Grafana

  • Open-source analytics and visualization platform used for monitoring, observability, and alerting. Commonly used with Prometheus


KEY KNOWLEDGE AND EXPERIENCE

To be successful in your role, you will have demonstrated and/or acquired the following knowledge and experience:

Education and Experience

  • Bachelor Degree in Information Technology, Computer Science, Engineering, or a closely related discipline
  • At least 5 years in Level 2 support for mission critical 24x7 production support, preferably in public sector
  • At least 2 years in a team lead or supervisory role, coordinating tasks and mentoring junior engineers
  • Proven experience in handling P1/P2 incidents, managing post-incident reviews (PIRs) and root cause analysis
  • Preferably certification in Red Hat Enterprise Linux or Kubernetes

Knowledge / Skills

  • Operating Systems. RHEL (90%) and Windows Server (10%)
  • Networking Fundamentals
  • Middleware & Infrastructure (Web Server – Nginx, App Servers – Kubernetes with containers (Docker + Spring Boot)
  • Message Queues (IBM MQ, Kafka)
  • Database (SQL Server, PostgreSQL)
  • ITIL/ITSM Process Knowledge
  • Security Awareness
  • DR and HA concepts
  • Strong Technical Skills
  • Leadership & Coordination
  • Communication & Collaboration
  • Operational Governance

At Thales we provide CAREERS and not only jobs. With Thales employing 80,000 employees in 68 countries our mobility policy enables thousands of employees each year to develop their careers at home and abroad, in their existing areas of expertise or by branching out into new fields. Together we believe that embracing flexibility is a smarter way of working. Great journeys start here, apply now!

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Singapore

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Mulwala, New South Wales, Australia (On-Site)

Eagle Farm, Queensland, Australia (On-Site)

Singapore (Hybrid)

Limours, Île-de-France, France (On-Site)

Moirans, Auvergne-Rhône-Alpes, France (On-Site)

Châtellerault, Nouvelle-Aquitaine, France (On-Site)

La Ferté-Saint-Aubin, Centre-Val De Loire, France (On-Site)

Limours, Île-de-France, France (On-Site)

View All Jobs

Get notified when new jobs are added by Thales

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug