Quality & Risk Manager

NSCALE

Job Summary

The Quality & Risk Manager at Nscale is responsible for establishing and operating the company's risk and quality assurance framework across all enterprise and business-as-usual GPU and data-centre deployments. This role provides independent oversight, structured challenge, and practical support to ensure complex programmes are delivered safely, predictably, and to defined standards. It involves owning the enterprise risk framework, managing portfolio-level risks, defining quality standards, and supporting project teams, while ensuring compliance and reporting on risk and quality performance.

Must Have

  • Own and continuously improve the enterprise risk management framework.
  • Define and maintain standardised risk processes, templates, thresholds, and escalation paths.
  • Facilitate structured risk identification workshops and maintain live risk registers.
  • Maintain a consolidated, portfolio-level view of risks across all deployments.
  • Identify cross-programme and systemic risks and escalate critical items.
  • Define and own quality standards covering design, build, testing, commissioning, and handover.
  • Establish checklists, acceptance criteria, and documentation requirements.
  • Plan and conduct independent assurance reviews at key delivery gates.
  • Coach project teams on the practical application of risk and quality standards.
  • Support PMs in structuring effective risk registers and mitigation plans.
  • Lead or support root-cause analysis when issues occur.
  • Monitor adherence to defined methods, standards, and risk processes.
  • Track exceptions, trends, and recurring issues and report them.
  • Define, track, and report KPIs related to risk and quality performance.
  • 7+ years of experience in quality assurance, risk management, programme assurance, or PMO roles within large-scale infrastructure, data-centre, or technology environments.
  • Strong understanding of programme and portfolio risk management frameworks.
  • Experience defining and enforcing quality standards across complex engineering or infrastructure programmes.

Good to Have

  • Experience in GPU, data-centre, cloud infrastructure, or high-availability environments.
  • Relevant certifications (e.g. PRINCE2, MSP, PMI-RMP, ISO, or similar).

Job Description

About Nscale

Nscale is the GPU cloud engineered for AI. We deliver cost-effective, high-performance infrastructure that enables AI-first companies to scale rapidly while reducing complexity across design, build, and operations. Our platform supports strategic business outcomes across performance, cost efficiency, and sustainability.

We operate with a culture of ownership, accountability, and relentless improvement. As an Nscaler, you’ll work alongside high-performing teams to build the systems, processes, and infrastructure that power the future of AI at scale.

Role Overview

The Quality & Risk Manager is responsible for establishing and operating Nscale’s risk and quality assurance framework across all enterprise and business-as-usual (BAU) deployments. This role provides independent oversight, structured challenge, and practical support to ensure that complex GPU and data-centre programmes are delivered safely, predictably, and to defined standards.

The role sits at the intersection of programme delivery, engineering, and governance. The successful candidate will bring strong risk management discipline, an eye for quality, and the credibility to work with senior PMs, architects, and engineers while escalating issues decisively when required.

Key Responsibilities

Enterprise Risk Framework Ownership

  • Own and continuously improve the enterprise risk management framework across all GPU and data-centre deployments.
  • Define and maintain standardised risk processes, templates, thresholds, and escalation paths for Enterprise and BAU programmes.
  • Facilitate structured risk identification workshops and ensure every programme maintains a live risk register with clear ownership and mitigations.

Portfolio-Level Risk Management

  • Maintain a consolidated, portfolio-level view of risks across all deployments, including capacity, supply chain, power, fabric, multi-DC dependencies, and regulatory considerations.
  • Identify cross-programme and systemic risks and escalate critical items to senior leadership.
  • Support leadership decision-making by providing clear risk insights, mitigation options, and contingency scenarios.

Quality Assurance & Project Assurance

  • Define and own quality standards covering design, build, testing, commissioning, and handover.
  • Establish checklists, acceptance criteria, and documentation requirements aligned to Nscale’s reference architectures (e.g. 10K / 20K / 50K GPU designs).
  • Plan and conduct independent assurance reviews at key delivery gates, including design freeze, pre-build, pre-go-live, and post-implementation.

Support to PMs, Architects & Engineering Teams

  • Coach project teams on the practical application of risk and quality standards.
  • Support PMs in structuring effective risk registers and mitigation plans.
  • Lead or support root-cause analysis when issues occur, ensuring corrective and preventive actions are clearly defined.
  • Ensure lessons learned are systematically captured and fed back into standards, playbooks, and reference designs.

Compliance, Monitoring & Reporting

  • Monitor adherence to defined methods, standards, and risk processes across all programmes.
  • Track exceptions, trends, and recurring issues and report them to the Head of Enterprise Deployments.
  • Define, track, and report KPIs related to risk and quality performance (e.g. unmanaged high risks, assurance findings, defect rates, audit outcomes).

Key Deliverables

  • Risk Management Framework & Playbook - Documented processes, RACI, templates (risk registers, impact/likelihood matrices, escalation paths) tailored to GPU and data-centre deployments.
  • Programme & Portfolio Risk Register - Up-to-date risk logs for each major deployment and a consolidated portfolio-level risk dashboard for leadership review.
  • Quality Management Plan & Standards- Defined quality criteria for designs, engineering work packs, test plans, and handover documentation, aligned to Nscale reference architectures.
  • Assurance Review Reports - Formal outputs from design reviews, readiness reviews, and post-implementation reviews, including findings, risk ratings, and agreed actions.
  • Compliance & Audit Evidence - Clear evidence of alignment with internal policies, safety and regulatory requirements, and any external standards adopted by Nscale.
  • Lessons Learned & Continuous Improvement Log - Structured records of issues, root causes, and improvements feeding back into updated standards, playbooks, and training for PMO and Engineering teams.

Qualifications & Experience

  • 7+ years of experience in quality assurance, risk management, programme assurance, or PMO roles within large-scale infrastructure, data-centre, or technology environments.
  • Strong understanding of programme and portfolio risk management frameworks.
  • Experience defining and enforcing quality standards across complex engineering or infrastructure programmes.
  • Comfortable operating across multiple concurrent programmes with senior stakeholders.
  • Strong analytical, facilitation, and communication skills, with the ability to challenge constructively and escalate when necessary.
  • Experience in GPU, data-centre, cloud infrastructure, or high-availability environments strongly preferred.
  • Relevant certifications (e.g. PRINCE2, MSP, PMI-RMP, ISO, or similar) are a plus.

Inclusion & Accessibility

At Nscale, we are committed to fostering an inclusive, diverse, and equitable workplace. We believe that a variety of perspectives enriches our work environment, and we encourage applications from candidates of all backgrounds, experiences, and abilities. If there’s anything we can do to accommodate your specific situation, please let us know.

5 Skills Required For This Role

Team Management Communication Risk Management Game Texts Quality Control

Similar Jobs