Infrastructure Support Engineer

23 Minutes ago • All levels
IT & Infrastructure

Job Description

Nscale is seeking an Infrastructure Support Engineer to ensure the smooth operation, monitoring, and support of our global AI datacenter infrastructure. This role involves providing advanced technical support, troubleshooting, break-fix, and maintenance of high-performance datacenter systems, including GPUs, networking, cooling, and storage technologies. The engineer will be critical in guaranteeing uptime, performance, and reliability, collaborating with engineering, operations, and vendor partners to meet SLAs.
Must Have:
  • Provide support and troubleshooting for GPU datacenter infrastructure, including hardware, networking, and storage
  • Monitor system health, performance, and capacity across datacenter sites
  • Respond to infrastructure incidents, perform root cause analysis, and implement preventative measures
  • Support deployment, configuration, and scaling of new infrastructure hardware and systems
  • Ensure infrastructure systems meet compliance, security, and operational standards
  • Experience supporting datacenter infrastructure, including servers, network hardware, and storage systems
  • Strong troubleshooting skills with the ability to diagnose and resolve complex hardware and infrastructure issues
  • Working knowledge of Linux system administration
  • Familiarity with networking protocols, topologies, and troubleshooting (e.g., TCP/IP, BGP, VLANs)
  • Understanding of datacenter cooling, power distribution, and resiliency best practices
  • Experience working in large-scale cloud or datacenter environments
  • Strong communication skills and ability to collaborate across global teams

Add these skills to join the top 1% applicants for this job

communication
problem-solving
cost-management
game-texts
networking
linux

About Nscale

Nscale is the GPU cloud engineered for AI. We provide cost-effective, high-performance infrastructure for AI start-ups and large enterprise customers. Nscale enables AI-focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.

We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future.

About the Role (Job Purpose)

Nscale is seeking an Infrastructure Support Engineer to ensure the smooth operation, monitoring, and support of our global AI datacenter infrastructure. You will be responsible for providing advanced technical support, troubleshooting, break-fix and maintenance of high-performance datacenter systems that include GPUs, networking, cooling, and storage technologies.

This role is critical to guaranteeing uptime, performance, and reliability of our AI infrastructure and requires strong collaboration with engineering, operations, and vendor partners in order to meet our SLAs.

What You'll be Doing (Responsibilities)

  • Provide support and troubleshooting for GPU datacenter infrastructure, including hardware, networking, and storage.
  • Monitor system health, performance, and capacity across datacenter sites.
  • Respond to infrastructure incidents, perform root cause analysis, and implement preventative measures.
  • Collaborate with engineering and operations teams to optimize datacenter performance and uptime.
  • Support deployment, configuration, and scaling of new infrastructure hardware and systems.
  • Ensure infrastructure systems meet compliance, security, and operational standards.
  • Create and maintain detailed documentation of processes, runbooks, and escalation paths.
  • Work closely with vendors and partners to resolve hardware/software issues and coordinate support.
  • Participate in on-call rotations to provide 24/7 coverage for critical infrastructure incidents.

The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.

About You (Skills / Qualifications / Experience)

  • Experience supporting datacenter infrastructure, including servers, network hardware, and storage systems.
  • Strong troubleshooting skills with the ability to diagnose and resolve complex hardware and infrastructure issues.
  • Working knowledge of Linux system administration
  • Familiarity with networking protocols, topologies, and troubleshooting (e.g., TCP/IP, BGP, VLANs).
  • Understanding of datacenter cooling, power distribution, and resiliency best practices.
  • Experience working in large-scale cloud or datacenter environments.
  • Strong communication skills and ability to collaborate across global teams

Equal Opportunities Statement

At NScale, we are committed to fostering an inclusive, diverse, and equitable workplace. We believe that a variety of perspectives enriches our work environment, and we encourage applications from candidates of all backgrounds, experiences, and abilities. We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio-economic backgrounds.

If there’s anything we can do to accommodate your specific situation, please let us know.

Set alerts for more jobs like Infrastructure Support Engineer
Set alerts for new jobs by NSCALE
Set alerts for new IT & Infrastructure jobs in United Kingdom
Set alerts for new jobs in United Kingdom
Set alerts for IT & Infrastructure (Remote) jobs
Contact Us
hello@outscal.com
Made in INDIA 💛💙