System Tools Engineer - AI Hardware

Rebellions

Job Summary

This role involves designing and developing monitoring and management tools for Rebellions NPU accelerators, utilizing libraries for GPU hardware control, and creating diagnostic tools for fault detection and performance analysis. The engineer will collaborate with hardware and driver teams, continuously optimizing tools. Key qualifications include a minimum of 6 years in Linux systems engineering, proficiency in Linux OS (CentOS, RHEL, Ubuntu, Debian), C/C++ for low-level programming, and Python for scripting, alongside strong communication and problem-solving skills.

Job Description

Responsibilities and Opportunities

  • Design and develop monitoring and management tools for Rebellions NPU accelerators
  • Utilize libraries to access and control GPU hardware features programmatically
  • Develop diagnostic tools for fault detection, performance analysis, and system reliability
  • Collaborate closely with hardware engineers, driver developers, and architects to ensure seamless integration between software tools and hardware components
  • Continuously benchmark and optimize monitoring and management tools to align with market demands

Key Qualifications

  • Minimum of 6 years of experience in Linux systems engineering
  • Proficiency in Linux operating systems such as CentOS, RHEL (Red Hat Enterprise Linux), Ubuntu, or Debian
  • Proficiency in C/C++ for low-level system programming; Python is also used for scripting and extending functionalities
  • Strong written and verbal communication skills, with the ability to deliver effective presentations
  • Excellent problem-solving and collaboration skills

Ideal Qualifications

  • Understanding of GPU internals, including memory management, clocking behavior, and power states
  • Experience in developing or using debugging tools for performance analysis or fault detection
  • Ability to quickly learn and adapt to new and emerging technologies
  • Ability to work across multiple teams and contribute to cross-functional collaboration

Application Process

  • Document screening - Online interview (including coding test) - Culture-fit interview - On-site interview - Compensation negotiation - Final acceptance
  • The selection process may vary by position and may change depending on schedule and circumstances.
  • You will be notified individually of the selection schedule and results at the email address you provided when applying.

Notes

  • This announcement may close early when recruitment is complete.
  • If the application contains any false information, the acceptance may be canceled.
  • Employment may be restricted if the legal qualifications required for recruitment and job performance are not met.
  • Being a veteran or a person with a disability does not cause any disadvantage in the hiring process.
  • The scope of duties may be changed in light of the candidate's overall career and experience. If such a change is necessary, it will be communicated to the candidate at an appropriate time before the final acceptance notification.
  • For inquiries regarding recruitment, please contact the email address below.
  • recruit@rebellions.ai
