Datacenter Hardware Operations Technician, AI Compute Infrastructure - Stargate

7 Hours ago • 7 Years + • $144,000 PA - $228,000 PA
Research Development

Job Description

OpenAI is building the world’s most advanced AI infrastructure ecosystem through its Stargate program, deploying massive data center campuses in partnership with industry leaders. We are seeking a senior datacenter hardware operations technician to coordinate physical hardware activities at a large partner-operated campus. This role involves collaborating with Oracle and their delivery teams to align compute requirements with day-to-day hardware work, focusing on technical alignment and shared problem-solving to ensure maintenance, repairs, and lifecycle activities support performance and reliability goals. The technician will also help develop standards and playbooks for future infrastructure projects.
Good To Have:
  • Familiarity with large-scale cluster management or monitoring tools (IPMI, BMC, Prometheus, Nagios).
  • Experience with GPU-accelerated compute clusters or other high-performance computing hardware.
  • Knowledge of Linux/Unix system administration and command-line diagnostic tools for hardware validation.
  • Industry certifications such as CompTIA Server+, OEM hardware certifications, or equivalent.
  • Experience applying Environmental Health and Safety best practices in mission-critical environments.
Must Have:
  • Serve as primary on-site hardware contact, collaborating with Oracle teams and vendors to plan and coordinate maintenance, repairs, and lifecycle activities.
  • Share technical requirements and verify that work performed supports compute needs and agreed quality targets.
  • Coordinate schedules, spare-parts planning, and issue escalation with partner teams to minimize downtime.
  • Work with fleet-health engineers to translate software-detected issues into on-site hardware actions.
  • Track hardware trends and provide joint recommendations with partner teams for design or operational improvements.
  • Prepare documentation and runbooks that capture joint best practices.
  • Offer technical guidance and context to partner personnel while respecting their operational ownership.
  • Collaborate with supply-chain teams to plan spares and manage hardware lifecycle activities.
  • 7+ years of experience in datacenter hardware operations, hardware engineering, or large-scale server maintenance, with at least 2 years in a senior or lead technician capacity.
  • Deep knowledge of high-density server hardware, including x86 platforms, GPUs, storage devices, and power/cooling systems.
  • Excel at diagnosing hardware issues, coordinating complex repairs, and maintaining strong working relationships across organizations.
  • Comfortable setting technical expectations and validating outcomes through collaboration, not direct management.
  • Adapt quickly to changing operational conditions and enjoy solving problems at both the strategic and on-site levels.
  • Communicate clearly and build trust across partner teams, vendors, and internal engineering stakeholders.
  • Willing to be based full-time at a partner-operated campus in Abilene, Texas 5 days per week.
Perks:
  • Equity
  • Performance-related bonus(es) for eligible employees
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents)
  • Paid medical and caregiver leave (up to 8 weeks)
  • Flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays
  • Multiple paid coordinated company office closures throughout the year for focus and recharge
  • Paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend
  • Daily meals in our offices, and meal delivery credits as eligible
  • Relocation support for eligible employees
  • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends

Add these skills to join the top 1% applicants for this job

excel
oracle
game-texts
linux
unix
prometheus
nagios
asana

About the Team

OpenAI, in close collaboration with our capital partners, is embarking on a journey to build the world’s most advanced AI infrastructure ecosystem. Our Stargate program develops and deploys massive, state-of-the-art data center campuses in partnership with industry leaders such as Oracle today—and through future infrastructure projects tomorrow. We design for scale, speed, and reliability, and we need experienced hardware professionals who can help ensure our high-density compute environment operates at peak performance.

About the Role

We are seeking a senior datacenter hardware operations technician to coordinate physical hardware activities at a large partner-operated campus. In this role you will work side-by-side with Oracle and their delivery teams, helping align our compute requirements with day-to-day hardware work on the ground. Rather than directing partner personnel, you will focus on collaboration, technical alignment, and shared problem solving, ensuring that maintenance, repairs, and lifecycle activities support the performance and reliability goals of both organizations. As the campus matures, you will help capture lessons learned and develop standards and playbooks to guide hardware operations at future infrastructure projects.

Candidates must be able to sit onsite in Abilene, Texas 5 days per week

In This Role You Will

  • Serve as primary on-site hardware contact, collaborating with Oracle teams and vendors to plan and coordinate maintenance, repairs, and lifecycle activities.
  • Share technical requirements and verify that work performed supports our compute needs and agreed quality targets.
  • Coordinate schedules, spare-parts planning, and issue escalation with partner teams to minimize downtime and keep operations running smoothly.
  • Work with fleet-health engineers to translate software-detected issues into on-site hardware actions in partnership with Oracle.
  • Track hardware trends and provide joint recommendations with partner teams for design or operational improvements.
  • Prepare documentation and runbooks that capture joint best practices and can be applied at additional campuses.
  • Offer technical guidance and context to partner personnel while respecting their operational ownership.
  • Collaborate with supply-chain teams to plan spares and manage hardware lifecycle activities.

You Might Thrive in This Role If You

  • Have 7+ years of experience in datacenter hardware operations, hardware engineering, or large-scale server maintenance, with at least 2 years in a senior or lead technician capacity.
  • Bring deep knowledge of high-density server hardware, including x86 platforms, GPUs, storage devices, and power/cooling systems.
  • Excel at diagnosing hardware issues, coordinating complex repairs, and maintaining strong working relationships across organizations.
  • Are comfortable setting technical expectations and validating outcomes through collaboration, not direct management.
  • Adapt quickly to changing operational conditions and enjoy solving problems at both the strategic and on-site levels.
  • Communicate clearly and build trust across partner teams, vendors, and internal engineering stakeholders.
  • Are willing to be based full-time at a partner-operated campus

Preferred Skills

  • Familiarity with large-scale cluster management or monitoring tools (IPMI, BMC, Prometheus, Nagios) to interpret alerts and coordinate partner responses.
  • Experience with GPU-accelerated compute clusters or other high-performance computing hardware.
  • Knowledge of Linux/Unix system administration and command-line diagnostic tools for hardware validation.
  • Industry certifications such as CompTIA Server+, OEM hardware certifications, or equivalent.
  • Experience applying Environmental Health and Safety best practices in mission-critical environments.

We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

For additional information, please see Affirmative Action and Equal Employment Opportunity Policy Statement.

Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.

To notify that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.

Global Applicant Privacy Policy

At , we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Set alerts for more jobs like Datacenter Hardware Operations Technician, AI Compute Infrastructure - Stargate
Set alerts for new jobs by OpenAI
Set alerts for new Research Development jobs in United States
Set alerts for new jobs in United States
Set alerts for Research Development (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙