Senior GPU Compiler Software Development Engineer

9 Hours ago • 3-5 Years • Research & Development

About the job

Summary

This Senior GPU Compiler Software Development Engineer role focuses on enhancing OpenAI/Triton's GPU support. Responsibilities include developing new features, optimizing existing ones, collaborating with developers and stakeholders, implementing tests, and documenting the system. The work involves improving neural network performance on GPUs using ROCm. The position requires strong C/C++ skills, GPGPU experience (HIP, CUDA, OpenCL), and basic Python knowledge. Performance analysis experience is also needed. The role is remote and based in Turkey.
Must have:
  • Strong C/C++ programming
  • GPGPU computing experience (HIP, CUDA, OpenCL)
  • Compiler internals experience (LLVM, GCC)
  • Basic Python programming
  • Performance analysis
Good to have:
  • Understanding of ML technologies
  • Experience with PyTorch
  • LLVM & MLIR compiler infrastructure knowledge
  • ROCm infrastructure knowledge
  • CMake, make/ninja build system experience
  • GEMM performance fundamentals
  • Docker experience
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.
Project description

Working on GPU support for OpenAI/Triton — a language and compiler for writing highly efficient custom Deep-Learning primitives. Work with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.

Responsibilities

New features development, support and optimization of OpenAI/Triton project for GPUs. Communication with other developers, customers and project managers. Test implementation, project documentation and verification of system with unit/component/functional tests.

Skills

Must have

Strong C/C++ programming skills

Experience with GPGPU (General purpose GPU) computing (HIP, CUDA, OpenCL, etc.) or Experience with compiler internals (llvm, gcc or any other)

Basic Python programming skills

Experience in performance analysis

Nice to have

Basic understanding of ML technologies

Experience with PyTorch

Experience with LLVM and MLIR compiler infrastructure, analysis or optimizations implementation

Knowledge of ROCm infrastructure

Experience in CMake, make/ninja build system

GEMM performance fundamentals

Experience with Docker

Other

Languages

English: B2 Upper Intermediate

Seniority

Senior

View Full Job Description

About The Company

Luxoft, a DXC Technology Company (NYSE: DXC), is a digital strategy and software engineering firm providing bespoke technology solutions that drive business change for customers the world over. Acquired by U.S. company DXC Technology in 2019, Luxoft is a global operation in 44 cities and 21 countries with an international, agile workforce of nearly 18,000 people. It combines a unique blend of engineering excellence and deep industry expertise, helping over 425 global clients innovate in the areas of automotive, financial services, travel and hospitality, healthcare, life sciences, media and telecommunications.

DXC Technology is a leading Fortune 500 IT services company which helps global companies run their mission critical systems. Together, DXC and Luxoft offer a differentiated customer-value proposition for digital transformation by combining Luxoft’s front-end digital capabilities with DXC’s expertise in IT modernization and integration. Follow our profile for regular updates and insights into technology and business needs.

View All Jobs

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug