Senior GPU Compiler Software Development Engineer

3 Weeks ago • 3-5 Years • Research & Development

About the job

Job Description

This Senior GPU Compiler Software Development Engineer role focuses on enhancing OpenAI/Triton's GPU support. Responsibilities include developing new features, optimizing existing ones, collaborating with developers and customers, and conducting thorough testing using unit/component/functional tests. The work involves improving the performance of neural networks implemented with Triton on GPUs using ROCm. This position requires strong C/C++ skills, GPGPU computing experience (HIP, CUDA, OpenCL), and familiarity with compiler internals (LLVM, GCC). Performance analysis experience is a plus.
Must have:
  • Strong C/C++ programming skills
  • GPGPU computing experience (HIP, CUDA, OpenCL)
  • Compiler internals experience (LLVM, GCC)
  • Basic Python programming skills
  • Performance analysis experience
Good to have:
  • Understanding of ML technologies
  • Experience with PyTorch
  • LLVM and MLIR compiler infrastructure experience
  • ROCm infrastructure knowledge
  • CMake, make/ninja build system experience
  • GEMM performance fundamentals
  • Docker experience
Project description

Working on GPU support for OpenAI/Triton — a language and compiler for writing highly efficient custom Deep-Learning primitives. Work with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm.

Responsibilities

New features development, support and optimization of OpenAI/Triton project for GPUs. Communication with other developers, customers and project managers. Test implementation, project documentation and verification of system with unit/component/functional tests.

Skills

Must have

Strong C/C++ programming skills

Experience with GPGPU (General purpose GPU) computing (HIP, CUDA, OpenCL, etc.) or Experience with compiler internals (llvm, gcc or any other)

Basic Python programming skills

Experience in performance analysis

Nice to have

Basic understanding of ML technologies

Experience with PyTorch

Experience with LLVM and MLIR compiler infrastructure, analysis or optimizations implementation

Knowledge of ROCm infrastructure

Experience in CMake, make/ninja build system

GEMM performance fundamentals

Experience with Docker

Other

Languages

English: B2 Upper Intermediate

Seniority

Senior

View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Luxoft, a DXC Technology Company (NYSE: DXC), is a digital strategy and software engineering firm providing bespoke technology solutions that drive business change for customers the world over. Acquired by U.S. company DXC Technology in 2019, Luxoft is a global operation in 44 cities and 21 countries with an international, agile workforce of nearly 18,000 people. It combines a unique blend of engineering excellence and deep industry expertise, helping over 425 global clients innovate in the areas of automotive, financial services, travel and hospitality, healthcare, life sciences, media and telecommunications.

DXC Technology is a leading Fortune 500 IT services company which helps global companies run their mission critical systems. Together, DXC and Luxoft offer a differentiated customer-value proposition for digital transformation by combining Luxoft’s front-end digital capabilities with DXC’s expertise in IT modernization and integration. Follow our profile for regular updates and insights into technology and business needs.

Kraków, Lesser Poland Voivodeship, Poland (On-Site)

Wrocław, Lower Silesian Voivodeship, Poland (On-Site)

Ukrainka, Kyiv Oblast, Ukraine (Remote)

New Delhi, Delhi, India (Remote)

Iași, Iași County, Romania (On-Site)

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)

Bengaluru, Karnataka, India (On-Site)

Chennai, Tamil Nadu, India (On-Site)

United States (Remote)

View All Jobs

Get notified when new jobs are added by Luxoft

Similar Jobs

G5 Games - 2D Illustrator (Match-3 project)

G5 Games, Kazakhstan (Remote)

Playrix - Game Designer

Playrix, Portugal (Remote)

KPIT - CTO_ML/DL Data scientist

KPIT, India (On-Site)

Patterned Learning Career - Junior Android/iOS Developer

Patterned Learning Career, (Remote)

Meta - ASIC Engineer, Design Verification

Meta, United States (Remote)

Intel Corporation - GPU Firmware Development Engineer

Intel Corporation, United States (Hybrid)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Get notifed when new similar jobs are uploaded

Jobs in Italy, New York, United States

Mattel  Inc  - Warehouse Associate

Mattel Inc , United States (On_site)

Overwolf - Business Development Manager - Nitro

Overwolf, United States (Remote)

Barbaricum - Software Safety Engineer

Barbaricum, United States (Remote)

Magnopus - Previs Lead

Magnopus, United States (Hybrid)

LAIKA Studios - Visual Effects General Interest (SIGGRAPH 2024)

LAIKA Studios, United States (On-Site)

Axinous - Partner Solutions Architect - Americas

Axinous, United States (Hybrid)

Netflix - Program Manager, Talent Data Governance

Netflix, United States (On-Site)

 Sagecor Solutions - Full Stack Developer (BRT - 031)

Sagecor Solutions, United States (On-Site)

Zones - Sr. Service Delivery Manager

Zones, United States (Remote)

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Rivos - CPU Physical Design - Full time

Rivos, India (On-Site)

PlayStation Global - Senior Machine Learning Engineer, Anomaly Detection

PlayStation Global, United Kingdom (Hybrid)

Cadence - Lead Software Engineer, Synthesis

Cadence, United States (On-Site)

Social Discovery Group - Senior business analyst (R&D Growth)

Social Discovery Group, Serbia (Remote)

Nielsen Holdings - Principal Software Engineer - Windows\C++\.Net

Nielsen Holdings, India (Hybrid)

Avalara - Machine Learning Engineer

Avalara, India (Remote)

Meta - Sensor Hardware Electrical Engineer

Meta, United States (On-Site)

Get notifed when new similar jobs are uploaded