Senior GPU Kernel Developer

1 Hour ago • 3-5 Years • Research & Development

About the job

Summary

Luxoft seeks a Senior GPU Kernel Developer to optimize HIP kernels on AMD GPUs. This role involves collaborating with development teams to enhance GPU-accelerated applications, debugging and profiling code for performance improvements, and staying current with advancements in GPU architectures. Responsibilities include optimizing HIP kernels for specific AMD hardware and contributing to the enhancement of GPU-accelerated applications. The ideal candidate possesses a strong background in GPGPU applications, parallel programming, and a deep understanding of CUDA or HIP frameworks. Experience with optimization techniques is highly desirable. The position is remote and based in Italy.
Must have:
  • CUDA or HIP
  • GPGPU programming
  • C/C++ (C++17 or later)
  • Python
  • AI/ML/DL/NN/NLP/Computer Vision experience
  • GPU architecture understanding
  • Parallel programming
  • Optimization techniques
Good to have:
  • Linux
  • CPU Intrinsics (AVX/SSE)
  • GPU Assembler
  • Profiling
  • gdb/LLDB
  • Jinja2 or similar templating engines
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.
Project description

Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team.
We are seeking an experienced individual proficient in GPGPU applications to join our team. The primary responsibility of this role will be to lead the effort in optimizing HIP kernels on AMD GPUs. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable.

Responsibilities

The main task will be to help optimize HIP kernels for specific AMD hardware

Collaborate with development teams to optimize and enhance GPU-accelerated applications.

Debug, profile, and fine-tune code for performance improvements.

Stay updated with the latest advancements in GPU architectures and programming models.

Skills

Must have

CUDA or HIP

GPGPU programming proficiency

C/C++ (C++17 or later)

Python

One of AI/ML/DL/NN/NLP/Computer Vision experience

Mandatory Skills Description:

Proficiency with C++ and low-level programming

Proficiency in CUDA or HIP / ROCm programming

Solid understanding of GPU architectures, parallel programming models, and optimization techniques

Strong problem-solving skills and the ability to work in a collaborative environment

Nice to have

Linux

CPU Intrinsics (AVX/SSE)

GPU Assembler

Profiling

gdb/LLDB

Jinja2 or similar templating engines

Other

Languages

English: B2 Upper Intermediate

Seniority

Senior

View Full Job Description

About The Company

Luxoft, a DXC Technology Company (NYSE: DXC), is a digital strategy and software engineering firm providing bespoke technology solutions that drive business change for customers the world over. Acquired by U.S. company DXC Technology in 2019, Luxoft is a global operation in 44 cities and 21 countries with an international, agile workforce of nearly 18,000 people. It combines a unique blend of engineering excellence and deep industry expertise, helping over 425 global clients innovate in the areas of automotive, financial services, travel and hospitality, healthcare, life sciences, media and telecommunications.

DXC Technology is a leading Fortune 500 IT services company which helps global companies run their mission critical systems. Together, DXC and Luxoft offer a differentiated customer-value proposition for digital transformation by combining Luxoft’s front-end digital capabilities with DXC’s expertise in IT modernization and integration. Follow our profile for regular updates and insights into technology and business needs.

View All Jobs

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug