Senior GPU Kernel Developer

9 Months ago • 4-8 Years
Research Development

Job Description

Luxoft seeks a Senior GPU Kernel Developer proficient in HIP/ROCm to lead CUDA kernel porting to HIP. Responsibilities include collaborating with development teams to optimize GPU-accelerated applications, debugging, profiling, and fine-tuning code for performance improvements, and staying updated on GPU advancements. The ideal candidate possesses a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Optimization technique familiarity is highly desirable. The role involves porting CUDA kernels to HIP and collaborating on enhancing GPU-accelerated applications.
Good To Have:
  • Linux
  • CPU Intrinsics (AVX/SSE)
  • GPU Assembler
  • Python
  • AI/ML/DL/NN/NLP/Computer Vision
Must Have:
  • CUDA or HIP
  • GPU Programming (C/C++)
  • Parallel Programming
  • GPU Architecture Understanding
  • Optimization Techniques
  • Problem-solving skills
  • Collaboration

Add these skills to join the top 1% applicants for this job

computer-vision
python
linux
cuda

Project description

Luxoft is searching for talented developers with GPU compute and performance profiling experience to join the rapidly growing team.

We are seeking an experienced individual proficient in HIP / ROCm applications to join our team. The primary responsibility of this role will be to lead the effort in porting CUDA kernels to HIP. The candidate should possess a strong background in GPU computing, parallel programming, and a deep understanding of CUDA or HIP frameworks. Additionally, familiarity with optimization techniques is highly desirable.

Responsibilities

The main task will be to help port CUDA kernels on HIP

Collaborate with development teams to optimize and enhance GPU-accelerated applications.

Debug, profile, and fine-tune code for performance improvements.

Stay updated with the latest advancements in GPU architectures and programming models.

Skills

Must have

CUDA or HIP

GPGPU

C/C++

Python

One of AI/ML/DL/NN/NLP/Computer Vision

Mandatory Skills Description:

Proficiency with C++ and GPU Assembler

Proficiency in CUDA or HIP / ROCm programming

Solid understanding of GPU architectures, parallel programming models, and optimization techniques

Strong problem-solving skills and the ability to work in a collaborative environment

Nice to have

Linux

CPU Intrinsics (AVX/SSE)

GPU Assembler

Other

Languages

English: B2 Upper Intermediate

Seniority

Senior

Set alerts for more jobs like Senior GPU Kernel Developer
Set alerts for new jobs by Luxoft
Set alerts for new Research Development jobs in Mexico
Set alerts for new jobs in Mexico
Set alerts for Research Development (Remote) jobs
Contact Us
hello@outscal.com
Made in INDIA 💛💙