AI Computing Software Engineering Intern, TensorRT

8 Months ago • Upto 1 Years
Research Development

Job Description

NVIDIA seeks excellent Software Engineering interns to join its rapidly growing research and development team focused on GPU-accelerated deep learning inference using TensorRT. Interns will develop highly optimized deep learning kernels, conduct performance analysis, collaborate on software and architecture improvements across various teams (automotive, image understanding, speech, LLMs), and work with deep learning frameworks like TensorFlow or PyTorch. The role involves performance modeling, profiling, debugging, and code optimization across CPU and GPU architectures. Strong C/C++ programming, software design skills, and a deep curiosity about AI are essential. This internship provides hands-on experience contributing to cutting-edge deep learning software used globally.
Good To Have:
  • Python experience
  • CPU/GPU architectural knowledge
Must Have:
  • Excellent C/C++ programming skills
  • Deep learning kernel development
  • Performance analysis & modeling
  • GPU programming (CUDA/OpenCL)
  • Experience with TensorFlow/PyTorch

Add these skills to join the top 1% applicants for this job

tensorflow
algorithms
deep-learning
python
pytorch
opencl
cuda
problem-solving
performance-analysis

We are rapidly growing our research and software development for Inference. We seek excellent Software Engineers and Interns to join our team. We specialize in developing GPU-accelerated Deep Learning software. Researchers around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in numerous areas. Join the team that builds software to enable new solutions. Collaborate with the deep learning community to implement the latest algorithms for public release in TensorRT.

What you’ll be doing:

  • Develop deeply optimized deep learning kernels for inference.

  • Conduct performance analysis and modeling to understand the performance limiter of current software stack as well as underlying hardware architecture.

  • Collaborate with different teams to improve both the software and architectures to extend the state of the art in performance, efficiency, reliability, and programmability.

  • Work with cross-collaborative teams across automotive, image understanding, speech understanding, and large language models to develop creative solutions.

What we need to see:

  • Pursuing BS or higher degree in Computer Engineering, Computer Science, Electrical Engineer, or related computing focused degree.

  • Excellent C/C++ programming and software designing skills, including debugging, performance analysis, and test design.

  • Python experience is a plus.

  • Performance modelling, profiling, debugging, and code optimization or architectural knowledge of CPU and GPU.

  • GPU programming experience (CUDA or OpenCL).

  • Experience working with deep learning frameworks like TensorFlow or PyTorch.

  • Strong curiosity about artificial intelligence, awareness of the latest developments in LLMs, generative, and recommender models.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking people in the world working for us. If you're creative and autonomous, we want to hear from you!

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Set alerts for more jobs like AI Computing Software Engineering Intern, TensorRT
Set alerts for new jobs by NVIDIA
Set alerts for new Research Development jobs in Taiwan
Set alerts for new jobs in Taiwan
Set alerts for Research Development (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙