NVIDIA's Silicon Solutions Group is seeking a full-stack developer with AI/LLM expertise to help integrate AI into its data analysis and automation infrastructure. The solutions developed will support multiple critical large-scale automation initiatives. In this role, you will lead strategies and design of AI solutions to improve the efficiency of our existing and new automation workflows. The ideal candidate will combine technical expertise with hands-on experience to drive all AI planning, design, and implementation aspects. At NVIDIA, we strive for excellence, encourage innovation, and provide opportunities to explore new ways to succeed!
What You'll Be Doing:
Study and develop groundbreaking techniques in deep learning, graphs, machine learning, and data analytics, and perform in-depth analysis.
Collaborate with developers and cross-functional teams to identify current and emerging challenges.
Design and implement end-to-end generative AI solutions, specializing in Large Language Model (LLM) training, efficient deployment strategies, and sophisticated Retrieval-Augmented Generation (RAG) workflows.
What We Need to See:
MS (or equivalent experience) with 6+ years of software development; 2+ years of relevant work experience developing and deploying AI solutions
Proven full-stack development experience with a focus on improving application performance and user experience
Proficiency in Python, C++ programming, and Deep Learning frameworks
Ability to work independently and as part of a team
Motivated self-starter with strong analytical and debugging skills
Ability to balance multiple simultaneous projects
Excellent verbal and written communication skills
Ways to Stand Out from the Crowd:
Experience with CUDA programming, and with benchmarking and analyzing the performance of agentic AI systems
Expertise in training, fine-tuning, and evaluating LLMs using popular frameworks such as TensorFlow or PyTorch
Proficiency in model deployment and optimization techniques for efficient inference on various hardware platforms
Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM
NVIDIA is widely considered to be one of the world's most desirable employers in the technology field. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
#LI-Hybrid
The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.