Engineer, Staff-Machine Learning-Tools
Qualcomm
Job Summary
As a Qualcomm Software Engineer in the Generative AI team, you will design, develop, and validate embedded and cloud edge software, focusing on integrating cutting-edge GenAI models on Qualcomm chipsets. This involves spearheading the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK, pushing performance limits from large models, and deploying C/C++ software stacks. The role requires expertise in GenAI advancements, LLMs/Transformers, and edge-based GenAI deployment to run models at high speeds with low power consumption.
Must Have
- Spearhead development and commercialization of Qualcomm AI Runtime (QAIRT) SDK.
- Push limits of performance from large models.
- Deploy large C/C++ software stacks using best practices.
- Stay on cutting edge of GenAI advancements.
- Understand LLMs/Transformers and edge-based GenAI deployment.
- Strong understanding of Generative AI models (LLM, LVM, LMMs, building blocks).
- Knowledge of Floating-point, Fixed-point representations and Quantization concepts.
- Experience optimizing algorithms for AI hardware accelerators (CPU/GPU/NPU).
- Strong C/C++ programming, Design Patterns, and OS concepts.
- Good scripting skills in Python.
- Excellent analytical and debugging skills.
- Good communication and collaboration skills.
Good to Have
- Strong understanding of SIMD processor architecture and system design.
- Proficiency in object-oriented software development.
- Familiarity with Linux and Windows environment.
- Strong background in kernel development for SIMD architectures.
- Familiarity with frameworks like llama.cpp, MLX, and MLC.
- Good knowledge of PyTorch, TFLite, and ONNX Runtime.
- Experience with parallel computing systems and languages like OpenCL and CUDA.
Perks & Benefits
- World-class health benefit options providing world-class coverage.
- Programs to help employees build and prepare for a financially secure future.
- Self and family resources to build emotional/mental strength and resilience.
- Wellbeing programs and resources to help employees Live+Well and Work+Well.
- Continuous learning and development programs.
- Tuition reimbursement.
- Mentorships.
Job Description
Job Posting Date
2026-01-04
Company:
Qualcomm India Private Limited
Job Area:
Engineering Group, Engineering Group > Software Engineering
General Summary:
As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Software Engineer, you will design, develop, create, modify, and validate embedded and cloud edge software, applications, and/or specialized utility programs that launch cutting-edge, world class products that meet and exceed customer needs. Qualcomm Software Engineers collaborate with systems, hardware, architecture, test engineers, and other teams to design system-level software solutions and obtain information on performance requirements and interfaces.
Minimum Qualifications:
• Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience.
OR
• Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience.
OR
• PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
• 2+ years of work experience with Programming Language such as C, C++, Java, Python, etc.
Join the exciting Generative AI team at Qualcomm focused on integrating cutting edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power efficient hardware and Software stack to run Large Language Models (LLMs) and Large Vision Models (LVM) at near GPU speeds!
Responsibilities:
In this role, you will spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs. As an AI inferencing expert, you'll push the limits of performance from large models. Your mastery in deploying large C/C++ software stacks using best practices will be essential. You'll stay on the cutting edge of GenAI advancements, understanding LLMs/Transformers and the nuances of edge-based GenAI deployment. Most importantly, your passion for the role of edge in AI's evolution will be your driving force.
Requirements:
Master’s/Bachelor’s degree in computer science or equivalent.
6+ years of relevant work experience in software development.
Strong understanding of Generative AI models – LLM, LVM, LMMs and building blocks (self-attention, cross attention, kv caching etc.)
Floating-point, Fixed-point representations and Quantization concepts.
Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU).
Strong in C/C++ programming, Design Patterns and OS concepts.
Good scripting skills in Python.
Excellent analytical and debugging skills.
Good communication skills (verbal, presentation, written).
Ability to collaborate across a globally diverse team and multiple interests.
Preferred Qualifications
· Strong understanding of SIMD processor architecture and system design.
· Proficiency in object-oriented software development and familiarity
· Familiarity with Linux and Windows environment
· Strong background in kernel development for SIMD architectures.
· Familiarity with frameworks like llama.cpp, MLX, and MLC is a plus.
· Good knowledge of PyTorch, TFLite, and ONNX Runtime is preferred.
· Experience with parallel computing systems and languages like OpenCL and CUDA is a plus.