AI Frameworks Software Engineer – Model Compression Algorithm
Intel
Job Summary
The Intel Neural Compressor team is seeking a highly motivated software engineer to develop the Intel Neural Compressor product and related tools for Intel AI platforms (CPU, GPU, AI Accelerator). The role involves researching and implementing quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models, as well as exploring cutting-edge directions in efficient model deployment and inference/finetuning acceleration.
Must Have
- Major in computer science or related subject
- Solid understanding of deep learning, deep learning framework and large language model (LLM) fundamentals
- Familiarity with model compression techniques such as quantization and pruning
- Proficiency in Python or other programming languages commonly used for deep learning development
- Passion to work as a professional software engineer and team player
- Strong self-motivation and problem-solving skill
- Good English oral and written skill
Good to Have
- Passion for technological innovation and practical engineering, with a drive for continuous exploration and improvement
- Experience in model fine-tuning, inference optimization or related tool development
Job Description
Job Details:
Job Description:
Intel Neural Compressor team is looking for a highly motivated talend to join us
Responsibilities includes:
• Develope Intel Neural Compressor product and related tools to support Intel AI platform, including CPU, GPU and AI Accelerator
• Research and implement quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models
• Explore cutting-edge directions in efficient model deployment and inference/finetuing acceleration.
Qualifications:
Qualifications:
• Major in computer science or related subject
• Solid understanding of deep learning, deep learning framework and large language model (LLLM) fundamentals
• Familiarity with model compression techniques such as quantization and pruning
• Proficiency in Python or other programming languages commonly used for deep learning development
• Passion to work as a professional software engineer and team player
• Strong self-motivation and problem-solving skill
• Good English oral and written skill
Preferred Qualifications
• Passion for technological innovation and practical engineering, with a drive for continuous exploration and improvement
• Experience in model fine-tuning, inference optimization or related tool development is a plus
Job Type:
College Grad
Shift:
Shift 1 (China)
Additional Locations:
Business group:
The Sales and Marketing Group (SMG) leverages the product portfolio to drive Intel's revenue growth and market expansion, blending strategic initiatives with dynamic sales efforts to capture and retain customers. SMG is responsible for empowering the sales force with tools and insights needed to close deals and build lasting customer relationships. Sales analytics and market research ensure strategies are both targeted and impactful. In SMG, disciplined execution, creativity, and ambition are celebrated, providing ample opportunities for career advancement and skill development.
Posting Statement:
All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.
Position of Trust
N/A
Work Model for this Role
This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.