AI Frameworks Software Engineer – Model Compression Algorithm

Intel

| Shanghai, China (On Site) | Full Time | 1 day ago

Apply Now

Job Summary

The Intel Neural Compressor team is seeking a highly motivated software engineer to develop the Intel Neural Compressor product and related tools for Intel AI platforms (CPU, GPU, AI Accelerator). The role involves researching and implementing quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models, as well as exploring cutting-edge directions in efficient model deployment and inference/finetuning acceleration.

Must Have

Major in computer science or related subject
Solid understanding of deep learning, deep learning framework and large language model (LLM) fundamentals
Familiarity with model compression techniques such as quantization and pruning
Proficiency in Python or other programming languages commonly used for deep learning development
Passion to work as a professional software engineer and team player
Strong self-motivation and problem-solving skill
Good English oral and written skill

Good to Have

Passion for technological innovation and practical engineering, with a drive for continuous exploration and improvement
Experience in model fine-tuning, inference optimization or related tool development

Job Description

Job Details:

Job Description:

Intel Neural Compressor team is looking for a highly motivated talend to join us

Responsibilities includes:

• Develope Intel Neural Compressor product and related tools to support Intel AI platform, including CPU, GPU and AI Accelerator

• Research and implement quantization and compression techniques for large language models (LLMs) and text-to-image/video generation models

• Explore cutting-edge directions in efficient model deployment and inference/finetuing acceleration.

Qualifications:

• Major in computer science or related subject

• Solid understanding of deep learning, deep learning framework and large language model (LLLM) fundamentals

• Familiarity with model compression techniques such as quantization and pruning

• Proficiency in Python or other programming languages commonly used for deep learning development

• Passion to work as a professional software engineer and team player

• Strong self-motivation and problem-solving skill

• Good English oral and written skill

Preferred Qualifications

• Passion for technological innovation and practical engineering, with a drive for continuous exploration and improvement

• Experience in model fine-tuning, inference optimization or related tool development is a plus

Job Type:

College Grad

Shift:

Shift 1 (China)

Additional Locations:

Business group:

The Sales and Marketing Group (SMG) leverages the product portfolio to drive Intel's revenue growth and market expansion, blending strategic initiatives with dynamic sales efforts to capture and retain customers. SMG is responsible for empowering the sales force with tools and insights needed to close deals and build lasting customer relationships. Sales analytics and market research ensure strategies are both targeted and impactful. In SMG, disciplined execution, creativity, and ambition are celebrated, providing ample opportunities for career advancement and skill development.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Work Model for this Role

This role will require an on-site presence. * Job posting details (such as work model, location or time type) are subject to change.

7 Skills Required For This Role

Revenue Growth Team Player Game Texts Market Research Model Deployment Deep Learning Python

Similar Jobs