LLM Algorithm Engineer

2 Weeks ago • 3 Years +

Job Summary

Job Description

The LLM Algorithm Engineer will be responsible for production deployment and optimization of LLM environments, LLM application development, and designing prompt engineering strategies. They will design distributed deployment solutions, build a multi-modal GPU cluster management system, and lead model fine-tuning using efficient parameter tuning techniques. The engineer will also develop enterprise-grade internal toolchains and design external customer systems. The role involves working in a tech-driven company with headquarters in the heart of the city, offering interesting and challenging projects.
Must have:
  • 3+ years of experience in NLP/LLM projects.
  • Proficiency in PyTorch/TensorFlow frameworks.
  • Familiarity with CUDA programming and NVLink topology.
  • Mastery of development and deployment frameworks.
Good to have:
  • Pre-training or fine-tuning of open-source large models.
  • Development of intelligent customer service systems.
  • Enterprise-level code assistance tools.
  • Construction of knowledge graphs in domains like e-commerce.
Perks:
  • Start with 20 days of annual leave.
  • Monthly lunch allowance.
  • English courses.
  • Onsite gym.

Job Details

Job Summary:

We are looking to add a Large Language Model (LLM) Algorithm Engineer in Changsha, China within our EM Labs team.

It is a great opportunity to work in a tech-driven company. In a relaxed and friendly environment, our headquarters are in the heart of the city, at Runhe Financial Center, full of interesting and challenging projects.

Company Intro:

EveryMatrix is a leading B2B SaaS provider delivering iGaming software, content and services. We provide casino, sports betting, platform and payments, and affiliate management to 200 customers worldwide. The company is profitable, has over EUR 100m in annual revenues, and 1200+ employees in offices across ten countries in Europe, Asia and the US. EveryMatrix was founded in 2008 and remains a founder-owned private company.

What You'll get to do:

  • Production Deployment & Optimization of LLM Environments:
  • LLM Application Development:
  • Design distributed deployment solutions based on NVIDIA hardware architecture (NVLink/NVSwitch). Lead framework selection and performance tuning for vLLM, TensorRT-LLM, SGLang, etc., to achieve high-throughput inference services.
  • Build a multi-modal GPU cluster management system to optimize KV Cache storage and loading strategies, improving service efficiency for long-context scenarios.
  • Model optimization and engineering deployment.
  • Design Prompt Engineering strategies combined with RAG (Retrieval-Augmented Generation) technology to enhance response accuracy in scenarios like intelligent customer service and knowledge-based Q&A. Familiarity with LangChain/AutoGen frameworks is required.
  • Lead model fine-tuning (Fine-tuning) using efficient parameter tuning techniques like LoRA/QLoRA to address long-tail issues in vertical domains. Proficiency in PEFT (Parameter-Efficient Fine-Tuning) methods is essential.
  • Develop enterprise-grade internal toolchains, including code assistance/generation tools, code review systems, and private knowledge-based systems.
  • Design external customer systems, such as smart customer service platforms (integrating speech recognition, ticket management, and compliance auditing). Build multi-Agent collaborative online assistants leveraging multi-Agent task allocation mechanisms.

Requirements:

  • Education & Experience:
  • Technical Proficiency:
  • Preferred Qualifications or Experience with:
  • Master’s degree or above in Computer Science, Artificial Intelligence, or related fields.
  • 3+ years of professional experience in NLP/LLM projects.
  • Proficiency in PyTorch/TensorFlow frameworks, with deep understanding of Transformer architecture and optimization of attention mechanisms.
  • Familiarity with CUDA programming and NVLink topology design. Experience in NVIDIA chip operator development (e.g., CUDA kernel optimization) is a plus.
  • Mastery of development and deployment frameworks such as LangChain, vLLM, and SGLang, with the ability to independently develop and deploy API services.
  • Pre-training or fine-tuning of open-source large models (e.g., LLaMA, DeepSeek).
  • Development of intelligent customer service systems (knowledge of IVR, ACD, and call center technologies required).
  • Enterprise-level code assistance tools (e.g., code generation, code review systems).
  • Construction of knowledge graphs in domains like e-commerce or internet industries.

Here's what we offer:

  • Start with 20 days of annual leave, with 2 additional days added each year, up to 30 days by your fifth year with us. Enjoy an additional 13 public holidays and time off for special events, including parental leave, sick leave, bereavement leave, and marriage leave.

Stay Healthy: 10 sick leave days per year, no doctor's note required.

Support for New Parents:

22 weeks of paid maternity leave, with the flexibility to work from home full-time until your child turns 1 year old.

4 weeks of paternity leave, plus the flexibility to work from home full-time until your child is 13 weeks old.

  • Our office perks include on-site massages, and frequent team-building activities in various locations.

Benefits & Perks:

  • Monthly lunch allowance.
  • English courses.
  • Onsite gym.
  • Access online learning platforms like Udemy for Business and LinkedIn Learning, and a budget for external training.

At EveryMatrix, we're committed to creating a supportive and inclusive workplace where you can thrive both personally and professionally. Come join us and experience the difference!

Similar Jobs

Attentive - Senior Software Engineer, Search Optimization

Attentive

(Remote)
2 Months ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
The Walt Disney Company - Lead Applied AI Engineer

The Walt Disney Company

Santa Monica, California, United States (On-Site)
1 Month ago
Google - Manager, gTech Ads Customer Support, Tech CoE

Google

Gurugram, Haryana, India (On-Site)
2 Days ago
Meta - Research Scientist, Computer Vision for Generative AI (PhD)

Meta

New York, New York, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Research Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (MS)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Zazz - Machine Learning Engineer

Zazz

(Remote)
2 Months ago
ByteDance - Senior Research Engineer / Scientist - AI for Databases

ByteDance

Seattle, Washington, United States (On-Site)
3 Days ago
Scale AI - Director, Agent Research

Scale AI

San Francisco, California, United States (On-Site)
1 Day ago
Salesforce - 2025 PhD Intern - AI Research, Singapore

Salesforce

Singapore, Singapore (On-Site)
6 Months ago
Fortanix - Staff Software Engineer

Fortanix

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Electronic Arts - Senior Software Engineer

Electronic Arts

Orlando, Florida, United States (On-Site)
4 Weeks ago
Google - ML System Engineer, AICore

Google

Taipei City, Taiwan (On-Site)
2 Days ago
Genies - Machine Learning Engineer: 3D Generative AI

Genies

San Mateo, California, United States (Remote)
6 Months ago
Playrix - Generative AI Engineer

Playrix

Cyprus (Remote)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Changsha, Hunan, China

Thatgamecompany - Marketing Manager - Offline Events - China

Thatgamecompany

Shanghai, Shanghai, China (On-Site)
1 Month ago
sony global (Games) - Global HR Platform Consultant

sony global (Games)

Dalian, Liaoning, China (On-Site)
1 Day ago
NVIDIA - Board Design Engineer, LDE

NVIDIA

Shenzhen, Guangdong Province, China (On-Site)
1 Month ago
Qualcomm - Test Technician, Senior

Qualcomm

Shenzhen, Guangdong Province, China (On-Site)
14 Hours ago
Yodo1 - China Publishing BD Manager

Yodo1

China (Remote)
9 Months ago
Paper Stacking games - Public Relations Manager

Paper Stacking games

Shanghai, China (On-Site)
1 Day ago
Ubisoft - Production Manager (Assassin's Creed)

Ubisoft

Chengdu, Sichuan, China (On-Site)
2 Days ago
NVIDIA - Manufacturing Engineer

NVIDIA

Shenzhen, Guangdong Province, China (On-Site)
1 Month ago
Paper Games - Business Management Trainee (Spring 2025)

Paper Games

Shanghai, Shanghai, China (On-Site)
1 Month ago
NVIDIA - Senior ASIC Verification Engineer

NVIDIA

Shanghai, Shanghai, China (Hybrid)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Bucharest, Bucharest, Romania (Hybrid)

Bucharest, Bucharest, Romania (Hybrid)

London, England, United Kingdom (Hybrid)

Lviv, Lviv Oblast, Ukraine (Hybrid)

Lviv, Lviv Oblast, Ukraine (Hybrid)

Sliema, Malta (Hybrid)

Bucharest, Bucharest, Romania (Hybrid)

Changsha, Hunan, China (On-Site)

Sliema, Malta (Hybrid)

London, England, United Kingdom (Hybrid)

View All Jobs

Get notified when new jobs are added by Every matrix

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug