Research Engineer, Machine Learning

1 Year ago • 4 Years +
Research Development

Job Description

Mistral AI is seeking a Research Engineer specializing in Machine Learning to build and optimize large-scale learning systems for their open-weight models. Working closely with Research Scientists, you will contribute to either the Platform RE Team, enhancing shared training frameworks, data pipelines, and cluster tooling, or the Embedded RE Team, integrating novel ideas into repeatable, scalable code within research squads. Responsibilities include accelerating researchers by managing large-scale ML pipelines, interfacing research with production by integrating checkpoints and streamlining evaluations, conducting experiments on cutting-edge deep learning techniques, designing and implementing ML algorithms, and delivering production-grade prototypes for products like 'Le Chat' and enterprise APIs. The role requires strong software design instincts, proficiency in Python, and experience with distributed training frameworks.
Good To Have:
  • CUDA experience
  • Data-pipeline experience
Must Have:
  • Master's or PhD in Computer Science or equivalent
  • 4+ years on large-scale ML codebases
  • Hands-on with PyTorch, JAX, or TensorFlow
  • Comfortable with distributed training
  • Experience in deep learning, NLP, or LLMs
  • Strong software-design instincts
  • Self-starter, low-ego, collaborative
Perks:
  • Competitive cash salary and equity
  • Daily lunch vouchers (France)
  • Monthly contribution to Gympass subscription (France)
  • Monthly contribution to a mobility pass (France)
  • Full health insurance for you and your family (France)
  • Generous parental leave policy (France)
  • Visa sponsorship (France)
  • Insurance (UK)
  • Reimbursement for office parking or public transport (UK)
  • Reimbursement for gym membership (UK)
  • Monthly meal allowance (UK)
  • Pension plan (UK)

Add these skills to join the top 1% applicants for this job

cuda
prototyping
pytorch
deep-learning
ci-cd
python
algorithms
tensorflow
machine-learning

About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimised, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet the needs of enterprises, whether on-premises or in cloud environments. Our offerings include Le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, the USA, the UK, Germany and Singapore. We are creative, low-ego and team-spirited. Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.

Role Summary 

About the Research Engineering team
The team spans Platform (shared infra & clean code) and Embedded (inside research squads). Engineers can move along the research↔production spectrum as needs or interests evolve.

As a Research Engineer – ML track, you’ll build and optimise the large-scale learning systems that power our open-weight models. Working hand-in-hand with Research Scientists, you’ll either join:

-Platform RE Team: Enhance the shared training framework, data pipelines and cluster tooling used by every team; or
-Embedded RE Team: Sit inside a research squad (Alignment, Pre-training, Multimodal, …) and turn fresh ideas into repeatable, scalable code.

Location: Paris / London (hybrid) or remote EU/UK with one hub day per month.

What will you do

-Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
-Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
-Conduct experiments on the latest deep-learning techniques (sparsified 70 B + runs, distributed training on thousands of GPUs).
-Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
-Deliver prototypes that become production-grade components for Le Chat and our enterprise API.

About you

-Master’s or PhD in Computer Science (or equivalent proven track record).
-4 + years working on large-scale ML codebases.
-Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed / FSDP / SLURM / K8s).
-Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops.
-Strong software-design instincts: testing, code review, CI/CD.
-Self-starter, low-ego, collaborative.

Benefits

France
💰 Competitive cash salary and equity
🥕 Food: Daily lunch vouchers
🥎 Sport: Monthly contribution to a Gympass subscription
🚴 Transportation: Monthly contribution to a mobility pass
🧑‍⚕️ Health: Full health insurance for you and your family
🍼 Parental: Generous parental leave policy
🌎 Visa sponsorship

UK
💰 Competitive cash salary and equity
🚑 Insurance
🚴 Transportation: Reimburse office parking charges, or  £90 per month for public transport
🥎 Sport:  £90 per month reimbursement for gym membership
🥕 Meal voucher: £200 monthly allowance for meals
💰 Pension plan: SmartPension (percentages are 5% Employee & 3% Employer)

Set alerts for more jobs like Research Engineer, Machine Learning
Set alerts for new jobs by Mistral AI
Set alerts for new Research Development jobs in France
Set alerts for new jobs in France
Set alerts for Research Development (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙