AI Scientist - Palo Alto (Internship, PhD)

1 Month ago • All levels • Research Development

Job Summary

Job Description

Mistral AI is a dynamic, collaborative company focused on democratizing AI through high-performance, optimized, open-source models and a comprehensive AI platform. They are hiring AI Scientists to work with the fine-tuning team on state-of-the-art generative models. This is an on-site, fixed-term internship (3-6 months) for PhD candidates, based in their Bay Area offices. The role involves exploring LLM algorithms, assisting in model design, conducting research, and contributing to LLM system development and optimization.

Job Details

About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.

Mistral AI is hiring experts in pre-training and fine-tuning large language models.

Role Summary

  • You will work with the fine-tuning team to build state-of-the-art generative models.
  • You will run autonomous work streams under the supervision of experienced scientists.
  • Internship duration: 3 to 6 months. We will only consider PhD candidates looking for an end-of-studies internship.

What you will do

  • Explore state-of-the-art algorithms for fine-tuning LLMs, under the supervision of top-level scientists.
  • Assist in the design and implementation of machine learning models and algorithms.
  • Conduct research on the latest advancements in natural language processing and LLMs.
  • Contribute to the development and optimization of our LLM systems.
  • Collaborate with cross-functional teams to integrate LLM technologies into various applications.
  • Perform data analysis and visualization to support research and development efforts.
  • Document research findings and contribute to technical reports and publications.
  • Participate in team meetings and brainstorming sessions to share ideas and insights.

About you

  • Currently pursuing a PhD at a tier-1 engineering school or university.
  • High scientific understanding of the field of generative AI.
  • Broad knowledge of the field of AI, and specific knowledge or interest in fine-tuning and using language models for applications.
  • Strong programming skills in Python, with experience in libraries such as TensorFlow, PyTorch, or similar.
  • Familiarity with natural language processing techniques and machine learning algorithms.
  • Ability to design complex software and make it usable in production.
  • Ability to navigate the full MLOps technical stack, with a focus on architecture development and on model evaluation and usage.
  • Previous experience with LLMs or related technologies.
  • Knowledge of deep learning frameworks and techniques.
  • Experience with version control systems (e.g., Git) and the Linux shell environment.

Now, it would be ideal if you:

  • Have experience in fine-tuning LLMs.
  • Have used complex HPC infrastructure with full autonomy.
