Machine Learning Engineer, Open-Source Software

il y a 2 semaines


Paris, Île-de-France Mistral Ai Temps plein

About Mistral 

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society.

Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on 

Role Summary

You will be in charge of open-sourcing state-of-the-art models, whilst maintaining and improving Mistral's publicly available libraries. Your work is critical in helping turn research breakthroughs into tangible solutions and improve Mistral's open-source ecosystem.

About the Open Source Software team

Our OSS team is embedded in our Science team and works very closely with various engineering and marketing teams. All OSS team members can fluidly move on the production / research spectrum depending on where the needs are or where their interests lie

What you will do


Releasing our models to open-source platforms and libraries, e.g., vLLM, GitHub, Hugging Face


Maintaining Mistral's open-source libraries (mistral-common, mistral-finetune, mistral-inference)


Create and maintain tooling and services: both internal facing (internal research) and external facing (open-source libraries)


Implement and optimize open-source and internal libraries for performance and accuracy, ensuring production readiness and employing cutting-edge technology and innovative approaches


Collaborate with the open-source community (PyTorch, vLLM, Hugging Face)

About you


Master's degree in Computer Science, Machine Learning, Data Science, or a related field


Experience contributing to popular open-source libraries such as PyTorch, Tensorflow, JAX, vLLM, Transformers, , ...


Passion for contributing to the open-source software ecosystem


Expert programming skills in Python, PyTorch, MLOps


Adaptable, proactive, and autonomous


Attention to detail and a drive to go the last mile to build almost perfect tools


Deep understanding of machine learning approaches, especially LLMs and algorithms


Low-ego, collaborative and have a real team player mindset

Now, it would be ideal if you have:


Experience with training and fine-tuning large language models (e.g., distillation, supervised fine-tuning, policy optimization)


Experience working with Slurm


Worked with research teams before


Experience as a core-maintainer of a popular ML open-source library

Location & Remote

This role is primarily based at one of our European offices (Paris, France and London, UK). We will prioritize candidates who either reside there or are open to relocating. We strongly believe in the value of in-person collaboration to foster strong relationships and seamless communication within our team.

In certain specific situations, we will also consider remote candidates based in one of the countries listed in this job posting — currently France & UK. In that case, we ask all new hires to visit our local office:


•  for the first week of their onboarding (accommodation and travelling covered)


•  then at least 3 days per month

What we offer

Competitive salary and equity

Health insurance

Transportation allowance

Sport allowance

Meal vouchers

Private pension plan

Parental : Generous parental leave policy

Visa sponsorship


  • Machine Learning Engineer

    il y a 4 jours


    Paris, Île-de-France URBAN LINKER Temps plein

    Machine Learning Engineer – Scale-up Tech / Énergie – Paris (Hybrid)Paris, 3 jours sur site minimum| CDI| Rémunération : 50–80K€ selon profil À propos :Nous accompagnons unestart-up française en forte croissance, positionnée sur des sujets à fort impact autour de ladataet dumachine learning appliqué à l'énergie et à la performance...


  • Paris, Île-de-France Blackfluo Temps plein

    Job Description:Location: Paris, 3 to 4 days of remoteStart date: To be definedLanguages: English is mandatoryWe are seeking a highly skilled AI Engineer with deep expertise in Image Machine Learning to join our innovative technology team.The ideal candidate will combine advanced technical skills in AI/ML with robust developer capabilities.Key...

  • Machine Learning Engineer

    il y a 2 semaines


    Paris, Île-de-France Sia Temps plein

    Sia Partners réinvente le métier du conseil et apporte un regard innovant et des résultats concrets à ses clients. Nous avons développé des solutions basées sur l'Intelligence Artificielle et le design pour augmenter l'impact de nos missions de conseil. Notre présence globale et notre expertise dans plus de 30 secteurs et services nous permettent...

  • Machine Learning Engineer

    il y a 2 semaines


    Paris, Île-de-France Sia Temps plein

    Description de l'entreprise Sia Partners réinvente le métier du conseil et apporte un regard innovant et des résultats concrets à ses clients. Nous avons développé des solutions basées sur l'Intelligence Artificielle et le design pour augmenter l'impact de nos missions de conseil. Notre présence globale et notre expertise dans plus de 30 secteurs et...

  • Machine Learning Engineer

    il y a 2 semaines


    Paris, Île-de-France Sia Temps plein

    Sia est un groupe international de conseil en management de nouvelle génération, fondé en 1999. Nés à l'ère du digital, nous sommes augmentés par la data, enrichis par la créativité et guidés par la responsabilité. Nous collaborons avec nos clients pour relever les défis et saisir les opportunités. Dans un monde en pleine mutation, nous croyons...

  • Machine Learning Engineer

    il y a 2 semaines


    Paris, Île-de-France Sia Temps plein

    Description de l'entreprise Sia est un groupe international de conseil en management de nouvelle génération, fondé en 1999. Nés à l'ère du digital, nous sommes augmentés par la data, enrichis par la créativité et guidés par la responsabilité. Nous collaborons avec nos clients pour relever les défis et saisir les opportunités. Dans un monde en...

  • Machine Learning Engineer

    il y a 4 jours


    Paris, Île-de-France Gorgias Temps plein

    We believe conversations will become the #1 way to shop.At Gorgias, we're building the platform that makes this real: a unified AI agent that sells, supports, and re-engages customers across the entire journey. Conversational Commerce is the future of ecommerce, and we're leading that shift.Our mission is to turn every interaction between a brand and its...

  • Machine Learning Engineer

    il y a 4 jours


    Paris, Île-de-France Gorgias Temps plein

    We believe conversations will become the #1 way to shop.At Gorgias, we're building the platform that makes this real: a unified AI agent that sells, supports, and re-engages customers across the entire journey. Conversational Commerce is the future of ecommerce, and we're leading that shift.Our mission is to turn every interaction between a brand and its...


  • Paris, Île-de-France Raidium Temps plein

    Raidium develops a radiological foundation model as the "GPT" of radiology (manifesto). This new generation of AI will enable the building of an imaging biomarker factory for both clinical practice and research, tackling the complexity of precision medicine. As a Senior Machine Learning Engineer at Raidium, you will be a driving force in translating our...


  • Paris, Île-de-France SoundHound AI Temps plein

    Your Career, our Future—Together.Ready to join something big? At SoundHound AI, we bring voice, generative, and conversational AI together to transform how people interact with products and services. From voice-enabled vehicles to food ordering and customer support, our multilingual, omnichannel technology already impacts hundreds of millions worldwide.The...