Machine Learning Engineer, Open-Source Software
il y a 2 semaines
About Mistral
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.
We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.
We are a dynamic, collaborative team passionate about AI and its potential to transform society.
Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.
Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on
Role Summary
You will be in charge of open-sourcing state-of-the-art models, whilst maintaining and improving Mistral's publicly available libraries. Your work is critical in helping turn research breakthroughs into tangible solutions and improve Mistral's open-source ecosystem.
About the Open Source Software team
Our OSS team is embedded in our Science team and works very closely with various engineering and marketing teams. All OSS team members can fluidly move on the production / research spectrum depending on where the needs are or where their interests lie
What you will do
• Releasing our models to open-source platforms and libraries, e.g., vLLM, GitHub, Hugging Face
• Maintaining Mistral's open-source libraries (mistral-common, mistral-finetune, mistral-inference)
• Create and maintain tooling and services: both internal facing (internal research) and external facing (open-source libraries)
• Implement and optimize open-source and internal libraries for performance and accuracy, ensuring production readiness and employing cutting-edge technology and innovative approaches
• Collaborate with the open-source community (PyTorch, vLLM, Hugging Face)
About you
• Master's degree in Computer Science, Machine Learning, Data Science, or a related field
• Experience contributing to popular open-source libraries such as PyTorch, Tensorflow, JAX, vLLM, Transformers, , ...
• Passion for contributing to the open-source software ecosystem
• Expert programming skills in Python, PyTorch, MLOps
• Adaptable, proactive, and autonomous
• Attention to detail and a drive to go the last mile to build almost perfect tools
• Deep understanding of machine learning approaches, especially LLMs and algorithms
• Low-ego, collaborative and have a real team player mindset
Now, it would be ideal if you have:
• Experience with training and fine-tuning large language models (e.g., distillation, supervised fine-tuning, policy optimization)
• Experience working with Slurm
• Worked with research teams before
• Experience as a core-maintainer of a popular ML open-source library
Location & Remote
This role is primarily based at one of our European offices (Paris, France and London, UK). We will prioritize candidates who either reside there or are open to relocating. We strongly believe in the value of in-person collaboration to foster strong relationships and seamless communication within our team.
In certain specific situations, we will also consider remote candidates based in one of the countries listed in this job posting — currently France & UK. In that case, we ask all new hires to visit our local office:
• for the first week of their onboarding (accommodation and travelling covered)
• then at least 3 days per month
What we offer
Competitive salary and equity
Health insurance
Transportation allowance
Sport allowance
Meal vouchers
Private pension plan
Parental : Generous parental leave policy
Visa sponsorship
-
Machine Learning Engineer
il y a 4 jours
Paris, Île-de-France URBAN LINKER Temps pleinMachine Learning Engineer – Scale-up Tech / Énergie – Paris (Hybrid)Paris, 3 jours sur site minimum| CDI| Rémunération : 50–80K€ selon profil À propos :Nous accompagnons unestart-up française en forte croissance, positionnée sur des sujets à fort impact autour de ladataet dumachine learning appliqué à l'énergie et à la performance...
-
Machine Learning Image Engineer
il y a 4 jours
Paris, Île-de-France Blackfluo Temps pleinJob Description:Location: Paris, 3 to 4 days of remoteStart date: To be definedLanguages: English is mandatoryWe are seeking a highly skilled AI Engineer with deep expertise in Image Machine Learning to join our innovative technology team.The ideal candidate will combine advanced technical skills in AI/ML with robust developer capabilities.Key...
-
Machine Learning Engineer
il y a 2 semaines
Paris, Île-de-France Sia Temps pleinSia Partners réinvente le métier du conseil et apporte un regard innovant et des résultats concrets à ses clients. Nous avons développé des solutions basées sur l'Intelligence Artificielle et le design pour augmenter l'impact de nos missions de conseil. Notre présence globale et notre expertise dans plus de 30 secteurs et services nous permettent...
-
Machine Learning Engineer
il y a 2 semaines
Paris, Île-de-France Sia Temps pleinDescription de l'entreprise Sia Partners réinvente le métier du conseil et apporte un regard innovant et des résultats concrets à ses clients. Nous avons développé des solutions basées sur l'Intelligence Artificielle et le design pour augmenter l'impact de nos missions de conseil. Notre présence globale et notre expertise dans plus de 30 secteurs et...
-
Machine Learning Engineer
il y a 2 semaines
Paris, Île-de-France Sia Temps pleinSia est un groupe international de conseil en management de nouvelle génération, fondé en 1999. Nés à l'ère du digital, nous sommes augmentés par la data, enrichis par la créativité et guidés par la responsabilité. Nous collaborons avec nos clients pour relever les défis et saisir les opportunités. Dans un monde en pleine mutation, nous croyons...
-
Machine Learning Engineer
il y a 2 semaines
Paris, Île-de-France Sia Temps pleinDescription de l'entreprise Sia est un groupe international de conseil en management de nouvelle génération, fondé en 1999. Nés à l'ère du digital, nous sommes augmentés par la data, enrichis par la créativité et guidés par la responsabilité. Nous collaborons avec nos clients pour relever les défis et saisir les opportunités. Dans un monde en...
-
Machine Learning Engineer
il y a 4 jours
Paris, Île-de-France Gorgias Temps pleinWe believe conversations will become the #1 way to shop.At Gorgias, we're building the platform that makes this real: a unified AI agent that sells, supports, and re-engages customers across the entire journey. Conversational Commerce is the future of ecommerce, and we're leading that shift.Our mission is to turn every interaction between a brand and its...
-
Machine Learning Engineer
il y a 4 jours
Paris, Île-de-France Gorgias Temps pleinWe believe conversations will become the #1 way to shop.At Gorgias, we're building the platform that makes this real: a unified AI agent that sells, supports, and re-engages customers across the entire journey. Conversational Commerce is the future of ecommerce, and we're leading that shift.Our mission is to turn every interaction between a brand and its...
-
Senior Machine Learning Engineer
il y a 4 jours
Paris, Île-de-France Raidium Temps pleinRaidium develops a radiological foundation model as the "GPT" of radiology (manifesto). This new generation of AI will enable the building of an imaging biomarker factory for both clinical practice and research, tackling the complexity of precision medicine. As a Senior Machine Learning Engineer at Raidium, you will be a driving force in translating our...
-
Senior Machine Learning Engineer, ASR
il y a 2 jours
Paris, Île-de-France SoundHound AI Temps pleinYour Career, our Future—Together.Ready to join something big? At SoundHound AI, we bring voice, generative, and conversational AI together to transform how people interact with products and services. From voice-enabled vehicles to food ordering and customer support, our multilingual, omnichannel technology already impacts hundreds of millions worldwide.The...