Lead Software Engineer, Runtime

il y a 2 semaines


Paris, Île-de-France Mistral Ai Temps plein

About Mistral 

At Mistral AI, we believe in the power of AI to simplify tasks, save time and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society.

Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on 

Role summary

As the Technical Lead for the Inference team, you will drive the architecture and optimization of our inference backbone, ensuring high performance, scalability, and efficiency in a dynamic environment. You will lead the acquisition and automation of benchmarks, collaborate with cross-functional teams, and innovate solutions to enhance our AI-powered applications.

What you will do


• Architect and optimize the inference for high-volume, low-latency, and high-availability environments.


• Lead the acquisition and automation of benchmarks at both micro and macro scales.


• Introduce new techniques and tools to improve performance, latency, throughput, and efficiency in our model inference stack.


• Build tools to identify bottlenecks and sources of instability, and design solutions to address them.


• Collaborate with machine learning researchers, engineers, and product managers to bring cutting-edge technologies into production.


• Optimize code and infrastructure to maximize hardware utilization and efficiency.


• Mentor and guide team members, fostering a culture of collaboration, innovation, and continuous learning.

About you


• Extensive experience in C++ and Python, with a strong focus on backend development and performance optimization.


• Deep understanding of modern ML architectures and experience with performance optimization for inference.


• Proven track record with large-scale distributed systems, particularly performance-critical ones.


• Familiarity with PyTorch, TensorRT, CUDA, NCCL.


• Strong grasp of infrastructure, continuous integration, and continuous development principles.


• Ability to lead and mentor team members, driving projects from concept to implementation.


• Results-oriented mindset with a bias towards flexibility and impact.


• Passion for staying ahead of emerging technologies and applying them to Al-driven solutions.


• Humble attitude, eagerness to help colleagues, and a desire to see the team succeed.

Our Culture

We're driven to build a strong company culture and are looking for individuals with solid alignment with the following:


• Reason with rigor


• Are you audacious enough?


• Make our customers succeed


• Ship early and accelerate


• Leave your ego aside 

Location & Remote

This role is based in one of our European offices (Paris, France and London, UK). We will only consider candidates who either reside or are open to relocating there. We strongly believe in the value of in-person collaboration and we encourage going to the office as much as we can (at least 3 days per week) to create bonds and smooth communication. Our remote policy aims to provide flexibility, improve work-life balance and increase productivity.

What we offer

Competitive salary and equity (stock-options)

Health insurance

Transportation allowance

Sport allowance

Meal vouchers

Private pension plan

Generous parental leave policy

Visa sponsorship


  • Lead Software Engineer

    il y a 7 jours


    Paris, Île-de-France Dataworks Temps plein

    Lead Software Engineer – Marketing/AdTechParis (Can be remote 50% of the time)€65–80K + BSPCETech Stack:Python, Javascript, React, AWSWe're partnering with a fast-growing global consumer platform that builds, acquires, and scales digital brands across e-commerce and retail. Their mission is to turn high-potential consumer brands into global leaders,...

  • Lead software engineer

    il y a 2 semaines


    Paris, Île-de-France Direction Générale de la Sécurité Extérieure Temps plein

    La Direction Générale de la Sécurité Extérieure, DGSE, recrute un lead software engineer - Systèmes distribués – Plateforme de données (H/F).Le poste est situé à Paris. La nationalité française est obligatoire.Domaine métierSciences et TechnologiesVotre environnement de travailLes flux de données traités par la DGSE sont massifs et...

  • Lead Software Engineer

    il y a 2 jours


    Paris, Île-de-France ILLUIN Technology Temps plein

    ILLUIN est à la pointe de la technologie sur des sujets d'Intelligence Artificielle et de Software Engineering, menant des projets clients sur mesure ainsi que ses propres produits innovants (agents conversationnels, parsing de documents, traitement de la voix.). En tant que Lead Software Engineer, votre rôle sera de guider techniquement l'équipe, de...


  • Paris, Île-de-France Siemens Digital Industries Software Temps plein

    Siemens Digital Industries Software - Where today meets tomorrow.Let's make the difference togetherMeet the team - VideoSiemens Digital Industries (DI) is an innovation leader in automation and digitalization. Closely, collaborating with partners and customers, we care about the digital transformation in the process and discrete industries. With our Digital...

  • Lead AI Software Engineer

    il y a 2 semaines


    Paris, Île-de-France DeepIP Temps plein

    Tech / EngineeringParisHybridLead AI Software Engineer*About DeepIPAt DeepIP, our vision is tobuild the AI operating system for IP practitioners*. Intellectual Property is not just legal paperwork—it's a company's strategic DNA. Yet today's patent professionals are drowning in inefficient processes, outdated tools, and mountains of prior art.We are...

  • Lead Software Engineer

    il y a 1 semaine


    Paris, Île-de-France Kleio Temps plein

    Kleio's Conversational AI is transforming sales and marketing, enabling AI-driven conversations that qualify, engage, and convert high-intent buyers in real time.Our mission is to redefine how B2C and B2B businesses connect with customers, making every interaction intelligent, personal, and effortless. Our multi-agent AI platform automates lead profiling,...

  • Software Engineer

    il y a 2 semaines


    Paris, Île-de-France Roland Berger Temps plein

    Company Description Founded in 1990, Roland Berger Paris is one of the leading consulting firms in France, and has more than 300 employees.The Paris office is recognized as a reference by the largest industrial and service groups, and covers multiple business sectors (Aerospace & Defense, Automotive, Consumer Goods, Energy & Environment, Industry, Private...


  • Paris, Île-de-France DeepIP Temps plein

    Tech / EngineeringParisHybridLead Fullstack Software EngineerAbout UsAt DeepIP, our vision is tobuild the AI operating system for IP practitioners. Intellectual Property is not just legal paperwork, it's a company's strategic DNA. Yet today's patent professionals are drowning in inefficient processes, outdated tools, and mountains of prior art.We are...


  • Paris, Île-de-France Siemens EDA (Siemens Digital Industries Software) Temps plein

    Siemens EDA is a global technology leader in Electronic Design Automation software. Our software tools enable companies around the world to develop highly innovative electronic products faster and more cost-effectively. Our customers use our tools to push the boundaries of technology and physics to deliver better products in the increasingly complex world of...

  • Lead AI Software Engineer

    il y a 2 semaines


    Paris, Île-de-France DeepIP Temps plein

    About DeepIPAt DeepIP, our vision is to build the AI operating system for IP practitioners. Intellectual Property is not just legal paperwork—it's a company's strategic DNA. Yet today's patent professionals are drowning in inefficient processes, outdated tools, and mountains of prior art.We are building AI-powered assistants to radically transform the...