Senior Data Scientist NLP/GenAI
il y a 4 jours
About Mirakl
Mirakl is the leading provider of eCommerce software solutions. Mirakl's suite of solutions provides enterprises with a transformative way to drive significant growth and efficiency in their online business.
Since 2012, Mirakl has been pioneering the platform economy, empowering retail and B2B enterprises with the most advanced, secure and scalable technology to digitize and expand product assortment through marketplace and dropship, improve efficiency in supplier catalog management and payments, personalize shopping experiences, and boost profits through retail media.
Mirakl is trusted by 400+ industry-leading businesses worldwide including Macy's, Decathlon, Best Buy, Airbus, Toyota Material Handling and Sonepar.
Headquartered in Paris with another office in Bordeaux and with offices in 7 countries, Mirakl is recognized as a Great Place to Work company.
With more than 350 people, Mirakl Labs teams are mainly based in France. They work together on a daily basis to develop our roadmap for our 5 SaaS solutions.
They also address the issues faced by our customers and users, responding to various challenges related to new features, scalability, security, and usability.
About the job
You'll join our Data Science team, where your main mission will be to prototype, iterate, and ship algorithms to production in close collaboration with Product, Data Engineering, and Software teams. Your projects will focus on Marketplace catalog challenges, including NLP, Computer Vision, and large-scale Generative AI (custom LLMs). The topics you'll tackle will have a real impact on our customers: we aim to make the most of our rich, diverse data to grow their revenue, streamline marketplace operations, and ensure user and transaction safety.
We're hiring on a permanent contract (CDI), based in Paris, Bordeaux, or fully remote.
Catalog topics:
- Automatic rewriting of marketing content based on business needs
- Extracting product attributes from images and free text
- Detecting product variants
- Product categorization
- Automated onboarding of sellers' products
- Merging product pages from multiple sources
- Predicting trending products
What's in it for you:
- Build algorithms that visibly impact 500+ e-commerce/marketplace sites in 40 countries, including some with very high volumes (millions of products, customers, and orders per year)
- Work with cutting-edge techniques (multimodal models, LLM fine-tuning, etc.). Mirakl is one of the few French players with fine-tuned LLMs in large-scale production. Join us and keep pushing that pioneer spirit
- Real autonomy and ownership over your projects
Our stack and tools:
Python, Tensorflow, Pytorch, Hugging Face, Databricks, Spark, AWS (Amazon Redshift, s3, etc.), SQL, Airflow, Delta Lake. Spécifiques LLM : Autotrain, Unsloth, Galileo, LangChain, Anyscale.
Day to day, you will:
- Analyze and prepare data, prototype algorithms
- Put them into production with Data Engineers and dev teams
- Build dashboards to demonstrate algorithm performance and monitor production
- Present results at the weekly data science meeting and join team brainstorms
- Partner with other teams to refine use cases, user experience, and integration paths
You'll love this job if:
- You have at least 4 years' experience as a Data Scientist, with strong hands-on NLP and applied ML in industry
- You've deployed Machine Learning algorithms to production
- You know NLP and Computer Vision algorithms and state-of-the-art architectures (e.g., Transformers). Knowledge of the latest LLMs is a plus
- You're fluent in Python and TensorFlow and/or PyTorch
- You have experience with Spark development
- You're pragmatic, data-driven, and business-oriented
- You take full ownership of your topics, work autonomously, and are a great team player
- You bring a positive mindset: respect and kindness are core to your values
- You enjoy sharing your work through internal talks, conferences, or writing
Meet Arthur Delaitre, Data Science Manager for the team:
Wants to join us ?
- A 30-minute phone call with one of our Tech recruiters. We'll discuss your background, expectations, and what Mirakl can offer you
- A 30-minute technical Zoom with someone from the Data Science team to dive into concrete aspects of your expertise and how it fits our projects
- A take-home assignment
- A 75-minute technical debrief and discussion with the Data Science team manager
- A final 1-hour Zoom with future Mirakl colleagues about our values and culture
We welcome collaborators with their diverse perspectives and experiences to power us forward. These often far exceed conventional job requirements and help us create a culture of continuous learning. If you're ready to join a global leader powering digital transformation for 450+ of the world's most innovative retailers and B2B organizations, we strongly encourage you to apply to any of our roles, even if you think you're not an exact match.
We may use Artificial Intelligence (AI) solutions to help streamline our hiring process, including screening applications, analyzing resumes, and assessing responses. While AI helps us work efficiently, all final hiring decisions are made by humans. For more information, visit our AI Guidelines for Candidates and Interviews.
-
Senior Data Scientist NLP/GenAI
il y a 4 jours
France Mirakl - Labs Temps pleinMirakl est le leader des solutions logicielles pour le e-commerce. Nous proposons aux entreprises une suite unique de solutions leur permettant de transformer significativement leur activité digitale afin d'accélérer de façon durable et rentable leur croissance.Depuis 2012, Mirakl accompagne les entreprises B2C et B2B avec la technologie la plus...
-
Senior NLP
il y a 4 jours
France Mirakl - Labs Temps pleinUne entreprise de technologie e-commerce recherche un Data Scientist pour intégrer son équipe Data Science. Le candidat sera responsable du prototypage et de la mise en production d'algorithmes centrés sur le catalogue marketplace, utilisant des techniques avancées comme le NLP et la Computer Vision. Le poste offre une grande autonomie et l'opportunité...
-
STAGE - Data Scientist GenAI - (H/F)
il y a 2 jours
Boulevard Pereire, Paris, France LittleBigCode Temps pleinTu intégreras une équipe Data Science passionnée, travaillant sur des projets GenAI appliqués au secteur du luxe.L'objectif : développement d'agents IA capables d'interagir avec des données financières afin d'enrichir l'analyse, d'automatiser certaines tâches et d'optimiser la prise de décision.Tes missions :En tant que Data Scientist GenAI , tu...
-
Senior Solutions Data Scientist
il y a 1 semaine
France, Paris; France, Remote Dataiku Temps pleinDataiku is The Universal AI Platform, giving organizations control over their AI talent, processes, and technologies to unleash the creation of analytics, models, and agents. Providing no-, low-, and full-code capabilities, Dataiku meets teams where they are today, allowing them to begin building with AI using their existing skills and knowledge. Dataiku's...
-
Stage Data Scientist NLP
il y a 4 jours
Boulevard de Sébastopol, Paris, France codoc Temps pleincodoc recherche d'un stagiaire Data Scientist dans l'équipe R&D. Tu participeras à l'optimisation des algorithmes de recherche développés en partenariat avec l'Institut Imagine et à leur intégration dans des projets nationaux, tels que Meditwin.Dans le cadre du projet Meditwin, codoc contribue au développement d'outils d'analyse sémantique avancés...
-
GenAI Data Content Rater-French
il y a 4 semaines
France Aceolution Temps pleinWe are seeking a GenAI Data Content Rater (French Language) for a 3-month contract.Key Responsibilities:Collaborate with GenAI researchers and engineers to understand data collection and evaluation requirements.Translate high-level requirements into detailed workflows and communicate them to the team.Execute data collection and evaluation workflows...
-
Manager Data Scientist
il y a 4 jours
France Publicisgroupe Temps pleinOverview Notre communauté Data est composée d'une quinzaine de collaborateurs regroupant Data Scientists, Data Engineers, Data Analysts, Data Strategists et Data Architects, travaillant sur la co-construction d'outils générant de la valeur à partir de la donnée de nos clients. Si vous aussi, vous partagez cette vision et souhaitez profiter et...
-
Data Scientist
il y a 2 semaines
Île-de-France Esmoz Temps pleinOffre CDI – Data Scientist / ML Engineer Localisation : Région Parisienne — Présence requise : 3 jours / semaine sur site Disponibilité : Janvier – Mars 2026 Contrat : CDI Secteur : BanqueContexteNous recherchons un Data Scientist / ML Engineer pour rejoindre un Data Lab dans le secteur bancaire. Vous interviendrez sur le développement et la mise...
-
Data Scientist
il y a 2 semaines
Île-de-France Esmoz Temps pleinOffre CDI – Data Scientist / ML Engineer Localisation : Région Parisienne — Présence requise : 3 jours / semaine sur site Disponibilité : Janvier – Mars 2026 Contrat : CDI Secteur : BanqueContexteNous recherchons un Data Scientist / ML Engineer pour rejoindre un Data Lab dans le secteur bancaire. Vous interviendrez sur le développement et la mise...
-
Data Scientist Senior
il y a 4 jours
Rue Cognacq-Jay, Paris, France Vertone Temps pleinVERTONE recherche un(e) Data Scientist Senior pour renforcer sa cellule Data.A ce titre, vous aurez l'opportunité d'approfondir vos connaissances et compétences en synergie avec les consultants dans des missions d'analyse de données à forts enjeux business.En cohérence avec les projets, vous interviendrez sur les aspects suivants :Comprendre le besoin...