Data Scientist Internship
il y a 2 semaines
Our Engineering team
Our Engineering team is responsible for developing our SaaS platform and building a comprehensive and user-friendly product. Pigment engineers participate in the entire application development lifecycle, focusing on design, coding, and keeping the production platform up and running. They can be specialized, but there is no strict separation between infrastructure, backend, and frontend.
We value user-centricity and pragmatism: we choose the most relevant tools for the problem we have to solve, understanding the strengths and constraints of each technology. Our engineering culture also values curiosity, humility, trust, ownership, and team spirit.
Curious to see what we're building? Check out our Tech Blog
Your mission- As a Data Scientist Intern, you will contribute to advancing Pigment's use of Large Language Models (LLMs) by helping us explore how open-source alternatives can complement or replace commercial APIs. You'll work closely with our AI engineering and data science teams to design, fine-tune, and evaluate models that improve both efficiency and performance across our product.
- Project Overview:
- Exploring Fine-Tuned Open-Source Alternatives to Commercial LLMs
- This project investigates replacing select calls to commercial large language models with fine-tuned open-source alternatives. The goal is to develop localized models that deliver similar or improved performance while reducing latency and cost.
- As an intern, you will:
- Experiment with fine-tuning techniques such as LoRA, QLoRA, and other Parameter-Efficient Fine-Tuning (PEFT) methods.
- Benchmark model performance, accuracy, and inference latency across a range of tasks.
- Identify and document trade-offs between accuracy, speed, and cost for different fine-tuning and deployment strategies.
- Contribute to the development of a scalable, cost-effective foundation for custom LLM deployment to complement Pigment's current stack.
- Collaborate with engineers and data scientists to design robust evaluation pipelines and share findings internally.
- You'll gain hands-on experience in modern LLM fine-tuning, LangChain-based orchestration, and MLOps for model deployment, contributing to research with a direct impact on how AI is embedded into Pigment's platform.
- Our current AI and engineering stack includes:
- Languages & Frameworks: Python, LangGraph
- Data & ML Infrastructure: Weights & Biases, Hugging Face Hub, Google Cloud Platform (GCP), Docker, Kubernetes
- Databases: PostgreSQL
- CI/CD & Experimentation: CircleCI, MLflow, and internal orchestration tools
- We don't expect you to know them all. What matters most is your ability to learn quickly, experiment rigorously, and translate data insights into actionable results.
-
Data Scientist F/H
il y a 14 heures
Paris, Île-de-France un emploi de Data Scientist FH Temps pleinData scientist F/H (Stage de 6 mois)Faire de la Data Science dans la musique, c'est analyser des millions d'écoutes quotidiennes, comprendre comment naissent les hits, comment évoluent les goûts et comment les artistes rencontrent leur public.Sony Music Entertainment France, un des leaders du secteur de la production, de la promotion et de la distribution...
-
Stage Data Scientist
il y a 2 semaines
Paris, Île-de-France un emploi de Stage Data Scientist chez Mirakl Temps pleinAbout MiraklMirakl is the leading provider of eCommerce software solutions. Mirakl's suite of solutions provides enterprises with a transformative way to drive significant growth and efficiency in their online business. Since 2012, Mirakl has been pioneering the platform economy, empowering retail and b2b enterprises with the most advanced, secure and...
-
Stage Data Scientist
il y a 1 semaine
Paris, Île-de-France un emploi de Stage Data Scientist chez Mirakl Temps pleinAbout MiraklMirakl is the leading provider of eCommerce software solutions. Mirakl's suite of solutions provides enterprises with a transformative way to drive significant growth and efficiency in their online business. Since 2012, Mirakl has been pioneering the platform economy, empowering retail and b2b enterprises with the most advanced, secure and...
-
Data Scientist NLP/LLM Engineer confirmé(e)
il y a 3 jours
Paris, Île-de-France Mp Data Temps pleinESN spécialisée Data & IA pour les environnements industriels. Pour l'un de nos clients, nous recherchons un LLM Engineer chargé d'industrialiser les POC GenAI développés par les équipes Data Science et de déployer des solutions robustes et scalables en production.Développement et Industrialisation des POC LLM / GenAI.Conception et optimisation de...
-
Data scientist
il y a 2 semaines
Paris, Île-de-France Collective Temps pleinDans le cadre du renforcement d'une équipe dédiée à la modélisation mathématique, le client recherche un Data Scientist Senior (>5 ans d'expérience) pour intervenir sur un ensemble de sujets variés : optimisation, simulation, analyse de performance, développement de modèles et d'outils applicatifs.La prestation s'inscrit dans un environnement...
-
Data Scientist
il y a 1 semaine
Paris, Île-de-France Ad Scientiam Temps pleinDescriptif du poste :Le Pôle Data Science d'Ad Scientiam est constitué de 4 Data Scientists, 1 Data Engineering Tech Lead et 1 Lead Data Scientist. L'équipe contribue au développement et à l'intégration de biomarqueurs digitaux qui permettent d'améliorer le suivi et la prise en charge de maladies variées (dont la Sclérose en Plaque et la...
-
Data scientist
il y a 2 semaines
Paris, Île-de-France Collective Temps pleinDans le cadre du renforcement d'une équipe dédiée à la modélisation mathématique, le client recherche un Data Scientist Senior ( >5 ans d'expérience) pour intervenir sur un ensemble de sujets variés : optimisation, simulation, analyse de performance, développement de modèles et d'outils applicatifs.La prestation s'inscrit dans un environnement...
-
Data scientiste
il y a 1 semaine
Paris, Île-de-France Craftman data Temps pleinLa Direction Technique du Numérique mène des projets transverses en étroite collaboration avec les autres directions, notamment la direction Data. La Direction Data, au sein de la Direction du Numérique, a été créée avec la volonté de faire de la Data un levier de croissance des offres numériques.La Direction Data a 3 grandes missions : maximiser...
-
Data Scientist Intern
il y a 1 semaine
Paris, Île-de-France Moody's Corporation Temps pleinAt Moody's, we unite the brightest minds to turn today's risks into tomorrow's opportunities. We do this by striving to create an inclusive environment where everyone feels welcome to be who they are—with the freedom to exchange ideas, think innovatively, and listen to each other and customers in meaningful ways. Moody's is transforming how the world sees...
-
cdi - consultant data scientist - h/f
il y a 2 jours
Paris, Île-de-France Havas Data Business Intelligence Temps pleinDans le cadre de son hyper croissance, HAVAS DBi, l'agence conseil en data marketing du Groupe Havas, recherche unConsultantData scientist.2 ans d'expérience minimum dans le domaine de la data science et du data marketing.Rôle clé dans une agence en pleine croissance, avec une expertise sur des projets innovants. En étroite collaboration avec les...