Data Engineer
1 week ago
Jasper is the leading AI marketing platform, enabling the world's most innovative companies to reimagine their end-to-end marketing workflows and drive higher ROI through increased brand consistency, efficiency, and personalization at scale.
Jasper has been recognized as "one of the Top 15 Most Innovative AI Companies of 2024" by Fast Company and is trusted by nearly 20% of the Fortune 500 – including Prudential, Ulta Beauty, and Wayfair. Founded in 2021, Jasper is a remote-first organization with team members across the US, France, and Australia.
About The Role
Jasper Research is seeking an experienced Data Engineer to support our image research team by designing, scaling, and maintaining our data infrastructure and the data processing pipelines that power the training of state-of-the-art multimodal models.
In this role, you will work closely with our research scientists and research engineers to collect, clean, and process large-scale datasets from a variety of sources, ensuring that our models are built on the best possible data foundations.
This role is open to candidates located in France. It will be a hybrid setup, which requires you to come into the office when necessary. The office is based at Station F in Paris, the vibrant hub of the French startup ecosystem. Our efficient and lean team at Station F thrives on innovation and collaboration.
What you will do at Jasper
Design and implement end-to-end scalable data pipelines to ingest, transform, and load data into our data warehouse.
Analyze existing datasets and implement robust data validation, deduplication, and bias mitigation processes to ensure the highest quality and diversity of training data.
Create training sets from existing data, using classical computer vision algorithms, vision models and LLMs.
Optimize data loading, preprocessing, and augmentation workflows to eliminate bottlenecks and maximize training efficiency.
Document all data processes, schemas, and transformations to ensure full reproducibility and transparency for the research team.
Work hand-in-hand with research scientists and engineers to understand their data needs, provide actionable insights, and rapidly iterate on pipeline improvements.
Source new multi-modal data from public sources.
Qualifications
Bachelor's or Master's degree in Computer Science, Data Engineering, or a related field.
Strong experience as a Data Engineer or in a similar data-focused role.
Strong experience in image manipulation at scale and understanding of computer vision.
Hands-on experience with distributed computing frameworks and cloud platforms for distributed ML training.
Familiarity with cloud-based data warehousing and storage solutions (e.g., BigQuery).
Strong attention to detail, commitment to data quality, and a proactive approach to supporting research needs.
Preferred Qualifications
Knowledge of data transformation and enrichment techniques, including clustering, deduplication, and synthetic data generation.
Experience with vector databases for ML data.
Proficiency in Python and SQL for data manipulation and analysis.
Proficiency in at least one ML library (TensorFlow, PyTorch, JAX). PyTorch preferred.
Contributions to open-source data tools or projects.
Familiarity with data privacy and compliance regulations (GDPR, CCPA).
Benefits
Comprehensive healthcare plan: mutuelle coverage for hospitalisation and mental health care, provided through Alan
Flexible PTO with a FlexExperience budget (€552 annually) to help you make the most of your time away from work
FlexWellness program (€1,640 annually) to help support your personal health goals
Generous budget for home office setup
€1,375 annual learning and development stipend
-
Data Engineer
3 days ago
Paris, Île-de-France MP Data Full-time
As a Senior Data Engineer, you will play a key role in building, optimizing, and hardening our large-scale data pipelines, at the heart of our analytics platform. Your expertise in Databricks and the Spark environment will be essential to ensuring performant, secure, and scalable processing. Your...
-
Data Engineer
6 days ago
Paris, Île-de-France MP DATA Full-time
As a Senior Data Engineer, you will play a key role in building, optimizing, and hardening our large-scale data pipelines, at the heart of our analytics platform. Your expertise in Databricks and the Spark environment will be essential to ensuring performant, secure, and scalable processing. Your...
-
Experienced Data Engineer
6 days ago
Paris, Île-de-France MP DATA Full-time
We are looking for an experienced Data Engineer to work on the production deployment, reliability, and evolution of a modern data platform built on AWS, Spark, and Dataiku. You will actively contribute to building and optimizing large-scale data processing environments, working closely with the Data teams...
-
Graduate Data Engineer
1 day ago
Paris, Île-de-France Data Reply Full-time
Graduate Data Engineer
Tasks
• Implementing new use cases
• Mapping data and data flows
• Implementing data analysis and processing pipelines
• Industrializing data flows and their visualization through dashboards and reporting
• Carrying out unit tests and integration tests
Benefits
• Structured career progression – at Reply, we encourage career...
-
Data Scientist NLP/LLM Engineer (experienced)
7 days ago
Paris, Île-de-France Mp Data Full-time
IT services company (ESN) specializing in Data & AI for industrial environments. For one of our clients, we are looking for an LLM Engineer to industrialize the GenAI POCs developed by the Data Science teams and to deploy robust, scalable solutions in production. Development and industrialization of LLM / GenAI POCs. Design and optimization of...
-
Data Scientist NLP/LLM Engineer (experienced)
3 days ago
Paris, Île-de-France MP DATA Full-time
IT services company (ESN) specializing in Data & AI for industrial environments. For one of our clients, we are looking for an LLM Engineer to industrialize the GenAI POCs developed by the Data Science teams and to deploy robust, scalable solutions in production. Development and industrialization of LLM / GenAI POCs. Design and optimization of...
-
Data Engineer AWS
1 week ago
Paris, Île-de-France Data Reply FR Full-time
Data Reply is a subsidiary of the Reply group offering a wide range of advanced analytics and AI-powered data services. We operate across various industries and business functions, working directly with senior professionals and general managers to enable them to achieve meaningful results through...
-
Data Engineer
2 weeks ago
Paris, Île-de-France Full-time
In a few words
Cherry Pick is looking for a Data Engineer / Python / Azure for one of its clients operating in the energy sector.
Description
Mission context
As part of strengthening the Data team, we are looking for a Data Engineer specialized in Python, PySpark, and Microsoft Fabric, working in an Azure environment...
-
DATA ENGINEER
1 week ago
Paris, Île-de-France Collective Full-time
Budget: 500 euros/day
Mission: Data engineer
Location: Paris 17
Start: ASAP
Mandatory on-site days: 5 days/week
Experience: 5-8 years minimum
Daily rate (TJM): 500
MISSION BRIEF – DATA ENGINEER AZURE (KPI, DBT, BUSINESS)
Title
Data Engineer Azure – KPI, dbt (Data Build Tool) & business understanding
Context
As part of strengthening its team...
-
Data Engineer
1 week ago
Paris, Île-de-France Metroscope Full-time
We are looking for a Data Engineer - Data Scientist for our software team, which currently consists of 2 Engineering Managers, 6 full-stack developers, 3 data scientists, 1 data engineer, 2 platform engineers, 2 PMs, and 1 designer. You will join the Data Platform team, report to an Engineering Manager, and work in...