Data Engineer
il y a 2 semaines
Aqemia is a next-generation pharmatech reinventing drug discovery with quantum-inspired physics and generative AI. Our mission: design innovative small-molecule drug candidates for dozens of critical diseases, faster and smarter, without relying on experimental data. Unlike traditional approaches, Aqemia starts drug discovery purely in silico. By combining physics-based models with large language models trained on proprietary data, we identify promising molecules with high accuracy before synthesis. We've already delivered multiple preclinical successes and secured strategic partnership. Our internal pipeline is growing fast, with several programs in in vivo optimization. We're a team of 65+ based in Paris and London, we bring together chemists, physicists, engineers, and machine learning experts to push the boundaries of what's possible in early-stage drug discovery.
The Role
As an Intermediate Data Engineer in the Data Team, you will work closely with scientists, engineers, and drug discovery portfolio team members to operate and improve the data flows that underpin Aqemia's research and discovery activities. You will handle compounds and experimental results, ensuring these datasets are correctly ingested, validated, modelled, and organised for reliable downstream use.
Your work strengthens the link between experimental results and computational discovery by ensuring high data quality and continuity across scientific datasets and systems. The role requires strong scientific literacy, a clear sense of data ownership, and a proactive approach to improving data structures, metadata, and the tools that scientists rely on. What You'll do
- Operate and maintain pipelines for experimental and molecular datasets, including compound and assay results
- Ensure accurate and timely ingestion, validation, and organisation of experimental data into shared systems
- Design and maintain clear, well-structured data models that reflect scientific and operational needs
- Work directly with scientists and engineers to capture requirements accurately and translate them into data solutions
- Contribute to data presentation, observability and visualization, making results accessible and interpretable for non-technical stakeholders
- Contribute to the evolution of tools, processes, and workflows for managing scientific and metadata assets
- Document datasets and pipelines clearly, enabling knowledge sharing across teams
- Participate actively in collaborative problem-solving, sprint planning, and cross-team discussions
- 2–4 years of experience in data engineering, ideally with exposure to scientific, pharmaceutical, or techbio domains
- Strong programming skills in Python and SQL
- Experience with data pipelines, dataset integration, and cloud-based data environments (AWS and Snowflake preferred)
- Exposure to data presentation or visualisation tools and techniques
- Experience with chemical, or biological datasets, or knowledge of the drug discovery process
- Excellent collaboration and communication skills, with proven ability to capture requirements and deliver solutions in multi-disciplinary teams
- You thrive in collaborative environments and value clear, precise communication
- You are curious about scientific data and motivated to understand research workflows
- You take pride in building data solutions that are trusted, interpretable, and directly empower discovery
- You care about accuracy, maintainability, and ensuring the needs of stakeholders are fully understood and addressed
At Aqemia, engineers don't just build software, they help discover real drugs.You'll work at the intersection of AI, physics and chemistry, transforming bold scientific ideas into robust, production-grade tools that accelerate discovery.
DeepTech Mission : Build the platform that powers AI-driven drug discovery, combining quantum-inspired physics with generative models Real-World Impact : Every feature shipped helps scientists prioritize molecules and design better candidates, faster Modern Stack & Challenges : Python, FastAPI, Airflow, Snowflake, Kubernetes, ML workflows, scientific infra, data engineering at scale High Ownership, High Impact : Engineers contribute to architecture, tooling, and scientific decision-making Interdisciplinary Team : Collaborate with chemists, physicists, ML researchers, and product teams Prime Locations : Central Paris or London offices, with 2 remote days/week Strategic Traction : Backed by $100M in funding and a $140M partnership with Sanofi
Join us if you're excited to shape the future of AI-driven drug discovery, and want your code to change the course of real diseases. We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
-
Data Engineer
il y a 1 semaine
Paris, Île-de-France MP DATA Temps pleinEn tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l'optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l'environnement Spark sera essentielle pour garantir des traitements performants, sécurisés et scalables.Vos...
-
Data Engineer Expérimenté
il y a 1 semaine
Paris, Île-de-France Mp Data Temps pleinNous recherchons un(e) Data Engineer expérimenté(e) pour intervenir sur la mise en production, la fiabilisation et l'évolution d'une plateforme data moderne basée sur AWS, Spark et Dataiku.Vous participerez activement à la construction et l'optimisation des environnements de traitement de données à grande échelle, en lien étroit avec les équipes...
-
Data Engineer Expérimenté
il y a 1 semaine
Paris, Île-de-France MP DATA Temps pleinNous recherchons un(e)Data Engineer expérimenté(e)pour intervenir sur lamise en production, la fiabilisation et l'évolutiond'une plateforme data moderne basée surAWS, Spark et Dataiku.Vous participerez activement à laconstruction et l'optimisationdes environnements de traitement de données à grande échelle, en lien étroit avec les équipes Data...
-
Senior AWS Data Engineer
il y a 3 jours
Paris, Île-de-France Data Reply Temps pleinSenior AWS Data EngineerTasksImplement new use cases and data pipelines on AWSMap data and data flows across cloud platformsDevelop and industrialize data pipelines and processing workflowsDesign and build dashboards and reporting toolsPerform unit and integration testing of data flowsParticipate in Data Reply events (Reply Xchange, hackathons, AWS summits,...
-
Graduate Data Engineer
il y a 3 jours
Paris, Île-de-France Data Reply Temps pleinGraduate Data EngineerTasks• Implementing new use cases• Mapping data and data flows• Implementing data analysis and processing pipelines• Industrializing data flows and their visualization through dashboards and reporting• Carrying out unit tests and integration tests Benefits• Structured career progression – at Reply, we encourage career...
-
Data Scientist NLP/LLM Engineer confirmé(e)
il y a 6 jours
Paris, Île-de-France MP DATA Temps pleinESN spécialisée Data & IA pour les environnements industriels. Pour l'un de nos clients, nous recherchons unLLM Engineerchargé d'industrialiser lesPOC GenAIdéveloppés par les équipes Data Science et de déployer des solutions robustes et scalables en production.Développement et Industrialisation des POC LLM / GenAI.Conception et optimisation de...
-
Data Engineer Tech Lead
il y a 3 jours
Paris, Île-de-France Craftman data Temps pleinInformations principales :Secteur : AssuranceLocalisation : Paris (3 jours par semaine sur site)Date de démarrage : Dès que possibleLangue : FrançaisDurée de mission : Jusqu?en février 2028Profil recherché : Data Engineer (Spark / Hadoop)Expérience minimum : 7 ansCompétences techniques indispensables :Solide maîtrise de JavaExcellente maîtrise de...
-
Paris, Île-de-France Craftman data Temps pleinNous recherchons une prestation de Data Engineer avec des compétences sur Python/PySpark/Databricks sur un environnement cloud AWS.Le Data Engineer sera responsable de la conception, du développement et de la mise en production de l'architecture de données.Il devra notamment :Collecter les exigences des métiers et des utilisateursConcevoir l'architecture...
-
LLM Engineer confirmé(e)
il y a 1 semaine
Paris, Île-de-France Mp Data Temps pleinESN spécialisée Data & IA pour les environnements industriels. Pour l'un de nos clients, nous recherchons un LLM Engineer chargé d'industrialiser les POC GenAI développés par les équipes Data Science et de déployer des solutions robustes et scalables en production.Développement et Industrialisation des POC LLM / GenAI.Conception et optimisation de...
-
Data Engineer
il y a 2 semaines
Paris, Île-de-France RED Global Temps plein***Data Engineer – Paris – Hybride***RED Global est à la recherche d'unData Engineerpour venir rejoindre les équipes de l'un de nos clients à Paris dans le cadre de leurs projets en cours.Compétences requises :Expérience de 3 à 5 ans maximum en tant que Data EngineerBonne maitrise de PythonExpérience solide avec Docker et l'orchestration des...