Data Engineer

il y a 2 semaines


Paris, France Owkin Temps plein

**About us**:
Owkin is an agentic AI company on a mission to explore complex biology to speed up and scale research for the creation of new treatments and diagnostics for patients. Owkin K, our AI co-pilot combines unparalleled access to multimodal data, cutting-edge AI to understand biology and pioneering agentic AI to achieve Biological Artificial Superintelligence in the future.

Owkin is comprised of a group of specialized companies that work seamlessly together, combining deep expertise with a shared mission to accelerate biological discovery. Owkin K is the technology foundation of our ecosystem, it provides cutting-edge co-pilots incorporating high-quality data infrastructure and AI tools to power breakthroughs for researchers, customers and our companies.

Owkin has raised over $300 million through investments from leading biopharma companies, including Sanofi and BMS, and venture funds like Fidelity, GV and Bpifrance, among others.Owkin is seeking the best and brightest to join our fast-growing and dynamic team.

**About the role**:
We have identified the need for a 6-month CDD Data Engineer to assist through a period of transition in our Engineering team. You will be supporting the development and maintenance of data pipelines for scientific processing and quality assurance. You will participate in designing, optimizing, and maintaining ETL/ELT pipelines using Airflow, working within established frameworks to ensure reliability, scalability, and compliance with data governance standards.

Your primary responsibilities will include organizing and structuring data systems, ensuring accurate reporting of pipeline performance, and contributing to scientific and healthcare data processing workflows. The role requires attention to detail, the ability to manage multiple priorities, and strong collaboration skills to work effectively with engineers, data scientists, and researchers.

You will focus on streamlining production workflows, ensuring proper monitoring and operational efficiency, and implementing best practices for data governance and security.
- Operate and optimize ETL/ELT pipelines using Airflow.
- Support the structuring and organization of data systems in alignment with predefined architectures.
- Ensure timely and accurate reporting of data pipeline performance and operational issues.
- Follow data governance, security, and compliance standards in all data processing activities.
- Work on containerized data infrastructures using Docker and Kubernetes under supervision.
- Contribute to operational tasks related to scientific data processing and quality control.
- Implement optimizations in Python and SQL-based workflows following team guidelines.
- Work within established frameworks for data lake and data warehouse maintenance.
- Collaborate with engineers and researchers to define data processing requirements.
- Contribute to the standardization and monitoring of production data workflows.

**In particular, you will**:

- Support the design and optimization of data pipelines using Airflow.
- Develop and operate Python and SQL-based solutions for data processing.
- Contribute to the development of scalable ETL/ELT pipelines to process and transform datasets.
- Work closely with data scientists, business developers, software engineers, and biomedical researchers to deliver high-quality data solutions.
- Contribute to management and monitoring of containerized data infrastructures with Docker, Kubernetes, and cloud platforms.
- Follow best practices for data governance, security, and compliance in all workflows.
- Operate on the data architectures, including data lakes, data warehouses, and analytical insights platforms.
- Contribute to the productionization of data processing pipelines, ensuring efficiency and scalability in scientific data workflows.
- Position is based in our Paris office or remotely in France._

**About you**:

- Proficiency in Python and SQL.
- Familiarity with Airflow for workflow orchestration.
- Familiarity with cloud-based data storage and cloud-native processing concepts.
- Familiarity with containerization technologies such as Docker and Kubernetes.
- Knowledge of data governance and security fundamentals.
- Ability to work with structured and unstructured datasets in predefined formats.

**Please submit your CV in English**

LI-MD1

LI-MD1

**What we offer**:

- Flexible work organization
- Friendly and informal working environment
- Opportunity to work with an international team with high technical and scientific backgrounds

**Recruitment Process & Security**:

- Please complete the form and submit your CV.
- Owkin is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, sex, gender, sexual orientation, age, color, religion, national origin, protected veteran status or on the basis of disability.
- Legitimate Owkin interviews may be conducted over the phone, in person, or via an approved enterpr


  • Data Engineer

    Il y a 50 minutes


    Paris, Île-de-France MP DATA Temps plein

    En tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l'optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l'environnement Spark sera essentielle pour garantir des traitements performants, sécurisés et scalables.Vos...

  • Data Engineer

    il y a 2 semaines


    Paris, France MP DATA Temps plein

    Généraliste des RH et chargée de recrutementEn tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l’optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l’environnement Spark sera essentielle pour garantir des...

  • Data Engineer Expérimenté

    il y a 3 heures


    Paris, Île-de-France MP DATA Temps plein

    Nous recherchons un(e)Data Engineer expérimenté(e)pour intervenir sur lamise en production, la fiabilisation et l'évolutiond'une plateforme data moderne basée surAWS, Spark et Dataiku.Vous participerez activement à laconstruction et l'optimisationdes environnements de traitement de données à grande échelle, en lien étroit avec les équipes Data...

  • Data Engineer

    il y a 19 heures


    Paris, France MP DATA Temps plein

    En tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l'optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l'environnement Spark sera essentielle pour garantir des traitements performants, sécurisés et scalables.Vos...

  • Data Engineer On Premise

    il y a 2 semaines


    Paris, Île-de-France Craftman data Temps plein

    Localisation : Région parisienne (2 à 3 jours par semaine sur site)Budget indicatif : Niveau seniorDémarrage : Début février 2026Durée : 6 moisProfil recherché : Data Engineer On-Premise expérimentéPrincipales missions :Conception et développement de services orientés dataIntégration de composants techniquesRéalisation de tests, benchmarks et...

  • Data Engineer Dbt

    il y a 2 semaines


    Paris, France Craftman data Temps plein

    5/10 ans d?Exp Pour un client dans le domaine du Retail Le client recherche un Data Engineer dans le cadre de la mise en place d'un projet ambitieux autours de la DATA. le Data engineer devra avoir une culture du Software Engineering. Maitrise de DBT et Fabric Bonne maitrise de la CI/CD ( GitAction, Outils CI moderne ) Une connaissance de Airflow et des...


  • Paris, France Data Recrutement Temps plein

    L’ENTREPRISE : CABINET DE CONSEIL Cabinet de conseil en Architectiure management SI créé en 2009 50 collaborateurs Organisation en mode agile CA : 8 Millions Grande diversité de client LA MISSION : ACCOMPAGNER LES CLIENTS DANS LEURS STRATEGIES BIG DATA Sous la responsabilité d'un manager du département Architecture et Engineering Big Data, vous...

  • Data Engineer

    il y a 7 jours


    Paris, France MP Data Temps plein

    En tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l'optimisation et la fiabilisation de nos pipelines de données à grande échelle, au coeur de notre plateforme analytique. Votre expertise sur Databricks et l'environnement Spark sera essentielle pour garantir des traitements performants, sécurisés et scalables. Vos...


  • Paris, France MP DATA Temps plein

    Data Engineer Confirmé Databricks - IDF (H/F) En tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l’optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l’environnement Spark sera essentielle pour garantir des...


  • Paris, Île-de-France Mp Data Temps plein

    ESN spécialisée Data & IA pour les environnements industriels. Pour l'un de nos clients, nous recherchons un LLM Engineer chargé d'industrialiser les POC GenAI développés par les équipes Data Science et de déployer des solutions robustes et scalables en production.Développement et Industrialisation des POC LLM / GenAI.Conception et optimisation de...