Engineer (F/M): Prodromal Trajectories of Neurodegenerative Diseases: Linking AP-HP Clinical Data Warehouse and National Health Data

Posted 3 days ago


Paris, Île-de-France · Inria · Full-time

The job description below is in English.

Contract type: Fixed-term contract (CDD)

Renewable contract: Yes

Required degree level: Master's degree (Bac+5) or equivalent

Position: Contract research engineer

Desired experience level: Recent graduate

Context and key aspects of the position

The ARAMIS team (Algorithms, Models and Methods for Images and Signals in Medical Research) is an Inria team based at the Paris Brain Institute that develops computational methods for studying neurodegenerative diseases.

Within the framework of the PRAIRIE-PSAI initiative, we are looking for a Research Engineer to model longitudinal care trajectories and identify predictive patterns of neurodegenerative disease progression using large EHR/EMR data sets.

We have established a linkage between two major health data infrastructures:

  • The AP-HP Clinical Data Warehouse (EDS-APHP): over 11 million patients, detailed clinical data (medical notes, laboratory results, imaging).
  • The French National Health Data System (SNDS): 99% population coverage, longitudinal data on prescriptions, hospitalizations, and consultations since 2009.

This linkage encompasses over 150,000 patients with neurological diseases presenting motor symptoms: Parkinson's disease, multiple sclerosis, amyotrophic lateral sclerosis (ALS).

The central objective of this project is to correlate what happens before diagnosis (care trajectories, prescriptions, symptoms) with what happens after (evolution, prognosis) to understand:

  • How long before diagnosis does the disease actually begin?
  • What are the first detectable signals in health data?
  • Which factors (medications, comorbidities, care) modify disease risk or progression?
Assignment

The engineer will work on analyzing longitudinal care trajectories and identifying predictive patterns of neurodegenerative disease progression.

1. Extraction and Feature Engineering

  • Build features from trajectories of prescriptions, hospitalizations, and medical procedures 5–10 years pre-diagnosis.
  • Integrate heterogeneous data (ICD-10 codes, ATC codes, biological results, clinical notes via NLP).
  • Handle temporal irregularity of observations (data collected during opportunistic consultations).
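
As an illustration of the kind of feature construction listed above, here is a minimal sketch, not the team's actual pipeline. It assumes events are available as (date, code) pairs and aligns irregularly spaced observations on yearly bins counted backwards from the diagnosis date. The function name `prediagnosis_features`, the 10-year default window, and the toy events are all hypothetical.

```python
from collections import Counter
from datetime import date


def prediagnosis_features(events, diagnosis_date, min_years=0, max_years=10):
    """Count coded events per (code, years-before-diagnosis) bin.

    events: iterable of (event_date, code) pairs, e.g. ATC or ICD-10 codes.
    Only events in [min_years, max_years) years before diagnosis are kept;
    irregular visit dates are aligned on yearly bins (an assumption).
    """
    counts = Counter()
    for event_date, code in events:
        days_before = (diagnosis_date - event_date).days
        if days_before <= 0:
            continue  # on or after diagnosis: outside the prodromal window
        years_before = int(days_before // 365.25)
        if min_years <= years_before < max_years:
            counts[(code, years_before)] += 1
    return dict(counts)


# Fully synthetic toy trajectory (hypothetical patient, dates, and events).
events = [
    (date(2012, 3, 1), "N06A"),    # ATC class: antidepressants
    (date(2017, 6, 15), "K59.0"),  # ICD-10: constipation
    (date(2017, 9, 2), "K59.0"),
    (date(2019, 1, 10), "G20"),    # recorded after diagnosis, ignored
]
features = prediagnosis_features(events, diagnosis_date=date(2018, 6, 1))
print(features)
```

Yearly bins are only one possible choice; finer bins or time-since-last-event features would handle irregular sampling differently.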

2. Statistical Modeling and Machine Learning

Method selection will be adapted to the candidate's profile and interests. Several approaches are possible.

Classical methods:

  • Survival models (Cox, Fine-Gray) to predict time to diagnosis or clinical events.
  • Mixed-effects models to model longitudinal biomarker trajectories.
  • Joint models combining longitudinal evolution and survival.
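
To make the first of these classical methods concrete, the sketch below evaluates the Cox partial log-likelihood in plain Python. It is an illustrative assumption about how such a model could be scored, not a description of the project's code: it assumes no tied event times, and the function name `cox_partial_log_likelihood` and the toy data are hypothetical. In practice an established survival analysis library would normally be used.

```python
import math


def cox_partial_log_likelihood(beta, covariates, times, events):
    """Cox partial log-likelihood, assuming no tied event times.

    beta: list of coefficients.
    covariates: list of per-subject covariate lists.
    times: follow-up time per subject.
    events: True if the event (e.g. diagnosis) was observed, False if censored.
    """
    # Linear predictor x_i . beta for every subject.
    scores = [sum(b * x for b, x in zip(beta, row)) for row in covariates]
    total = 0.0
    for i, (t_i, observed) in enumerate(zip(times, events)):
        if not observed:
            continue  # censored subjects contribute only through risk sets
        # Risk set: subjects still under observation at time t_i.
        risk = sum(math.exp(scores[j]) for j, t_j in enumerate(times) if t_j >= t_i)
        total += scores[i] - math.log(risk)
    return total


# Synthetic toy data: one binary covariate, three subjects, all events observed.
covariates = [[1.0], [0.0], [1.0]]
times = [1.0, 2.0, 3.0]
events = [True, True, True]
ll_null = cox_partial_log_likelihood([0.0], covariates, times, events)
print(ll_null)
```

With all coefficients at zero, each observed event contributes minus the log of its risk-set size, which gives a simple sanity check for the implementation.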

Exploratory methods (deep learning):

  • DeepSurv: neural networks for survival analysis with non-linear relationships.
  • Transformers / BERT: sequence models to capture complex temporal patterns in care trajectories.
  • Trajectory embeddings: vector representations of care pathways for clustering or prediction.
  • Recurrent Neural Networks (RNN / LSTM).

The project encourages methodological innovation while maintaining strong grounding in clinical questions. The goal is to do both: answer important clinical questions and explore new statistical/ML approaches.

Main activities

  • bibliographical work and literature review
  • data management of large data sets of medical records
  • design, implementation and conduct of complex analysis plans
  • critical analysis of results in light of the current literature
  • presentation of results at scientific conferences and in peer-reviewed scientific journals
Skills

Required:

  • advanced statistics and/or machine learning (master level)
  • scientific computing including data management in Python and/or R (master level)
  • able to propose and implement complex data analysis plans

Understanding the key challenges of real-world data analysis would be a plus.

Languages: fluent in scientific English (oral and written)

Interpersonal skills: able to work in multidisciplinary teams at the interface between statistics, medicine and epidemiology.

Other valued qualities: interest in neurodegenerative diseases

Benefits
  • Subsidized meals
  • Partial reimbursement of public transport costs
  • Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
  • Possibility of teleworking and flexible organization of working hours
  • Professional equipment available (videoconferencing, loan of computer equipment, etc.)
  • Social, cultural and sports events and activities
  • Access to vocational training
  • Social security coverage
General information
  • Theme/Domain: Computational neuroscience and medicine

Biology and health, Life and earth sciences (BAP A)
- City: Paris
- Inria center: Centre Inria de Paris
- Desired start date:
- Contract duration: 12 months
- Application deadline:

Note: Applications must be submitted online via the Inria website. Processing of applications sent through other channels is not guaranteed.

Application instructions

Defense security:

This position is likely to be located in a restricted-access zone (zone à régime restrictif, ZRR), as defined in decree n° relating to the protection of the nation's scientific and technical potential (PPST). Authorization to access such a zone is granted by the head of the institution, following a favorable ministerial opinion, as defined in the order of 3 July 2012 relating to the PPST. An unfavorable ministerial opinion for a position located in a ZRR would result in cancellation of the recruitment.

Recruitment policy:

As part of its diversity policy, all Inria positions are open to people with disabilities.

Contacts
  • Inria team: ARAMIS
  • Recruiter:

Durrleman Stanley /

Keys to success

The candidate should be motivated to contribute to research projects in an interdisciplinary environment: eager to learn independently under the guidance of supervisors, curious about the research conducted by peers, and motivated to help make scientific contributions to the field.

In short:

  • Part hacker, part researcher: autonomous, curious, able to debug and quickly learn new libraries/methods.
  • Pragmatic: knows how to balance methodological rigor and efficiency.
  • Up to date on the latest methods: follows recent conferences and publications (NeurIPS, ICML, JMLR, Bioinformatics, etc.) and tests new approaches.
  • Interest in medical applications: motivated by clinical and public health impact, not just algorithmic performance.

In our team, we value both the ability to answer important clinical questions and the desire to explore new statistical and computational methods.

About Inria

Inria is the French national research institute for digital science and technology. It employs 2,600 people. Its 215 agile project teams, generally run jointly with academic partners, involve more than 3,900 scientists in meeting the challenges of digital technology, often at the interface with other disciplines. The institute draws on a wide range of talent in more than forty different professions. 900 research and innovation support staff help scientific and entrepreneurial projects with worldwide impact to emerge and grow. Inria works with many companies and has supported the creation of more than 200 start-ups. The institute thus strives to meet the challenges of the digital transformation of science, society and the economy.

