Biomedical Data Engineer

2 weeks ago


Paris, France | Scienta Lab | Full-time

Permanent Contract

Paris - As soon as possible

Scienta Lab is a deeptech company harnessing artificial intelligence to transform the drug discovery and development process in immunology and inflammation.

Scienta Lab develops a foundation model for immune-mediated inflammatory diseases leveraging multimodal data to support translational research.

Scienta Lab is partnering with top-tier academic research institutions, biotechs, and pharmaceutical companies worldwide to develop its model. The company has a proven scientific track record in precision medicine for immunology with scientific publications in top-tier medical journals and conferences.

Scienta Lab is based at Biolabs Hôtel-Dieu. Join us and be part of our exciting journey to advance medical research in precision immunology.

Scienta Lab is seeking a skilled and motivated individual to build and enhance our data stack. As a Biomedical Data Engineer, you will play a critical role in sourcing, storing, documenting, and providing access to multimodal datasets related to immune-mediated inflammatory diseases (IMIDs). Your mission will include developing technical solutions to ensure the quality, accessibility, and usability of multimodal datasets, empowering various teams within the company to drive impactful research and innovation.

You will join the Biomedical Team and report directly to the Chief Scientific Officer (CSO).

**Main Missions**:

- Data sourcing: identify public datasets in immuno-inflammation - including clinical and molecular data - and develop technical solutions to automate the collection and integration of datasets.
- Data processing: design, develop, and maintain robust bioinformatic pipelines to automate the curation, cleaning, and preparation of Scienta Lab's data portfolio. Ensure the integrity and reliability of processed data for downstream analysis.
- Data annotation: implement and maintain high-quality annotations for datasets, ensuring they are comprehensive, accurate, and aligned with internal standards.
- Data documentation: manage and maintain clear and detailed documentation for all datasets, including metadata, provenance, and usage guidelines. Ensure that documentation adheres to industry best practices and facilitates reproducibility.
- Data visualization: create visualizations and conduct feasibility studies to evaluate datasets and support business decisions. Provide intuitive dashboards or tools to explore the datasets and extract actionable insights.
- Coding collaboration: work closely with the technical team to support their data modeling efforts, foster good software engineering practices, and ensure seamless integration of datasets into analytical workflows.
- Collaboration: work closely with business, scientific, and data science teams to ensure datasets are readily accessible, well-documented, and meet quality standards. Act as a point of contact for dataset-related inquiries and technical support.

**Who we are looking for**:

- Experience: 4+ years of experience in computer science, bioinformatics, computational biology, or a related field.
- Data Expertise: strong understanding of omics datasets (e.g., transcriptomics, proteomics) and clinical data structures.
- Data Interpretation: proven ability to create effective data visualizations and analyze complex datasets.
- Technical Skills:
  - Proficiency with bioinformatics tools, libraries, and databases relevant to omics data analysis (e.g., Bioconductor, NCBI databases).
  - Hands-on experience developing and optimizing bioinformatic pipelines using workflow management systems (e.g., Snakemake, Nextflow).
  - Strong coding skills in Python, with a solid understanding of programming best practices.
- Language Skills: Full professional proficiency in English.

**How to stand out**:

- Teamwork: demonstrated ability to collaborate effectively with biologists, researchers, and software engineers in a multidisciplinary environment.
- Problem-Solving: a solution-oriented mindset and a strong sense of service to meet project and team needs.
- Domain Knowledge: familiarity with immunology or immune-mediated diseases is a valuable asset.

**What we offer**:

- Competitive salary and benefits package
- Hybrid work culture with remote work policy
- Dynamic and enriching work environment
- Office at Biolabs Hôtel-Dieu - 1 parvis Notre-Dame 75004 Paris

Job type: Permanent contract (CDI)
Status: Cadre (executive/managerial status)

Salary: €50,000 to €67,299 per year

Flexible language requirements:

- French not required

Education:

- Master's degree or equivalent (Bac+5: Master / MBA) (Required)

Experience:

- Bioinformatics: 4 years (Required)
- Omics data: 4 years (Required)

Work location: On-site

