Data Engineer Intern

3 weeks ago


Paris, Île-de-France CAST Software Full-time

CAST, a software company based in Meudon, is the market leader in Software Intelligence.

Working at CAST R&D means being an important part of a highly talented, fast-paced, multicultural, Agile team.

Overview

We're building the foundation to ground AI with AAA Software Intelligence (Aggregated, Accurate, and Augmented) sourced from real-world software and technology projects. This role goes beyond manual curation: it's about using AI to empower AI. You will leverage LLMs, embeddings, and NLP tools to clean, enrich, and validate data, so that AI systems and autonomous agents can rely on it for training and contextual understanding.

Responsibilities


• Aggregate and structure data from software ecosystems (codebases, APIs, tickets, documentation, architecture specs).


• Apply LLMs, embeddings, and NLP tools to automate data cleaning, entity extraction, metadata tagging, and semantic annotation.


• Build and maintain semantic pipelines for LLM fine-tuning and RAG (Retrieval-Augmented Generation).


• Organize datasets into formats suitable for Agent-to-Agent (A2A) interactions: APIs, vector DBs, knowledge graphs, etc.


• Collaborate with AI teams to evolve schemas, prompts, labeling strategies, and evaluation data.


• Ensure strong data lineage, reproducibility, and version control.

Requirements


• Experience in data engineering, ML data ops, or structured data curation.


• Proficient in Python, with strong data pipeline skills (Pandas, PyArrow, regex, Airflow).


• Experience with LLMs or NLP tools (e.g., Hugging Face, spaCy, LangChain).


• Ability to use AI to clean, enrich, classify, and organize technical content.


• Strong understanding of tokenization, chunking, and model input preparation.


• Experience working with software project data: Git repos, APIs, technical documentation, etc.

Bonus Skills


• Knowledge of vector DBs (FAISS, Qdrant, Weaviate) or knowledge graphs (Neo4j, RDF, SPARQL).