ML Data Engineer

7 days ago


Toulouse, Occitanie, France | Autonomous Teaming | Full-time

Your mission

  • Design and maintain high-throughput, scalable pipelines to ingest and organize large volumes of time-series camera and sensor data (RGB, IR, thermal, acoustic, depth, IMU).
  • Own, curate, and continuously improve computer vision datasets for object detection and classification, ensuring high-quality, diverse, and statistically representative data.
  • Build and operate active learning loops to prioritize high-value samples and accelerate dataset improvements.
  • Write robust preprocessing and transformation pipelines using Python, NumPy, Pandas, and Albumentations for large-scale computer vision workloads.
  • Manage labeling workflows, including automation, QA validation, annotation consistency checks, and dataset versioning.
  • Collaborate with ML Engineers to fine-tune, train, and evaluate detection models, feeding insights back into data generation and selection.
  • Analyze model weaknesses, blind spots, bias, and drift to derive actionable data improvements.
  • Create internal tools and dashboards to visualize, audit, and analyze dataset quality, diversity, long-tail distributions, and model performance gaps.

Your profile

  • Strong experience in Python and data processing frameworks (Pandas, NumPy, vectorized operations, multiprocessing).
  • Hands-on experience building ETL/ELT pipelines for ingesting, transforming, and structuring large video and sensor datasets.
  • Experience with data orchestration and lifecycle management for ML and computer vision workflows, including dataset versioning and reproducibility.
  • Solid understanding of object detection pipelines (Detectron2, MMDetection, COCO format, bounding-box standards).
  • Experience with active learning, uncertainty sampling, or semi-supervised dataset workflows.
  • Familiarity with data annotation platforms (CVAT, Label Studio) and automated QA/consistency checks.
  • Strong grasp of evaluation metrics for object detection (IoU, mAP, precision-recall curves, class-wise metrics).
  • Comfortable with databases (SQL/NoSQL), file systems, and the management of large-scale image, video, and sensor datasets.
  • Ability to work cross-functionally with perception, deployment, robotics, and data infrastructure teams.
  • Fluent in English; German and/or French is a plus.

Nice to Have:

  • Experience with cloud storage and MLOps tools (AWS S3, MinIO, ClearML, MLflow, Weights & Biases).
  • Familiarity with ROS / robotics data formats (bag files, TF trees, sensor_msgs), Docker, or embedded ML workflows.
  • Prior work with robotics, drones, or multi-sensor perception systems, including IR, LiDAR, radar, or audio datasets.

Meta:

  • Outside-the-box creativity with a blend of conceptual and systematic design thinking.
  • High intrinsic motivation, attention to detail, and strong problem-solving mindset.
  • Structured, methodical, and reliable execution, even under uncertainty.
  • Humble, collaborative, and mission-driven — values collective success over ego.
  • High ethical standards and disciplined work ethic.
  • Extra-curricular achievements, leadership, or unique projects are a plus.
  • NATO-aligned nationality or close ally citizenship is required.
  • Successful candidates must obtain security clearance.

Why us?
Join us to shape the future of AI-driven defense.

Do you feel that you fit the description, but don't think you fulfill all the criteria 100%? Apply to us anyway.

We look forward to receiving your detailed application via our online form.

The world is changing. Exponential technologies are enabling new types of security threats. We are committed to staying ahead by building nimble, scalable, and cost-effective defences. We are looking for passionate developers who are eager to create exceptional products, safeguard our freedom, and strengthen the resilience of democracies.

About Us
We are a defence-tech start-up specializing in machine vision solutions. If you have a passion for cutting-edge innovation and the drive to use your skills to create next-generation solutions, this is an opportunity for you.

What we do:
We are developing solutions that enable computers and sensors to collaborate as teams, working together to address emerging security challenges. Our primary mission is to defend against AI-powered asymmetric threats at scale, such as drone swarms and other UXVs.

Who we are:
Based in Munich, Berlin, and Bordeaux/Toulouse, we are rapidly expanding across Europe, with plans to open more office hubs soon. We embrace a hybrid work culture, valuing the collaboration that happens in the office while empowering our team members to work remotely with responsibility and autonomy.

