ML Data Engineer

il y a 7 jours


Toulouse Munich DEU, France autonomous-teaming Temps plein


Your mission
  • Design and maintain high-throughput, scalable pipelines to ingest and organize large volumes of time-series camera and sensor data (RGB, IR, thermal, acoustic, depth, IMU).
  • Own, curate, and continuously improve computer vision datasets for object detection and classification, ensuring high-quality, diverse, and statistically representative data.
  • Build and operate active learning loops to prioritize high-value samples and accelerate dataset improvements.
  • Write robust preprocessing and transformation pipelines using Python, NumPy, Pandas, and Albumentations for large-scale computer vision workloads.
  • Manage labeling workflows, including automation, QA validation, annotation consistency checks, and dataset versioning.
  • Collaborate with ML Engineers to fine-tune, train, and evaluate detection models, feeding insights back into data generation and selection.
  • Analyze model weaknesses, blind spots, bias, and drift to derive actionable data improvements.
  • Create internal tools and dashboards to visualize, audit, and analyze dataset quality, diversity, long-tail distributions, and model performance gaps.


Your profile
  • Strong experience in Python and data processing frameworks (Pandas, NumPy, vectorized operations, multiprocessing).
  • Hands-on experience building ETL/ELT pipelines for ingesting, transforming, and structuring large video and sensor datasets.
  • Experience with data orchestration and lifecycle management for ML and computer vision workflows, including dataset versioning and reproducibility.
  • Solid understanding of object detection pipelines (Detectron2, MMDetection, COCO format, bounding-box standards).
  • Experience with active learning, uncertainty sampling, or semi-supervised dataset workflows.
  • Familiarity with data annotation platforms (CVAT, Label Studio) and automated QA/consistency checks.
  • Strong grasp of evaluation metrics for object detection (IoU, mAP, precision-recall curves, class-wise metrics).
  • Comfortable with databases (SQL/NoSQL), file systems, and the management of large-scale image, video, and sensor datasets.
  • Ability to work cross-functionally with perception, deployment, robotics, and data infrastructure teams.
  • Fluent in English, German and/or French are a plus
Nice to Have:
  • Experience with cloud storage and MLOps tools (AWS S3, MinIO, ClearML, MLFlow, Weights & Biases).
  • Familiarity with ROS / robotics data formats (bag files, TF trees, sensor_msgs), Docker, or embedded ML workflows.
  • Prior work with robotics, drones, or multi-sensor perception systems, including IR, LiDAR, radar, or audio datasets.
Meta:
  • Outside-the-box creativity with a blend of conceptual and systematic design thinking.
  • High intrinsic motivation, attention to detail, and strong problem-solving mindset.
  • Structured, methodical, and reliable execution, even under uncertainty.
  • Humble, collaborative, and mission-driven — values collective success over ego.
  • High ethical standards and disciplined work ethic.
  • Extra-curricular achievements, leadership, or unique projects are a plus.
  • NATO-aligned nationality or close ally citizenship is required.
  • Successful candidates must obtain security clearance.


Why us?

Join us to shape the future of AI-driven defense

Do you feel that you fit the description, but don't think you fulfill all the criteria 100%? Apply to us anyway.   
We look forward to receiving your detailed application via our online form. 

The world is changing. Exponential technologies are enabling new types of security threats. We are committed to staying ahead by building nimble, scalable, and cost-effective defences. We are looking for passionate developers who are eager to create exceptional products, safeguard our freedom, and strengthen the resilience of democracies.

About us

We are a defence-tech start-up specializing in machine vision solutions. If you have a passion for cutting-edge innovation, and drive to use your skills to create next generation solutions, this is an opportunity for you

What we do: We are developing solutions that enable computers and sensors to collaborate as teams, working together to address emerging security challenges. Our primary mission is to defend against AI-powered asymmetric threats at scale, such as drone swarms and other UXVs.

Who we are: Based in Munich, Berlin and Bordeaux/Toulouse we are rapidly expanding across Europe with plans to open more office hubs soon. We embrace a hybrid work culture - valuing the collaborations that happens in the office, while also empowering our team members to work remotely with responsibility and autonomy. 


  • Consultant Ml

    il y a 2 semaines


    Toulouse, France MP Data Temps plein

    Dans un contexte de forte croissance de nos activités Data & IA, MP DATA renforce son équipe toulousaine et recherche un ML Engineer (F/H) avec au moins 2 ans d'expérience dans l'industrialisation de modèles IA/ML. Chez MP DATA, nous accompagnons les acteurs industriels, technologiques et scientifiques dans la mise en production de solutions...

  • Consultant ML/AI Engineer

    il y a 2 semaines


    Toulouse, France MP Data Temps plein

    MP DATA est une société spécialisée dans l'acquisition, le traitement, et la valorisation des données. Depuis sa création en 2015, MP DATA accompagne ses clients, majoritairement industriels, dans le management de leur performance et l'exploitation de leur données. Les collaborateurs, tous issus de grandes écoles, incarnent au quotidien les valeurs...

  • ML Data Engineer

    il y a 7 jours


    Toulouse, Occitanie, France autonomous-teaming Temps plein

    Your missionDesign and maintain high-throughput, scalable pipelines to ingest and organize large volumes of time-series camera and sensor data (RGB, IR, thermal, acoustic, depth, IMU).Own, curate, and continuously improve computer vision datasets for object detection and classification, ensuring high-quality, diverse, and statistically representative...

  • ML Data Engineer

    il y a 7 jours


    Toulouse, Occitanie, France Autonomous Teaming Temps plein

    Your missionDesign and maintain high-throughput, scalable pipelines to ingest and organize large volumes of time-series camera and sensor data (RGB, IR, thermal, acoustic, depth, IMU).Own, curate, and continuously improve computer vision datasets for object detection and classification, ensuring high-quality, diverse, and statistically representative...


  • Toulouse, France Canonical Temps plein

    Python and Kubernetes Software Engineer - Data, AI/ML & AnalyticsJoin to apply for the Python and Kubernetes Software Engineer - Data, AI/ML & Analytics role at CanonicalContinue with Google Continue with GooglePython and Kubernetes Software Engineer - Data, AI/ML & Analytics4 months ago Be among the first 25 applicantsJoin to apply for the Python and...

  • Data Scientist H/F

    il y a 7 jours


    Toulouse, France MP Data Temps plein

    En tant que Data Scientist, vous serez responsable du développement de modèles avancés et de leur intégration dans des pipelines de production, en collaboration étroite avec les Data Engineers, ML Engineers et équipes métier. Vos missions incluront : - Développement & Modélisation - Concevoir, développer et optimiser des modèles de machine...

  • Data Engineer

    il y a 2 semaines


    Toulouse, Occitanie, France Craftman data Temps plein

    Nous recherchons un Data Engineer expérimenté (+4 ans) pour renforcer une équipe d?analyse marché dédiée aux données de réservation, partenaires et concurrence. La mission consiste à produire et optimiser des dashboards PowerBI (datasets, modélisation, visualisation), industrialiser les pipelines data et contribuer à l?exploitation analytique de...

  • Data Scientist

    il y a 6 jours


    Toulouse, France MP Data Temps plein

    MP DATA est une société spécialisée dans l'acquisition, le traitement, et la valorisation des données. Depuis sa création en 2015, MP DATA accompagne ses clients, majoritairement industriels, dans le management de leur performance et l'exploitation de leur données. Les collaborateurs, tous issus de grandes écoles, incarnent au quotidien les valeurs...

  • Consultant Data Engineer

    il y a 2 semaines


    Toulouse, France MP Data Temps plein

    Dans un contexte de croissance continue de nos activités Data & IA, MP DATA renforce son équipe toulousaine et recherche un Data Engineer (F/H) motivé, autonome et passionné par les architectures et pipelines de données. Chez MP DATA, nous accompagnons les acteurs industriels et technologiques dans la conception de solutions data robustes, scalables et...

  • Consultant Data Engineer

    il y a 2 semaines


    Toulouse, France MP Data Temps plein

    MP DATA est une société spécialisée dans l'acquisition, le traitement, et la valorisation des données. Depuis sa création en 2015, MP DATA accompagne ses clients, majoritairement industriels, dans le management de leur performance et l'exploitation de leur données. Les collaborateurs, tous issus de grandes écoles, incarnent au quotidien les valeurs...