ML Data Engineer

2 weeks ago


Toulouse, France / Munich, Germany · Autonomous Teaming · Full-time


Your mission
  • Design and maintain high-throughput, scalable pipelines to ingest and organize large volumes of time-series camera and sensor data (RGB, IR, thermal, acoustic, depth, IMU).
  • Own, curate, and continuously improve computer vision datasets for object detection and classification, ensuring high-quality, diverse, and statistically representative data.
  • Build and operate active learning loops to prioritize high-value samples and accelerate dataset improvements.
  • Write robust preprocessing and transformation pipelines using Python, NumPy, Pandas, and Albumentations for large-scale computer vision workloads.
  • Manage labeling workflows, including automation, QA validation, annotation consistency checks, and dataset versioning.
  • Collaborate with ML Engineers to fine-tune, train, and evaluate detection models, feeding insights back into data generation and selection.
  • Analyze model weaknesses, blind spots, bias, and drift to derive actionable data improvements.
  • Create internal tools and dashboards to visualize, audit, and analyze dataset quality, diversity, long-tail distributions, and model performance gaps.


Your profile
  • Strong experience in Python and data processing frameworks (Pandas, NumPy, vectorized operations, multiprocessing).
  • Hands-on experience building ETL/ELT pipelines for ingesting, transforming, and structuring large video and sensor datasets.
  • Experience with data orchestration and lifecycle management for ML and computer vision workflows, including dataset versioning and reproducibility.
  • Solid understanding of object detection pipelines (Detectron2, MMDetection, COCO format, bounding-box standards).
  • Experience with active learning, uncertainty sampling, or semi-supervised dataset workflows.
  • Familiarity with data annotation platforms (CVAT, Label Studio) and automated QA/consistency checks.
  • Strong grasp of evaluation metrics for object detection (IoU, mAP, precision-recall curves, class-wise metrics).
  • Comfortable with databases (SQL/NoSQL), file systems, and the management of large-scale image, video, and sensor datasets.
  • Ability to work cross-functionally with perception, deployment, robotics, and data infrastructure teams.
  • Fluent in English; German and/or French is a plus.
Nice to Have:
  • Experience with cloud storage and MLOps tools (AWS S3, MinIO, ClearML, MLFlow, Weights & Biases).
  • Familiarity with ROS / robotics data formats (bag files, TF trees, sensor_msgs), Docker, or embedded ML workflows.
  • Prior work with robotics, drones, or multi-sensor perception systems, including IR, LiDAR, radar, or audio datasets.
Meta:
  • Outside-the-box creativity with a blend of conceptual and systematic design thinking.
  • High intrinsic motivation, attention to detail, and strong problem-solving mindset.
  • Structured, methodical, and reliable execution, even under uncertainty.
  • Humble, collaborative, and mission-driven — values collective success over ego.
  • High ethical standards and disciplined work ethic.
  • Extra-curricular achievements, leadership, or unique projects are a plus.
  • NATO-aligned nationality or close ally citizenship is required.
  • Successful candidates must obtain security clearance.


Why us?

Join us to shape the future of AI-driven defence.

Do you feel that you fit the description but don't think you meet every criterion? Apply anyway.
We look forward to receiving your detailed application via our online form. 

The world is changing. Exponential technologies are enabling new types of security threats. We are committed to staying ahead by building nimble, scalable, and cost-effective defences. We are looking for passionate developers who are eager to create exceptional products, safeguard our freedom, and strengthen the resilience of democracies.

About us

Who we are: Autonomous Teaming is a defence-tech start-up specializing in machine vision solutions. Driven by cutting-edge innovation, our team works on next-generation technologies designed to meet rapidly evolving security challenges.

What we do: We develop systems that enable computers and sensors to operate as coordinated teams, collaborating in real time to counter AI-powered asymmetric threats at scale — including drone swarms and other UXVs. Our mission is to build resilient, intelligent defence capabilities that perform reliably in the most demanding environments.

Where we are: Based in Munich, Berlin, and Toulouse, we are expanding rapidly across Europe with plans to open additional office hubs. We value close, in-person collaboration as the foundation for building complex, high-impact technology, while maintaining flexibility aligned to role and team needs. Our culture is built on ownership, responsibility, and trust, with a shared commitment to growing and building together.

