Data Engineer – AI Compliance

il y a 2 semaines


France Spain Portugal All Cares Temps plein

About the Company

Cephalgo is a Strasbourg-based technology company founded in 2020, focused on developing AI solutions that ensure safety, compliance, and trust in human-AI interactions. Originally rooted in healthcare innovation, Cephalgo's platform helps organizations securely analyze and monitor voice and emotion data while meeting privacy, security, and regulatory standards.

Backed by over €3 million in funding, Cephalgo combines deep expertise in voice AI, data protection, and compliance frameworks to help enterprises build and deploy responsible AI systems. The company collaborates with leading European partners in AI ethics, healthcare, and regulatory technology.

About the Role

We are seeking a Data Engineer to build and scale systems that support text and voice analysis, risk detection, and classifier training workflows. You will be responsible for production-grade machine learning pipelines (0 → 1) and collaborate closely with data scientists and AI engineers to deliver compliant, reliable data infrastructure and services.

What You'll Do

Pipeline Development

  • Build and maintain end-to-end ML pipelines: data ingestion, preprocessing, feature extraction, model training, evaluation and deployment.

  • Develop reliable workflows specifically for voice and text analysis models.

Data Infrastructure

  • Design and maintain data storage, ETL workflows, and streaming/batch systems.

  • Implement data-quality, data-labeling, versioning and governance practices.

ML Collaboration

  • Work with data scientists and AI engineers to productionize models (e.g., text classifiers, anomaly-detection models, compliance-scoring models).

  • Support model monitoring and performance tracking once models are live.

Scalability & Reliability

  • Build robust, scalable, fault-tolerant pipelines.

  • Add observability layers: logging, monitoring, alerting for data and model pipelines.

Documentation & Governance

  • Document ETL processes, schemas, architecture and workflows.

  • Support compliance, data governance, and security standards in data pipelines and infrastructure.

You Might Be a Fit If You Have:

Experience

  • 3+ years in data engineering or ML engineering roles.

  • Proven experience building ML pipelines from scratch.

  • Experience with text classification, voice analysis or similar ML tasks is a strong plus.

Technical Skills

  • Strong programming skills (Python, Scala or Java).

  • Experience with big-data/streaming frameworks (Spark, Beam, Kafka or similar).

  • Familiarity with ML frameworks (PyTorch, TensorFlow, scikit-learn).

  • Experience with cloud data infrastructure and production deployment.

Soft Skills

  • Strong analytical and problem-solving skills.

  • Excellent collaborator and communicator—capable of working with data scientists, engineers and product/compliance stakeholders.

  • Detail-oriented, documentation-focused and comfortable in a fast-paced environment.

Education

  • Degree in Data Engineering, Computer Science, Machine Learning or related field (or equivalent experience).

Why Join Cephalgo?
  • Be at the intersection of cutting-edge AI/voice technology and compliance.
  • Make an impact by shaping a growing brand in a high-growth market.
  • Work with a collaborative, high-energy remote team driving forward-thinking solutions.
  • Grow your career and influence across product, marketing and business domains.


  • France / Spain / Italy / Germany / Netherlands All Cares Temps plein

    About the Company Cephalgo is a Strasbourg-based technology company founded in 2020, focused on developing AI solutions that ensure safety, compliance, and trust in human-AI interactions. Originally rooted in healthcare innovation, Cephalgo's platform helps organizations securely analyze and monitor voice and emotion data while meeting privacy, security, and...

  • Data Engineer

    il y a 2 semaines


    Rue du Dôme, Boulogne-Billancourt, France Mp Data Temps plein

    MP DATA, société experte dans l'acquisition, le traitement et la valorisation de données, recherche un(e) Data Engineer – Databricks passionné(e) pour renforcer son équipe.Depuis 2015, nous accompagnons nos clients – principalement issus du secteur industriel – sur des projets Data & IA ambitieux. Nos valeurs d'Excellence, d'Engagement et de...

  • Data engineer confirmé

    il y a 2 semaines


    Rue de Vidailhan, Balma, France Mp Data Temps plein

    Dans le cadre du déploiement à l'échelle mondiale de la Data Platform d'un client dans le secteur aéronautique, MP DATA a été missionné pour piloter une initiative stratégique de réduction et d'optimisation des coûts.Ce projet d'envergure fait suite à un audit FinOps approfondi réalisé par nos experts fin 2025. L'enjeu est désormais de...

  • Data Engineer expérimenté·e

    il y a 2 semaines


    Avenue Foch, Paris, France Modeo Temps plein

    Modeo est un groupement d'experts en Data Engineering, animé par une mission : "Master The Data Engineering Landscape".La Data Engineering est devenu un véritable défi pour les entreprises, un domaine complexe, évolutif et marqué par des pratiques encore disparates. Nous sommes là pour accompagner ces entreprises dans leurs problématiques, afin...

  • Data Engineer Lead

    il y a 2 semaines


    France Redslim Temps plein

    Are you passionate about building data solutions that empower organizations to make smarter decisions? Redslim is seeking a Data Engineer Lead to drive innovation and scalability across our enterprise data architecture. This is a leadership role for someone who thrives on solving complex technical challenges, mentoring teams, and delivering high-performance...

  • Data Engineer GCP Freelance

    il y a 4 semaines


    France Cherry Pick Temps plein

    En quelques motsCherry Pick est à la recherche d'un Data Engineer (H/F) pour l'un de ses clients qui opère dans le secteur de média.Description Contexte de missionLa mission s’inscrit au sein d’une Direction Data intégrée à une Direction du Numérique, dont l’ambition est de faire de la Data un levier stratégique de croissance des offres...

  • Data Scientist

    il y a 7 jours


    Rue du Dôme, Boulogne-Billancourt, France Mp Data Temps plein

    Nous recherchons un profil Data Scientist IAGen 3D pour renforcer notre équipe Data dans l'industrie. Vous interviendrez sur plusieures étapes des projets IA : sur l'exploration et la valorisation notamment.Vos responsabilités incluront :Analyser, structurer et valoriser des jeux de données internes et externes.Développer et fine tuner des modèles de...

  • Senior Data Engineer

    il y a 2 semaines


    Paris Area, France / Barcelona Area / Italy / Poland / Portugal Contentsquare Temps plein

    Contentsquare is the all-in-one experience intelligence platform designed to be easily used by anyone who cares about digital journeys. With our flexible and scalable platform, organizations quickly get a deep understanding of their customers' whole online journey. We are a global leader in the experience analytics space, with a growing presence across 15...

  • Senior Data engineer H/F

    il y a 2 semaines


    Rue Anatole France, Lille, France Devoteam Data Driven France Temps plein

    Description de l'entreprise Présentation de l'agenceUne agence multi spécialiste de plus de 40 collaborateurs qui forment des communautés d'experts sur le Cloud & DevOps, la data et la gestion de projet. Devoteam Lille accompagne ses clients dans une transformation digitale durable. A l'agence de Lille, les consultants peuvent intégrer nos clubs...

  • Data Engineer

    il y a 2 semaines


    Rue Anatole France, Levallois-Perret, France Devoteam Data Driven France Temps plein

    Description de l'entreprise Devoteam Data Driven est l'équipe spécialiste de la Data du Groupe Devoteam. Nous sommes 300 en France et notre ambition est d'accompagner les organisations qui ont décidé de placer les données au cœur de leur stratégie, les aidant à transformer la valeur de leurs données en succès durable. Description du poste Expert...