Data Engineer, Aws Data

il y a 11 heures


Paris, France AWS EMEA SARL (France Branch) Temps plein

* Bachelors or Master’s in Computer Science or Engineering, or equivalent experience
- Knowledge of relational databases, SQL, and UNIX/Linux
- Knowledge of the Python programming language

Job summary

The AWS Data & ML Research Division is looking for a data engineer to join the Estel team located in Paris, France. Our research teams extract fundamental scientific problems from AWS products and business needs as informed by customers, and develop innovative solutions that will have a long-lasting, transformative impact on our products and customer experiences.

The Estel team specializes in resource optimization of large-scale cloud data analytics. It aims to bring the best cost performance benefits to customers, in an unprecedented manner as enabled by cutting-edge research. Such innovations include principled multi-objective optimization approaches, and adoption of large-scale machine learning to bear on the process of performance and cost modeling, thereby enabling intelligent optimization decisions based on such models. We envision that such techniques will be broadly applied to a range of AWS products, unlock the power of big data and machine learning for intelligent decision making, and ultimately enable big leaps forward for offering best cost performance benefits to customers and long-term sustainable computing.

If you share our vision and are motivated by a career path that defines new problems, explores in uncharted territory, pushes the scientific boundary, innovates and delivers until a real-world impact is generated, then come and join us — we will work together to make our vision true

Key job responsibilities

As a Data Engineer in the Estel team, you will:

- Develop large ETL pipelines to extract query logs, data statistics, system performance metrics, etc. from production clusters to a processing backend on a daily or hourly basis;
- Handle data cleaning and integration of diverse data sources to build a clean, informative profile of each query executed in recent past and do so in a timely manner to prepare for model training;
- Handle storage and indexing of large numbers of query profiles to enable the development of a history-based optimizer;
- Adapt such ETL pipelines and query profile databases for different AWS products;
- Work within a fast moving, startup environment in a large company, interacting with different product teams to rapidly deliver prototypes and/or products that have a broad business impact.

You should be somebody who enjoys working on large data sets, is customer-centric, is passionate about building quality, performant ETL and related data management software, as well as achieving operational excellence. You should also be a fast learner in order to understand how key AWS products function (including the data API, system architecture, etc.). A commitment to team work, and strong communication skills (both within the research team and with different product teams) are essential. You will work with fellow scientists and engineers to solve challenging problems and have an opportunity to publish your work in the scientific community.
- Masters in Computer Science, with a focus on database systems and big data systems
- 3-5 years of relevant work experience in the query processing or data analysis domain
- Experience with database internals such as query processing and optimization, in particular, query plans, indexes, data statistics, runtime query execution
- Knowledge of ETL tasks such as parsing, data cleaning, and data integration
- Ability to design and develop a storage and retrieval system for extracted query logs and system metrics
- Good understanding of parallel computing, system efficiency, and scalability issues
- Experience in characterizing, debugging, and correcting performance issues in large-scale ETL or data management systems
- Proven ability to drive tasks to completion, work independently, and take on full ownership of projects


  • Senior AWS Data Engineer

    il y a 7 heures


    Paris, Île-de-France Data Reply Temps plein

    Senior AWS Data EngineerTasksImplement new use cases and data pipelines on AWSMap data and data flows across cloud platformsDevelop and industrialize data pipelines and processing workflowsDesign and build dashboards and reporting toolsPerform unit and integration testing of data flowsParticipate in Data Reply events (Reply Xchange, hackathons, AWS summits,...


  • Paris, Île-de-France MP DATA Temps plein

    Nous recherchons un(e)Data Engineer expérimenté(e)pour intervenir sur lamise en production, la fiabilisation et l'évolutiond'une plateforme data moderne basée surAWS, Spark et Dataiku.Vous participerez activement à laconstruction et l'optimisationdes environnements de traitement de données à grande échelle, en lien étroit avec les équipes Data...


  • Paris, Île-de-France Mp Data Temps plein

    Nous recherchons un(e) Data Engineer expérimenté(e) pour intervenir sur la mise en production, la fiabilisation et l'évolution d'une plateforme data moderne basée sur AWS, Spark et Dataiku.Vous participerez activement à la construction et l'optimisation des environnements de traitement de données à grande échelle, en lien étroit avec les équipes...

  • Data Engineer AWS

    il y a 1 semaine


    Paris, France Data Reply FR Temps plein

    Data Reply est une filiale du groupe Reply offrant une large gamme de services d'analyse avancée et de données alimentées par l'IA. Nous opérons dans différents secteurs et fonctions commerciales, en travaillant directement avec des professionnels de haut niveau et des directeurs généraux pour leur permettre d'obtenir des résultats significatifs...

  • Data Engineer AWS

    il y a 7 jours


    Paris, France Data Reply FR Temps plein

    Data Reply est une filiale du groupe Reply offrant une large gamme de services d'analyse avancée et de données alimentées par l'IA. Nous opérons dans différents secteurs et fonctions commerciales, en travaillant directement avec des professionnels de haut niveau et des directeurs généraux pour leur permettre d'obtenir des résultats significatifs...

  • Data Engineer

    il y a 7 jours


    Paris, Île-de-France MP DATA Temps plein

    En tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l'optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l'environnement Spark sera essentielle pour garantir des traitements performants, sécurisés et scalables.Vos...

  • Data Engineer

    il y a 7 jours


    Paris, France MP DATA Temps plein

    En tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l'optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l'environnement Spark sera essentielle pour garantir des traitements performants, sécurisés et scalables.Vos...

  • Data Engineer AWS

    il y a 7 heures


    Paris, Île-de-France Cherry Pick Temps plein

    En quelques motsCherry Pick est à la recherche d'un "Data Engineer AWS" pour un client dans le secteur des transportsDescription? Contexte de la missionAu sein du domaine Data & IA, le client recherche un MLOps Engineer / Data Engineer confirmé pour intervenir sur l?ensemble du cycle de vie des solutions d?intelligence artificielle.L?objectif :...

  • Graduate Data Engineer

    il y a 15 heures


    Paris, Île-de-France Data Reply Temps plein

    Graduate Data EngineerTasks• Implementing new use cases• Mapping data and data flows• Implementing data analysis and processing pipelines• Industrializing data flows and their visualization through dashboards and reporting• Carrying out unit tests and integration tests  Benefits• Structured career progression – at Reply, we encourage career...

  • Senior Data Engineer | FinTech

    il y a 2 semaines


    Paris, France Data Recrutement Temps plein

    L'ENTREPRISE : FINTECH INNOVANTE DANS LA FINANCE DURABLE ET ESG Startup pionnière dans la finance durable, ayant développé une solution SaaS innovante pour apporter plus de transparence et de technologie au secteur financier. Grâce à une approche data-driven et centrée sur l'ESG, elle accompagne des clients internationaux dans leur transformation...