Data Engineer, Aws Data
il y a 11 heures
* Bachelors or Master’s in Computer Science or Engineering, or equivalent experience
- Knowledge of relational databases, SQL, and UNIX/Linux
- Knowledge of the Python programming language
Job summary
The AWS Data & ML Research Division is looking for a data engineer to join the Estel team located in Paris, France. Our research teams extract fundamental scientific problems from AWS products and business needs as informed by customers, and develop innovative solutions that will have a long-lasting, transformative impact on our products and customer experiences.
The Estel team specializes in resource optimization of large-scale cloud data analytics. It aims to bring the best cost performance benefits to customers, in an unprecedented manner as enabled by cutting-edge research. Such innovations include principled multi-objective optimization approaches, and adoption of large-scale machine learning to bear on the process of performance and cost modeling, thereby enabling intelligent optimization decisions based on such models. We envision that such techniques will be broadly applied to a range of AWS products, unlock the power of big data and machine learning for intelligent decision making, and ultimately enable big leaps forward for offering best cost performance benefits to customers and long-term sustainable computing.
If you share our vision and are motivated by a career path that defines new problems, explores in uncharted territory, pushes the scientific boundary, innovates and delivers until a real-world impact is generated, then come and join us — we will work together to make our vision true
Key job responsibilities
As a Data Engineer in the Estel team, you will:
- Develop large ETL pipelines to extract query logs, data statistics, system performance metrics, etc. from production clusters to a processing backend on a daily or hourly basis;
- Handle data cleaning and integration of diverse data sources to build a clean, informative profile of each query executed in recent past and do so in a timely manner to prepare for model training;
- Handle storage and indexing of large numbers of query profiles to enable the development of a history-based optimizer;
- Adapt such ETL pipelines and query profile databases for different AWS products;
- Work within a fast moving, startup environment in a large company, interacting with different product teams to rapidly deliver prototypes and/or products that have a broad business impact.
You should be somebody who enjoys working on large data sets, is customer-centric, is passionate about building quality, performant ETL and related data management software, as well as achieving operational excellence. You should also be a fast learner in order to understand how key AWS products function (including the data API, system architecture, etc.). A commitment to team work, and strong communication skills (both within the research team and with different product teams) are essential. You will work with fellow scientists and engineers to solve challenging problems and have an opportunity to publish your work in the scientific community.
- Masters in Computer Science, with a focus on database systems and big data systems
- 3-5 years of relevant work experience in the query processing or data analysis domain
- Experience with database internals such as query processing and optimization, in particular, query plans, indexes, data statistics, runtime query execution
- Knowledge of ETL tasks such as parsing, data cleaning, and data integration
- Ability to design and develop a storage and retrieval system for extracted query logs and system metrics
- Good understanding of parallel computing, system efficiency, and scalability issues
- Experience in characterizing, debugging, and correcting performance issues in large-scale ETL or data management systems
- Proven ability to drive tasks to completion, work independently, and take on full ownership of projects
-
Senior AWS Data Engineer
il y a 7 heures
Paris, Île-de-France Data Reply Temps pleinSenior AWS Data EngineerTasksImplement new use cases and data pipelines on AWSMap data and data flows across cloud platformsDevelop and industrialize data pipelines and processing workflowsDesign and build dashboards and reporting toolsPerform unit and integration testing of data flowsParticipate in Data Reply events (Reply Xchange, hackathons, AWS summits,...
-
Data Engineer Expérimenté
il y a 7 jours
Paris, Île-de-France MP DATA Temps pleinNous recherchons un(e)Data Engineer expérimenté(e)pour intervenir sur lamise en production, la fiabilisation et l'évolutiond'une plateforme data moderne basée surAWS, Spark et Dataiku.Vous participerez activement à laconstruction et l'optimisationdes environnements de traitement de données à grande échelle, en lien étroit avec les équipes Data...
-
Data Engineer Expérimenté
il y a 5 jours
Paris, Île-de-France Mp Data Temps pleinNous recherchons un(e) Data Engineer expérimenté(e) pour intervenir sur la mise en production, la fiabilisation et l'évolution d'une plateforme data moderne basée sur AWS, Spark et Dataiku.Vous participerez activement à la construction et l'optimisation des environnements de traitement de données à grande échelle, en lien étroit avec les équipes...
-
Data Engineer AWS
il y a 1 semaine
Paris, France Data Reply FR Temps pleinData Reply est une filiale du groupe Reply offrant une large gamme de services d'analyse avancée et de données alimentées par l'IA. Nous opérons dans différents secteurs et fonctions commerciales, en travaillant directement avec des professionnels de haut niveau et des directeurs généraux pour leur permettre d'obtenir des résultats significatifs...
-
Data Engineer AWS
il y a 7 jours
Paris, France Data Reply FR Temps pleinData Reply est une filiale du groupe Reply offrant une large gamme de services d'analyse avancée et de données alimentées par l'IA. Nous opérons dans différents secteurs et fonctions commerciales, en travaillant directement avec des professionnels de haut niveau et des directeurs généraux pour leur permettre d'obtenir des résultats significatifs...
-
Data Engineer
il y a 7 jours
Paris, Île-de-France MP DATA Temps pleinEn tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l'optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l'environnement Spark sera essentielle pour garantir des traitements performants, sécurisés et scalables.Vos...
-
Data Engineer
il y a 7 jours
Paris, France MP DATA Temps pleinEn tant que Data Engineer Senior, vous jouerez un rôle clé dans la construction, l'optimisation et la fiabilisation de nos pipelines de données à grande échelle, au cœur de notre plateforme analytique. Votre expertise sur Databricks et l'environnement Spark sera essentielle pour garantir des traitements performants, sécurisés et scalables.Vos...
-
Data Engineer AWS
il y a 7 heures
Paris, Île-de-France Cherry Pick Temps pleinEn quelques motsCherry Pick est à la recherche d'un "Data Engineer AWS" pour un client dans le secteur des transportsDescription? Contexte de la missionAu sein du domaine Data & IA, le client recherche un MLOps Engineer / Data Engineer confirmé pour intervenir sur l?ensemble du cycle de vie des solutions d?intelligence artificielle.L?objectif :...
-
Graduate Data Engineer
il y a 15 heures
Paris, Île-de-France Data Reply Temps pleinGraduate Data EngineerTasks• Implementing new use cases• Mapping data and data flows• Implementing data analysis and processing pipelines• Industrializing data flows and their visualization through dashboards and reporting• Carrying out unit tests and integration tests Benefits• Structured career progression – at Reply, we encourage career...
-
Senior Data Engineer | FinTech
il y a 2 semaines
Paris, France Data Recrutement Temps pleinL'ENTREPRISE : FINTECH INNOVANTE DANS LA FINANCE DURABLE ET ESG Startup pionnière dans la finance durable, ayant développé une solution SaaS innovante pour apporter plus de transparence et de technologie au secteur financier. Grâce à une approche data-driven et centrée sur l'ESG, elle accompagne des clients internationaux dans leur transformation...