Software Engineer, Data Acquisition

Il y a 2 mois


Paris, France Mistral AI Temps plein
About Mistral

- At Mistral AI, we are a tight-knit, nimble team dedicated to bringing our cutting-edge AI technology to the world.

- Our mission is to make AI ubiquitous and open.

- We are creative, low-ego, team-spirited, and have been passionate about AI for years.

- We hire people that foster in competitive environments, because they find them more fun to work in.

- We hire passionate women and men from all over the world.

- Our teams are distributed between France, UK and USA

Role Summary

- We are seeking a skilled and motivated Web Crawling and Data Indexing Engineer to join our dynamic engineering team.

- The ideal candidate will have a strong background in web scraping, data extraction and indexing, with a focus on leveraging advanced tools and technologies to gather and process large-scale data from various web sources.

- The role is based in Paris or London

Key Responsibilities

- Develop and maintain web crawlers using Python libraries such as Beautiful Soup to extract data from target websites.

- Utilize headless browsing techniques, such as Chrome DevTools, to automate and optimize data collection processes.

- Collaborate with cross-functional teams to identify, scrape, and integrate data from APIs to support business objectives.

- Create and implement efficient parsing patterns using regular expressions, XPaths, and CSS selectors to ensure accurate data extraction.

- Design and manage distributed job queues using technologies such as Redis, Kubernetes, and Postgres to handle large-scale data processing tasks.

- Develop strategies to monitor and ensure data quality, accuracy, and integrity throughout the crawling and indexing process.

- Continuously improve and optimize existing web crawling infrastructure to maximize efficiency and adapt to new challenges.

Qualifications & profile

- Bachelor's or master's degree in computer science, information systems, or information technology

- Strong understanding of web technologies, data structures, and algorithms.

- They should have knowledge of database management systems and data warehousing.

- Programming Languages: Proficiency in programming languages such as Python, Java, or C++ is essential.

- Masterings of Web Technologies: Understanding of HTML, CSS, and JavaScript is crucial to navigate and scrape data from websites.

- Knowledge of HTTP and HTTPS protocols

- A good understanding of data structures (like queues, stacks, and hash maps) and algorithms is necessary

- Knowledge of databases (SQL or NoSQL) is important to store and manage the crawled data.

- Understanding distributed systems and technologies like Hadoop or Spark Experience using web Scraping Libraries and Frameworks like Scrapy, BeautifulSoup, Selenium, or MechanicalSoup

- Understanding how search engines work and how to optimize web crawling.

- Experience in Machine Learning to improve the efficiency and accuracy of web crawling

- Familiar with tools such as Pandas, NumPy, and Matplotlib to analyze and visualize data.

Benefits

- Daily lunch vouchers

- Contribution to a Gympass subscription

- Monthly contribution to a mobility pass

- Full health insurance for you and your family

- Generous parental leave policy
  • Data Engineer

    il y a 3 semaines


    Paris, France Data Engineer - Lead - Data Platforms Temps plein

    Lenstra was created by the passion of engineers specialised in Computer Science with a proven history in delivering top quality solutions to its customers. Bringing together work excellence and vision we managed to serve top tier clients from a variety of industry domains like Banking/Insurance, Luxury and Tech.We help our clients to solve their most...

  • Senior Software Engineer

    Il y a 2 mois


    Paris, France Software Aspekte Temps plein

    Specialism: ZK Proofs, Developer Tooling, and Blockchain Security Project: This company is dedicated to enhancing online privacy through end-to-end encryption, aiming to protect user data across the internet. Its suite of products focuses on securing AI applications both in cloud environments and on the blockchain, empowering developers and data scientists...

  • Senior Software Engineer

    il y a 4 semaines


    Paris, Île-de-France emagine Consulting Temps plein

    As a Senior Software Engineer - Data Solutions at {company}, you will be responsible for designing and developing software applications that drive data-driven decision-making. Your expertise in software development and data solutions will enable you to create efficient and scalable systems that meet the needs of our business.Key Responsibilities:Design and...

  • Software / Data Engineer

    il y a 1 mois


    Paris, France Mobiskill | WEFY Group Temps plein

    Rejoins une entreprise renommée dans le domaine du conseil, spécialisée dans la Data et l'IA. Avec plus de 1000 collaborateurs répartis dans 20 bureaux à travers le monde, l'entreprise cherche un Software/Data engineer pour rejoindre leur équipe sur Paris.Vos Missions :Déployer vos logiciels sur des environnements cloud (GCP, AWS,...

  • Software / Data Engineer

    il y a 1 mois


    Paris, Ile-de-France Mobiskill | WEFY Group Temps plein

    Rejoins une entreprise renommée dans le domaine du conseil, spécialisée dans la Data et l'IA. Avec plus de 1000 collaborateurs répartis dans 20 bureaux à travers le monde, l'entreprise cherche un Software/Data engineer pour rejoindre leur équipe sur Paris.Vos Missions :Déployer vos logiciels sur des environnements cloud (GCP, AWS,...

  • Senior Software Engineer

    il y a 4 semaines


    Paris, Île-de-France Fed Legal Temps plein

    We are seeking a skilled Software Engineer to join our team and contribute to the development of innovative data-driven solutions. Key responsibilities include designing and implementing scalable software applications, collaborating with cross-functional teams, and ensuring high-quality code delivery.Develop and maintain high-quality software applications...


  • Paris, France Artefact Temps plein

    From design to deployment, you manage your solution end-to-end, while also optimising the performance, security and scalability.Our working language is in English and preferably the local language of the office. The teamThe most tech people of our Data & Consulting division, the title of "Data engineer" or "Software engineer" does not describe everything our...

  • Software Engineer

    il y a 1 mois


    Paris, Île-de-France Dataiku Misc Postings Temps plein

        We are seeking a skilled Software Engineer to join our team at Dataiku, where you will play a crucial role in elevating our data exploration capabilities and empowering users to derive meaningful insights from their data.

  • Data Engineer

    il y a 3 semaines


    Paris, Île-de-France Audensiel Temps plein

    Data Acquisition and Processing SpecialistAt Audensiel, we're committed to supporting our clients in their digital transformation journeys. As a Data Acquisition and Processing Specialist, you'll be responsible for managing and maintaining data pipelines for data acquisition, processing, and storage.Key Responsibilities:Manage and maintain data pipelines for...

  • Junior Software Engineer

    Il y a 6 mois


    Paris, France Artefact Temps plein

    Qui sommes-nous ?Artefact est un cabinet de conseil en data nouvelle génération qui compte plus de 1 200 collaborateurs dans 19 pays, dédiés à l'accompagnement et à la transformation de nos clients par la data. Nous proposons une large gamme de solutions data-driven, que nous adaptons aux besoins spécifiques de nos clients. Parmi elles, on compte des...

  • Junior Software Engineer

    Il y a 2 mois


    Paris, France Artefact Temps plein

    Qui sommes-nous ?Artefact est un cabinet de conseil en data nouvelle génération qui compte plus de 1 200 collaborateurs dans 19 pays, dédiés à l'accompagnement et à la transformation de nos clients par la data. Nous proposons une large gamme de solutions data-driven, que nous adaptons aux besoins spécifiques de nos clients. Parmi elles, on compte des...


  • Paris, Île-de-France FED ENGINEERING Temps plein

    Job Title: Software Engineer for Data Analysis and VisualizationAbout the Role:We are seeking a skilled Software Engineer to join our team and contribute to the development of data analysis and visualization tools.Key Responsibilities:Design and implement data analysis and visualization solutionsCollaborate with cross-functional teams to integrate data...

  • Senior Software Engineer

    il y a 1 mois


    Paris, Île-de-France Artefact Temps plein

    About the RoleWe are seeking a highly skilled Senior Software Engineer to join our team at Artefact. As a Senior Software Engineer, you will be responsible for working on all aspects of data engineering in multidisciplinary client teams.Key ResponsibilitiesWork on and deploy software to the cloud (GCP, AWS or Azure)Implement software with Python and SQL to...

  • Data Architect

    Il y a 2 mois


    Paris, France Data Engineer - Lead - Data Platforms Temps plein

    Lenstra was created by the passion of engineers specialised in Computer Science with a proven history in delivering top quality solutions to its customers. Bringing together work excellence and vision we managed to serve top tier clients from a variety of industry domains like Banking/Insurance, Luxury and Tech.We help our clients to solve their most...

  • Senior Software Engineer

    il y a 2 semaines


    Paris, France Welcome to the Jungle Temps plein

    Welcome to the Jungle – Paris, Île de FranceWho are we? Artefact is a new generation of data consulting firm with more than 1,200 employees in 19 countries dedicated to supporting our clients' transformation. We offer a wide range of data-driven solutions, which we adapt to our clients' specific needs, from AI projects to automate internal processes at...


  • Paris, Île-de-France Artefact Temps plein

    About the RoleArtefact is a data consultancy that supports and transforms customers through data-driven solutions. As a Junior Software Engineer, you will work in an agile team to design and implement software solutions using cutting-edge technologies. Your responsibilities will include working on projects that utilize Artificial Intelligence to automate...

  • Software Engineer

    Il y a 2 mois


    Paris, France IC Resources Temps plein

    Software Engineer Salary: €61k - €67k Location: France   IC Resources is delighted to be partnering with a company that is conducting ground breaking work into Ultra-low latency trading. This company is working tirelessly to bridge the gap between Finance and Technology, by producing some of the fastest market data processing systems in the world. This...

  • Senior Software Engineer

    il y a 1 mois


    Paris, Île-de-France Artefact Temps plein

    About the RoleArtefact is a leading data consulting firm with a strong presence in 19 countries, employing over 1,200 experts dedicated to supporting clients' transformation. We offer a wide range of data-driven solutions, tailored to meet clients' specific needs, from AI projects to automate internal processes, to creating innovative and personalized...

  • Intern Software Engineer

    Il y a 6 mois


    Paris, France Artefact Temps plein

    Who we are ?Artefact is a new-generation data consultancy with over 1,200 employees in 19 countries, dedicated to supporting and transforming our customers through data. We offer a wide range of data-driven solutions, which we tailor to our customers' specific needs. These include projects that use Artificial Intelligence to automate internal processes,...

  • Senior Software Engineer

    il y a 4 semaines


    Paris, Île-de-France Fed Finance Temps plein

    About the JobWe are looking for a skilled Senior Software Engineer to join our team.The successful candidate will be responsible for designing and implementing data processing solutions using various technologies.Design and implement data processing pipelines using Python and other technologies.Collaborate with cross-functional teams to ensure seamless...