Python Web Scraping Expert

il y a 4 jours


Paris, Île-de-France NewsCore Temps plein

About NewsCore

NewsCore is an
AI-native market intelligence platform
that helps leading organizations monitor, collect, and analyze strategic information from the web in real time. We enable Fortune-500 corporations, major industrial players, and public institutions to detect emerging risks, competitive moves, regulatory signals, and market trends automatically. We're building
the next generation of automated monitoring systems
, powered by advanced AI, large-scale crawling, and information retrieval technologies.

Your Responsibilities

We are looking for
a scraping expert
to strengthen our ingestion and crawling stack.

You will design resilient pipelines capable of retrieving
millions of documents per day
across:

Websites, Social media, Institutional portals, Company websites, Industry sources

You'll work directly with our CTO and tech teams.

What you will work on

Large-Scale Scraping Architecture

Build and maintain high-volume scraping pipelines capable of ingesting 5M+ article / day

Ensure resiliency, retries, concurrency, proxy rotation, throttling, and anti-bot bypassing

Maintain robust source coverage and maximize recall rate

News & Media Scraping

Scrape news sites, press portals, industry pages, and structured business data

Extract HTML content cleanly (publication date, article content, title, authors, metadata, …)

Social Media Scraping

Extend and improve our stack for: X, Telegram, WhatsApp, Instagram, Tiktok

Manage sessions, cookies, mobile scraping, and API constraints

Backend Integration

Integrate crawling systems with: Django/Fast API backend, Celery workers, Background tasks, Redis caching, Internal ingestion APIs

Persist structured content into our database with indexing, versioning, and enrichment layers

Quality & Monitoring

Detect scraping failures, parsing errors, and source mismatches

Continuously improve content quality, coverage, freshness, and extraction accuracy

Build internal metrics to measure scraping performance at scale

Required Hard Skills

Strong experience in scraping (2+ years minimum)

Proven record in web scraping at scale (1M+ docs/day)

Experience with scraping frameworks: lxml , Selenium / Playwright, Requests, BeautifulSoup

Large-scale scraping best practices: concurrency, task scheduling, proxy rotation, caching strategies, error handling & bypass rules

Experience with social media scraping: (X, Telegram, WhatsApp, etc.)

Strong knowledge of: background jobs (Celery or equivalents), Django integration, Python parsing & normalization, HTML content extraction, Database ingestion

Comfortable with: CI/CD, Git, Quality-oriented code (tests, docs)

Bonus Skills (Nice to Have)

Scraping anti-bot techniques (captcha, session hijack, mobile / device emulation, …)

Performance testing & benchmarking

Knowledge of NLP, embeddings, search pipelines, information retrieval

Soft Skills

Strong ownership mindset

Autonomous and proactive

Fast learner, adaptable to fast-changing environments

Fluent English (written and spoken)

Why Join Us?

Cutting-edge AI startup
building the future of market intelligence used by global leaders.

High-impact work
on core systems powering large-scale data collection and strategic insights.

Ownership opportunities
, including potential equity and long-term incentives based on profile.

Elite engineering culture
, working with top-tier engineers, AI and data experts.

Recruitment Process

We propose 3 rounds over two weeks:

30-minute call
with Ludovic to understand your goals and situation.

2-hour test interview
with François to assess your hard skills.

45-minutes- tech interview
- deeper dive into your technical expertise.


  • Expert Développeur Python

    il y a 2 semaines


    Paris, Île-de-France Collective Temps plein

    Budget: Expert Développeur Python - Angular / PHPInformations GénéralesLocalisation: Paris 17èmeType de contrat: Freelance ou CDITaux journalier moyen (TJM): 500€ - 600€Salaire: 60 KTélétravail: 2 jours de télétravail par semaineDate de démarrage: Fin janvierDurée: Mission longue duréeContexte de la MissionDans le cadre d'une mission longue...

  • Expert Python

    il y a 6 jours


    Paris, Île-de-France Collective Temps plein

    Budget: 550 à 700 selon profilJe suis à la recherche d'un expert python pour une mission longue de 3 ans en Ile de france.Le rôle combine développement Python avancé, innovation (IA, sémantique) et DevOps/cloud, dans un contexte international.MissionMaintenir et faire évoluer les outils PythonPrototyper de nouveaux services (IA générative, recherche...

  • Expert Python

    il y a 6 jours


    Paris, Île-de-France Free-Work Temps plein

    Conception & DéveloppementConcevoir et développer un large volume d'API Python robustes, performantes et sécurisées.Structurer les composants, optimiser les flux et garantir la maintenabilité du code.Expertise & AutonomieTravailler en complète autonomie sur la conception, l'implémentation et les choix techniques.Être force de proposition sur les...

  • Web Crawling Engineer

    il y a 6 jours


    Paris, Île-de-France Mistral Ai Temps plein

    About MistralAt Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source, and cutting-edge models, products, and solutions. Our comprehensive AI platform is designed...

  • Développeur Python

    il y a 2 semaines


    Paris, Île-de-France Adatek Temps plein

    Le développeur Python / Angular aura en charge :L'analyse et la conception technique afin de répondre aux besoins du métierGarantie de la maintenance corrective et évolutive des développementsDéveloppement de nouvelles fonctionnalités en Python/TypeScript/Sql, avec forte courverture en tests unitaires ( >80%)Développement de front end web en...

  • Security expert Web Proxy

    il y a 2 jours


    Paris, Île-de-France Kéoni Consulting Temps plein

    CONTEXTE Experience : 5 ans et plus Métiers Fonctions : Pilotage de projet ou de programme, Expert Spécialités technologiques : Load balancing, Authentification, Firewalling, ReportingGestion des incidents Expert Technique Web Proxy (Skyhigh / McAfee) MISSIONS Un programme majeur de déploiement d?une plateforme Web Proxy globale, basé sur la...


  • Paris, Île-de-France Groupe EOLEN Temps plein

    CDI | Mid-Level (3-6 ans) | Grenoble (site client)Démarrage : RapideRéférence : AS+/2026/DEV-HPC-01Contexte de la missionAlliance Services Plus (AS+) recherche un(e) Ingénieur(e) Développement Python/HPC pour renforcer temporairement l'équipe d'un grand acteur industriel dans le domaine de l'énergie, sur le site de Grenoble.Vous participerez au...

  • Lead Python Backend Engineer

    il y a 14 heures


    Paris, Île-de-France Data Theorem Temps plein

    Data Theorem is an exciting company focused on creating a more secure world for data. Rooted in a strong engineer first culture, every employee has an impact on product and direction. We are searching for exceptional talent pursuing an opportunity to grow and take ownership of the projects that resonate most with them.As a Lead Python Backend Engineer, you...

  • Expert Proxy Web Gateway

    il y a 4 jours


    Paris, Île-de-France eXalt Shield Temps plein

    Expert Proxy Web Gateway - Migration Cloud Skyhigh | Client Grand Compte | ParisContexte du projet :Notre client, un groupe international majeur, finalise le déploiement mondial de sa solution Skyhigh (ex-McAfee). Nous recherchons un Expert Proxy pour renforcer une équipe de 5 experts et accompagner les dernières migrations proxy (ex: Blue Coat) →...

  • Expert Python Lead MLOps AZURE

    il y a 2 semaines


    Paris, Île-de-France Free-Work Temps plein

    Lead Développeur MLOps Python - Spécialiste IA/RAGCette mission s'adresse à des profils seniors ayant déjà industrialisé des solutions IA/RAG en production et maîtrisant les enjeux de passage à l'échelle. Profil avec un mindset sales: au-delà de ses compétences techniques, dispose d'un profil capable de s'imposer, de proposer des solutions, de...