Python Web Scraping Expert
il y a 4 jours
About NewsCore
NewsCore is an
AI-native market intelligence platform
that helps leading organizations monitor, collect, and analyze strategic information from the web in real time. We enable Fortune-500 corporations, major industrial players, and public institutions to detect emerging risks, competitive moves, regulatory signals, and market trends automatically. We're building
the next generation of automated monitoring systems
, powered by advanced AI, large-scale crawling, and information retrieval technologies.
Your Responsibilities
We are looking for
a scraping expert
to strengthen our ingestion and crawling stack.
You will design resilient pipelines capable of retrieving
millions of documents per day
across:
Websites, Social media, Institutional portals, Company websites, Industry sources
You'll work directly with our CTO and tech teams.
What you will work on
Large-Scale Scraping Architecture
Build and maintain high-volume scraping pipelines capable of ingesting 5M+ article / day
Ensure resiliency, retries, concurrency, proxy rotation, throttling, and anti-bot bypassing
Maintain robust source coverage and maximize recall rate
News & Media Scraping
Scrape news sites, press portals, industry pages, and structured business data
Extract HTML content cleanly (publication date, article content, title, authors, metadata, …)
Social Media Scraping
Extend and improve our stack for: X, Telegram, WhatsApp, Instagram, Tiktok
Manage sessions, cookies, mobile scraping, and API constraints
Backend Integration
Integrate crawling systems with: Django/Fast API backend, Celery workers, Background tasks, Redis caching, Internal ingestion APIs
Persist structured content into our database with indexing, versioning, and enrichment layers
Quality & Monitoring
Detect scraping failures, parsing errors, and source mismatches
Continuously improve content quality, coverage, freshness, and extraction accuracy
Build internal metrics to measure scraping performance at scale
Required Hard Skills
Strong experience in scraping (2+ years minimum)
Proven record in web scraping at scale (1M+ docs/day)
Experience with scraping frameworks: lxml , Selenium / Playwright, Requests, BeautifulSoup
Large-scale scraping best practices: concurrency, task scheduling, proxy rotation, caching strategies, error handling & bypass rules
Experience with social media scraping: (X, Telegram, WhatsApp, etc.)
Strong knowledge of: background jobs (Celery or equivalents), Django integration, Python parsing & normalization, HTML content extraction, Database ingestion
Comfortable with: CI/CD, Git, Quality-oriented code (tests, docs)
Bonus Skills (Nice to Have)
Scraping anti-bot techniques (captcha, session hijack, mobile / device emulation, …)
Performance testing & benchmarking
Knowledge of NLP, embeddings, search pipelines, information retrieval
Soft Skills
Strong ownership mindset
Autonomous and proactive
Fast learner, adaptable to fast-changing environments
Fluent English (written and spoken)
Why Join Us?
Cutting-edge AI startup
building the future of market intelligence used by global leaders.
High-impact work
on core systems powering large-scale data collection and strategic insights.
Ownership opportunities
, including potential equity and long-term incentives based on profile.
Elite engineering culture
, working with top-tier engineers, AI and data experts.
Recruitment Process
We propose 3 rounds over two weeks:
30-minute call
with Ludovic to understand your goals and situation.
2-hour test interview
with François to assess your hard skills.
45-minutes- tech interview
- deeper dive into your technical expertise.
-
Expert Développeur Python
il y a 2 semaines
Paris, Île-de-France Collective Temps pleinBudget: Expert Développeur Python - Angular / PHPInformations GénéralesLocalisation: Paris 17èmeType de contrat: Freelance ou CDITaux journalier moyen (TJM): 500€ - 600€Salaire: 60 KTélétravail: 2 jours de télétravail par semaineDate de démarrage: Fin janvierDurée: Mission longue duréeContexte de la MissionDans le cadre d'une mission longue...
-
Expert Python
il y a 6 jours
Paris, Île-de-France Collective Temps pleinBudget: 550 à 700 selon profilJe suis à la recherche d'un expert python pour une mission longue de 3 ans en Ile de france.Le rôle combine développement Python avancé, innovation (IA, sémantique) et DevOps/cloud, dans un contexte international.MissionMaintenir et faire évoluer les outils PythonPrototyper de nouveaux services (IA générative, recherche...
-
Expert Python
il y a 6 jours
Paris, Île-de-France Free-Work Temps pleinConception & DéveloppementConcevoir et développer un large volume d'API Python robustes, performantes et sécurisées.Structurer les composants, optimiser les flux et garantir la maintenabilité du code.Expertise & AutonomieTravailler en complète autonomie sur la conception, l'implémentation et les choix techniques.Être force de proposition sur les...
-
Web Crawling Engineer
il y a 6 jours
Paris, Île-de-France Mistral Ai Temps pleinAbout MistralAt Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source, and cutting-edge models, products, and solutions. Our comprehensive AI platform is designed...
-
Développeur Python
il y a 2 semaines
Paris, Île-de-France Adatek Temps pleinLe développeur Python / Angular aura en charge :L'analyse et la conception technique afin de répondre aux besoins du métierGarantie de la maintenance corrective et évolutive des développementsDéveloppement de nouvelles fonctionnalités en Python/TypeScript/Sql, avec forte courverture en tests unitaires ( >80%)Développement de front end web en...
-
Security expert Web Proxy
il y a 2 jours
Paris, Île-de-France Kéoni Consulting Temps pleinCONTEXTE Experience : 5 ans et plus Métiers Fonctions : Pilotage de projet ou de programme, Expert Spécialités technologiques : Load balancing, Authentification, Firewalling, ReportingGestion des incidents Expert Technique Web Proxy (Skyhigh / McAfee) MISSIONS Un programme majeur de déploiement d?une plateforme Web Proxy globale, basé sur la...
-
Ingénieur(e) Développement Python/HPC
il y a 2 jours
Paris, Île-de-France Groupe EOLEN Temps pleinCDI | Mid-Level (3-6 ans) | Grenoble (site client)Démarrage : RapideRéférence : AS+/2026/DEV-HPC-01Contexte de la missionAlliance Services Plus (AS+) recherche un(e) Ingénieur(e) Développement Python/HPC pour renforcer temporairement l'équipe d'un grand acteur industriel dans le domaine de l'énergie, sur le site de Grenoble.Vous participerez au...
-
Lead Python Backend Engineer
il y a 14 heures
Paris, Île-de-France Data Theorem Temps pleinData Theorem is an exciting company focused on creating a more secure world for data. Rooted in a strong engineer first culture, every employee has an impact on product and direction. We are searching for exceptional talent pursuing an opportunity to grow and take ownership of the projects that resonate most with them.As a Lead Python Backend Engineer, you...
-
Expert Proxy Web Gateway
il y a 4 jours
Paris, Île-de-France eXalt Shield Temps pleinExpert Proxy Web Gateway - Migration Cloud Skyhigh | Client Grand Compte | ParisContexte du projet :Notre client, un groupe international majeur, finalise le déploiement mondial de sa solution Skyhigh (ex-McAfee). Nous recherchons un Expert Proxy pour renforcer une équipe de 5 experts et accompagner les dernières migrations proxy (ex: Blue Coat) →...
-
Expert Python Lead MLOps AZURE
il y a 2 semaines
Paris, Île-de-France Free-Work Temps pleinLead Développeur MLOps Python - Spécialiste IA/RAGCette mission s'adresse à des profils seniors ayant déjà industrialisé des solutions IA/RAG en production et maîtrisant les enjeux de passage à l'échelle. Profil avec un mindset sales: au-delà de ses compétences techniques, dispose d'un profil capable de s'imposer, de proposer des solutions, de...