Auto-regulated Traffic Signal Control in Multi-modal Urban Networks Using Graph-based Deep Reinforcement Learning
il y a 2 jours
**Auto-Regulated Traffic Signal Control in Multi-Modal Urban Networks Using Graph-Based Deep Reinforcement Learning**:
- Réf **ABG-131317**
- Sujet de Thèse
- 21/04/2025
- Contrat doctoral
- LICIT laboratory (ENTPE/UGE), Lyon
- Lieu de travail- Lyon - Auvergne-Rhône-Alpes - France
- Intitulé du sujet- Auto-Regulated Traffic Signal Control in Multi-Modal Urban Networks Using Graph-Based Deep Reinforcement Learning
- Champs scientifiques- Informatique
**Description du sujet**:
Traffic Signal Control (TSC) is a cornerstone of urban traffic management, directly impacting traffic efficiency, network stability, and environmental performance [1]. Over the past decade, adaptive and intelligent TSC approaches have become essential tools for mitigating congestion. These methods adjust signal timings based on real-time traffic conditions, helping to reduce delays and improve throughput. Among these approaches, Reinforcement Learning (RL), particularly Deep Reinforcement Learning (DRL), has emerged as a promising paradigm capable of capturing complex traffic dynamics through interaction with the environment [2].
In real-world traffic networks, intersections are inherently interdependent: the conditions at one intersection are influenced by upstream inflows and downstream congestion, forming tightly coupled spatial dependencies. This complexity becomes more pronounced when multiple intersections share major traffic flows or transit routes. As such, isolated signal optimization is often insufficient. Recent work has explored Multi-Agent Reinforcement Learning (MARL) to coordinate control across multiple intersections via distributed agents. These decentralized approaches offer scalability and robustness but require careful coordination strategies to avoid myopic or conflicting decisions [3].
Challenges in Coordination and Perception
A critical open issue remains: (i) how intersections can effectively exchange and process relevant information, and (ii) to what extent an intersection is interlinked with others [4]. In most practical deployments, controllers use data only from signalized intersections, without considering the impact of non-signalized nodes (e.g., roundabouts or priority-to-the-right junctions), which are common in urban networks. These elements can significantly affect the dynamics of nearby controlled intersections.
This issue can be interpreted as a Partial Observability problem, similar to those encountered when deploying agents in real-world scenarios. There is thus a need to develop models capable of capturing heterogeneous neighborhood effects—i.e., identifying which nearby nodes influence a given intersection and integrating only the relevant information into the decision-making process [4,5].
When such neighborhood information is incorporated into the agent controlling an intersection [4], the process is typically mono-directional: the surrounding context is used to enhance the agent's perception, but without introducing a truly mutual relationship aimed at cooperation. As a result, agents tend to maintain selfish decisions, with little consideration for the impact on their surroundings, even if their perception is augmented by local context.
Need for Dynamic Protection and Proactive Coordination
Furthermore, even with sophisticated multi-agent control, oversaturated conditions (e.g., during rush hours or major public events) can lead to gridlock and systemic collapse due to spillback effects. To address this, the concept of Perimeter Control has been proposed [6], which involves restricting vehicle inflows into high-demand areas to preserve internal flow conditions. However, most existing approaches rely on static boundaries and centralized coordination, limiting scalability, transferability, and adaptability to real-time changes.
There is a pressing need for adaptive, agent-driven perimeter protection, capable of dynamically identifying and regulating protected zones based on local observations and decentralized operations [7]. Achieving this requires developing agents with local perception and control, capable of exchanging information with neighbors to foster cooperative behaviors. This is a key step toward the emergence of self-organized, proactive traffic management strategies, particularly in the context of spatially dynamic protected networks.
Embracing Multi-Modality and Multi-Objective Optimization
Managing the multi-objectivity and multi-modality of urban traffic is also becoming increasingly essential. Urban intersections accommodate a wide variety of users, including private vehicles, freight, bicycles, pedestrians, and public transit. Buses, in particular, are sensitive to signal timing and congestion, requiring headway regularity to avoid bunching and ensure reliable service.
Despite some recent progress [8], most RL-based TSC approaches still fail to model real-world bus dynamics, such as open-loop operations or heterogeneous passenger demand. Beyond multi-modality, mult
-
[STAGE MASTER 2]
il y a 1 jour
Lyon, France CESI LYON Temps pleinDescriptionThis internship project focuses on a specific component of a broader initiative to improve the dynamic rebalancing of bike-sharing systems [1,2,3]. The problem is addressed in two stages. Based on data at the station and travel needs at a given moment t, the number of bicycles available and needed will be predicted at time t. Points of origin and...
-
[STAGE MASTER 2]
il y a 3 jours
Lyon, France CESI LYON Temps pleinDescription This internship project focuses on a specific component of a broader initiative to improve the dynamic rebalancing of bike-sharing systems [1,2,3]. The problem is addressed in two stages. Based on data at the station and travel needs at a given moment t, the number of bicycles available and needed will be predicted at time t. Points of origin and...
-
Stage traitement de signal/IA
il y a 2 semaines
Lyon, Auvergne-Rhône-Alpes, France Caplogy SAS Temps pleinFusion Vision RGB + Wi-Fi CSI pour Détection Robuste de ChutesDescriptionCe stage s'inscrit dans un projet visant à concevoir un système multimodal (vision + Wi-Fi CSI) pour la détection robuste de chutes à domicile. L'objectif est d'obtenir une sensibilité et une précision supérieures aux solutions mono-modales, d'assurer la tolérance aux...
-
Stage traitement de signal/IA
il y a 2 semaines
Lyon, Auvergne-Rhône-Alpes, France Caplogy SAS Temps pleinHAR Basé sur Wi-Fi CSI pour Monitoring Non-Intrusif des Personnes ÂgéesDescription du stage:Ce stage s'inscrit dans un projet visant à concevoir un système robuste de reconnaissance d'activités humaines spécifiquement dédié au monitoring non-intrusif des personnes âgées, en s'appuyant sur le Channel State Information (CSI) des signaux Wi-Fi....
-
Lyon, Auvergne-Rhône-Alpes, France Inria Temps pleinLe descriptif de l'offre ci-dessous est en AnglaisType de contrat : CDDContrat renouvelable : OuiNiveau de diplôme exigé : Thèse ou équivalentFonction : Post-DoctorantA propos du centre ou de la direction fonctionnelleThe Inria research centre in Lyon is the 9th Inria research centre, formally created in January 2022. It brings together approximately 300...
-
Phd Position F/m Phd Thesis Proposal
il y a 1 semaine
Lyon, France Inria Temps pleinLe descriptif de l’offre ci-dessous est en Anglais_ **Type de contrat **:CDD **Niveau de diplôme exigé **:Bac + 5 ou équivalent **Fonction **:Doctorant **A propos du centre ou de la direction fonctionnelle**: The Centre Inria de l’Université de Grenoble groups together almost 600 people in 22 research teams and 7 research support...
-
Stage Traitement d'images/IA
il y a 2 semaines
Lyon, Auvergne-Rhône-Alpes, France Caplogy SAS Temps pleinHAR Multi-View par Vision RGB-D pour réhabilitation neurologique à domicileDescriptiondustage:Ce stage s'inscrit dans un projet visant à développer une solution de suivi et d'évaluation automatisée des exercices de rééducation neurologique à domicile (post-AVC, Parkinson, etc.) en s'appuyant sur plusieurs caméras RGB-D. L'objectif est de fournir...
-
[STAGE MASTER]
il y a 1 jour
Lyon, France Cesi Temps pleinInternship Topic Stress and emotion monitoring systems play a crucial role in mental health management, especially given the global rise in psychological disorders. Wearable devices equipped with sensors such as ECG (electrocardiogram), PPG (photoplethysmogram), EDA (electrodermal activity), and motion sensors provide a rich source of data for stress and...
-
Ingénieur DEEP LEARNING, MLOPS H/F
il y a 3 jours
Lyon, France GESER BEST Temps pleinDescription du poste et Missions GESER BEST recherche pour l'un de ses clients, un Ingénieur Deep Learning, spécialiste MLOPS (H/F) Vos missions, au sein d'une équipe Deep Learning fonctionnant en mode Agile (Scrum): - Intervenir en tant que développeur sur la partie MLOPS en lien avec les équipes DevOps des autres plateaux et avec l’IT centrale...
-
Machine Learning Intern Stochastic Gradient Descent
il y a 1 jour
Lyon, France Nova In Silico company Temps pleinNova In Silico is a health tech company that develops an in silico clinical trial platform jinkō to simulate drug efficacy and optimize clinical development using virtual patients and disease modeling. As an innovative company, we offer a dynamic work environment distinct from larger, established organizations. Interns will gain significant responsibilities...