Auto-regulated Traffic Signal Control in Multi-modal Urban Networks Using Graph-based Deep Reinforcement Learning

il y a 4 jours


Lyon, France LICIT laboratory (ENTPEUGE), Lyon Temps plein

**Auto-Regulated Traffic Signal Control in Multi-Modal Urban Networks Using Graph-Based Deep Reinforcement Learning**:

- Réf **ABG-131317**
- Sujet de Thèse
- 21/04/2025
- Contrat doctoral
- LICIT laboratory (ENTPE/UGE), Lyon
- Lieu de travail- Lyon - Auvergne-Rhône-Alpes - France
- Intitulé du sujet- Auto-Regulated Traffic Signal Control in Multi-Modal Urban Networks Using Graph-Based Deep Reinforcement Learning
- Champs scientifiques- Informatique

**Description du sujet**:
Traffic Signal Control (TSC) is a cornerstone of urban traffic management, directly impacting traffic efficiency, network stability, and environmental performance [1]. Over the past decade, adaptive and intelligent TSC approaches have become essential tools for mitigating congestion. These methods adjust signal timings based on real-time traffic conditions, helping to reduce delays and improve throughput. Among these approaches, Reinforcement Learning (RL), particularly Deep Reinforcement Learning (DRL), has emerged as a promising paradigm capable of capturing complex traffic dynamics through interaction with the environment [2].

In real-world traffic networks, intersections are inherently interdependent: the conditions at one intersection are influenced by upstream inflows and downstream congestion, forming tightly coupled spatial dependencies. This complexity becomes more pronounced when multiple intersections share major traffic flows or transit routes. As such, isolated signal optimization is often insufficient. Recent work has explored Multi-Agent Reinforcement Learning (MARL) to coordinate control across multiple intersections via distributed agents. These decentralized approaches offer scalability and robustness but require careful coordination strategies to avoid myopic or conflicting decisions [3].

Challenges in Coordination and Perception

A critical open issue remains: (i) how intersections can effectively exchange and process relevant information, and (ii) to what extent an intersection is interlinked with others [4]. In most practical deployments, controllers use data only from signalized intersections, without considering the impact of non-signalized nodes (e.g., roundabouts or priority-to-the-right junctions), which are common in urban networks. These elements can significantly affect the dynamics of nearby controlled intersections.

This issue can be interpreted as a Partial Observability problem, similar to those encountered when deploying agents in real-world scenarios. There is thus a need to develop models capable of capturing heterogeneous neighborhood effects—i.e., identifying which nearby nodes influence a given intersection and integrating only the relevant information into the decision-making process [4,5].

When such neighborhood information is incorporated into the agent controlling an intersection [4], the process is typically mono-directional: the surrounding context is used to enhance the agent's perception, but without introducing a truly mutual relationship aimed at cooperation. As a result, agents tend to maintain selfish decisions, with little consideration for the impact on their surroundings, even if their perception is augmented by local context.

Need for Dynamic Protection and Proactive Coordination

Furthermore, even with sophisticated multi-agent control, oversaturated conditions (e.g., during rush hours or major public events) can lead to gridlock and systemic collapse due to spillback effects. To address this, the concept of Perimeter Control has been proposed [6], which involves restricting vehicle inflows into high-demand areas to preserve internal flow conditions. However, most existing approaches rely on static boundaries and centralized coordination, limiting scalability, transferability, and adaptability to real-time changes.

There is a pressing need for adaptive, agent-driven perimeter protection, capable of dynamically identifying and regulating protected zones based on local observations and decentralized operations [7]. Achieving this requires developing agents with local perception and control, capable of exchanging information with neighbors to foster cooperative behaviors. This is a key step toward the emergence of self-organized, proactive traffic management strategies, particularly in the context of spatially dynamic protected networks.

Embracing Multi-Modality and Multi-Objective Optimization

Managing the multi-objectivity and multi-modality of urban traffic is also becoming increasingly essential. Urban intersections accommodate a wide variety of users, including private vehicles, freight, bicycles, pedestrians, and public transit. Buses, in particular, are sensitive to signal timing and congestion, requiring headway regularity to avoid bunching and ensure reliable service.

Despite some recent progress [8], most RL-based TSC approaches still fail to model real-world bus dynamics, such as open-loop operations or heterogeneous passenger demand. Beyond multi-modality, mult



  • Lyon, Auvergne-Rhône-Alpes, France Université Claude Bernard Lyon 1 - DISP Lab Temps plein

    Vision-Based Automatic Calibration and Synchronization of Digital Twins for Cyber-Physical Production Systems Using Neuromorphic SensorsRéf ABG-134424Stage master 2 / IngénieurDurée 6 moisSalaire net mensuel 60018/11/2025Université Claude Bernard Lyon 1 - DISP LabLieu de travailLyon Auvergne-Rhône-Alpes FranceChamps scientifiquesSciences de...

  • Stagiaire en Deep Learning

    il y a 5 heures


    Lyon, France SNCF RESEAU Temps plein

    **À propos du poste** Nous recherchons un stagiaire ou une stagiaire motivé(e) et curieux(se) pour rejoindre notre équipe. Ce stage propose une immersion dans la conception et l’industrialisation de solutions Deep Learning pour la maintenance prédictive, avec des livrables concrets et exploitables par les équipes de...


  • Lyon, Auvergne-Rhône-Alpes, France Inria Temps plein

    Le descriptif de l'offre ci-dessous est en AnglaisType de contrat : CDDContrat renouvelable : OuiNiveau de diplôme exigé : Thèse ou équivalentFonction : Post-DoctorantA propos du centre ou de la direction fonctionnelleThe Inria research centre in Lyon is the 9th Inria research centre, formally created in January 2022. It brings together approximately 300...

  • Stage en Machine Learning

    il y a 6 jours


    Lyon 7e, France Kurage Temps plein

    At Kurage, we're a passionate team leveraging technological innovation to redefine the rehabilitation of hemiplegic patients. Our approach involves integrating Artificial Intelligence into our neuroprostheses to enable rehabilitation through physical activity, opening new horizons for individuals with impaired motor functions. **Job Description**: As an...


  • Lyon, France Inria Temps plein

    Le descriptif de l’offre ci-dessous est en Anglais_ **Type de contrat **:CDD **Niveau de diplôme exigé **:Bac + 5 ou équivalent **Fonction **:Doctorant **A propos du centre ou de la direction fonctionnelle**: The Centre Inria de l’Université de Grenoble groups together almost 600 people in 22 research teams and 7 research support...


  • Lyon, France Sword Services Temps plein

    We are opening a position for a Network and Security Administrator to strengthen our client’s team based in Lyon. This role aims to enhance the unit’s expertise in IT networking to streamline operations and support project-related technical decisions. **Responsabilities**: - Provide tier-2 support: handle escalated incidents, troubleshoot complex...


  • Lyon, France Nova In Silico company Temps plein

    Nova In Silico is a health tech company that develops an in silico clinical trial platform jinkō to simulate drug efficacy and optimize clinical development using virtual patients and disease modeling. As an innovative company, we offer a dynamic work environment distinct from larger, established organizations. Interns will gain significant responsibilities...

  • Data Scientist

    il y a 1 semaine


    Lyon, France Axens Temps plein

    **The Digital Innovation Department is looking for a** **Data Scientist - Specializing in NLP (F/M)** Localisation : Rueil-Malmaison **Axens Presentation**: Axens Group provides a complete range of solutions for the conversion of oil and biomass to cleaner fuels, the production and purification of major petrochemical intermediates, the chemical recycling...

  • Regional Sales Manager

    il y a 2 semaines


    Lyon, France Palo Alto Networks Temps plein

    Company Description **Our Mission** At Palo Alto Networks® everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. We have the vision of a world where each day is safer and more secure than the one before. These aren’t easy goals to accomplish - but we’re not here for easy. We’re...

  • Doctor in Acoustics

    il y a 2 semaines


    Lyon, France hap2U Temps plein

    Job Title Doctor in Acoustics Job description You will work under the responsibility of the technical director. As an acoustic expert, you will have to develop new features based on your own skills and knowledge and in collaboration with the technical director. You will also participate in customer projects with large companies, mainly in the automotive...