<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Haddad, Tarek-Amine</style></author><author><style face="normal" font="default" size="100%">Djalal HEDJAZI</style></author><author><style face="normal" font="default" size="100%">Sofiane Aouag</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">A deep reinforcement learning-based cooperative approach for multi-intersection traffic signal control</style></title><secondary-title><style face="normal" font="default" size="100%">Engineering Applications of Artificial Intelligence</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2022</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://doi.org/10.1016/j.engappai.2022.105019</style></url></web-urls></urls><volume><style face="normal" font="default" size="100%">114</style></volume><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p style=&quot;text-align: justify;&quot;&gt;
	Recently, Adaptive Traffic Signal Control (ATSC) in the multi-intersection system is considered as one of the most critical issues in&amp;nbsp;Intelligent Transportation Systems&amp;nbsp;(ITS). Among the proposed&amp;nbsp;AI-based approaches,&amp;nbsp;Deep Reinforcement Learning&amp;nbsp;(DRL) has been largely applied while showing better performances. This paper proposes a new&amp;nbsp;DRL-based cooperative approach for controlling multiple intersections. The problem is modelled as a Multi-Agent Reinforcement Learning (&lt;em&gt;MARL&lt;/em&gt;) system, while each agent is trained to select the best action to control an intersection by obtaining information about its local lanes state. The cooperation aspect is manifested in this approach by considering the effect of the state, action and reward of neighbour agents in the process of policy learning. An&amp;nbsp;intersection controller&amp;nbsp;applies a Deep Q-Network (DQN) method, while&amp;nbsp;transferring&amp;nbsp;state, action and reward received from their neighbour agents to its own loss function during the learning process. The experimental results under different scenarios shows that the proposed approach outperforms many state-of-the-art approaches in terms of three metrics: Average Waiting Time (AWT),&amp;nbsp;Average Queue Length&amp;nbsp;(AQL) and Average Emission CO&lt;sub&gt;2&lt;/sub&gt;&amp;nbsp;(AEC). In addition, the cooperation between the different trained&amp;nbsp;&lt;em&gt;DRL&lt;/em&gt;-based controllers allows the system to continuously learn and improve its performance by interacting with the environment, particularly when the traffic is congested.
&lt;/p&gt;
</style></abstract></record></records></xml>