基于多维时空层递的交通信号分布式强化学习方法
王福建, 范诚睿, 周斌, 封春房, 马东方
Traffic Signal Decentralized Reinforcement Learning Method Based on a Multi-perspective Spatio-temporal Hierarchical Structure
WANG Fu-jian, FAN Cheng-rui, ZHOU Bin, FENG Chun-fang, MA Dong-fang
中国公路学报 . 2024, (7): 250 -263 .  DOI: 10.19721/j.cnki.1001-7372.2024.07.020