基于分层柔性演员-评论家强化学习的交叉口信号配时-车辆轨迹联合优化方法
马莹莹, 李腾, 梁韵逸, 唐蒙
A Method for Joint Optimization of Signal Timing and Vehicle Trajectories at Intersections Based on Hierarchical Soft Actor-Critic Reinforcement Learning
MA Yingying, LI Teng, LIANG Yunyi, TANG Meng
华南理工大学学报(自然科学版) . 2025, (12): 1 -16 .  DOI: 10.12141/j.issn.1000-565X.240549