Optimizing green splits in high-dimensional traffic signal control with trust region Bayesian optimization

Yunhai Gong, Shaopeng Zhong*, Shengchuan Zhao, Feng Xiao, Wenwen Wang, Yu Jiang*

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

21 Downloads (Orbit)

Abstract

Centralized traffic signal control has long been a challenging, high-dimensional optimization problem. This study establishes a simulation-based optimization framework and develops a novel optimization algorithm based on trust region Bayesian optimization (TuRBO), which can efficiently obtain an approximate optimal solution to the high-dimensional traffic signal control problem. Local Gaussian process (GP), trust region, and Thompson sampling are employed in the TuRBO and contribute considerably to performance in terms of computational speed, solution quality, and scalability. Empirical studies are carried out using data from Mudanjiang and Chengdu, China. The performance of TuRBO is compared with that of Bayesian optimization (BO), genetic algorithm and random sampling. The results show that TuRBO converges the fastest because of its ability to balance exploration and exploitation through the trust region and Thompson sampling. Meanwhile, because TuRBO enables more efficient exploitation through the local GP, the solution quality of TuRBO outperforms others significantly. The average waiting time achieved by TuRBO was 2.84% lower than that achieved by BO. Finally, the method has been successfully extended to a large network with 233-dimensional spaces and 122 signalized intersections, demonstrating that the developed methodology can deal with high-dimensional traffic signal control effectively for real case applications.

Original languageEnglish
JournalComputer-Aided Civil and Infrastructure Engineering
Number of pages23
ISSN1093-9687
DOIs
Publication statusAccepted/In press - 2025

Fingerprint

Dive into the research topics of 'Optimizing green splits in high-dimensional traffic signal control with trust region Bayesian optimization'. Together they form a unique fingerprint.

Cite this