Abstract: Deep reinforcement learning (DRL)-based traffic signal control (TSC) algorithms often suffer from overfitting to static training environments and perform poorly in unseen traffic scenarios.