1. Time and Distance Gaps of Primary-Secondary Crashes Prediction and Analysis Using Random Forests and SHAP Model
- Author
-
Xinyuan Liu, Jinjun Tang, Fan Gao, and Xizhi Ding
- Subjects
Economics and Econometrics ,Article Subject ,Strategy and Management ,Mechanical Engineering ,Automotive Engineering ,Computer Science Applications - Abstract
Secondary crashes (SCs) are typically defined as the crash that occurs within the spatiotemporal boundaries of the impact area of the primary crashes (PCs), which will intensify traffic congestion and induce a series of road safety issues. Predicting and analyzing the time and distance gaps between the SCs and PCs will help to prevent the occurrence of SCs. In this paper, a combined data-driven method of static and dynamic approaches is applied to identify SCs. Then, the random forests (RF) method is implemented to predict the two gaps using temporal, primary crash, roadway, and real-time traffic characteristics data collected from 2016 to 2019 at California interstate freeways. Subsequently, the SHapley Additive explanation (SHAP) approach is employed to interpret the RF outputs. The results show that the traffic volume, speed, lighting, and population are considered the most significant factors in both gaps. Furthermore, the main and interaction effects of factors are also quantified. High volume possibly promotes the time and distance gaps, while low volume inhibits them. And volume affects the distance gap inconsiderably when it falls between 300 and 400 veh/5 min. Traffic conditions with high speed and low volume are strongly associated with short-time and short-distance gaps. Darker surroundings probably accelerate the occurrence of SCs. Moreover, crashes involving the violation categories of improper turns or unsafe lane changes likely result in long time and distance gaps. These results have important implications for proposing traffic management and improving road safety.
- Published
- 2023
- Full Text
- View/download PDF