Use Subsampling to Solve Imbalanced Dataset Problem for Automatic Incident Detection Algorithm

Authors :
Miao Hua Li
Shu Yan Chen
Source :
Applied Mechanics and Materials. :2114-2119
Publication Year :
2014
Publisher :
Trans Tech Publications, Ltd., 2014.

Abstract

Because traffic incident data are rare compared with the large amount of normal traffic state data in the real world, we proposed an Automatic Incident Detection (AID) algorithm based on a subsampling method. First, an improved subsampling method based on the Edited Nearest Neighbor (ENN) rule was used to reconstruct the training set into a relatively balanced dataset. Then, a Support Vector Machine (SVM) was adopted as the classifier to detect traffic incidents. Real traffic data collected from the I-880 freeway in the United States were used to build the model and test the performance of the proposed AID algorithm. In addition, we compared the detection performance of the AID algorithm trained on the original training set with that of the algorithm trained on the relatively balanced training set. The experimental results show that the proposed subsampling-based AID algorithm is suitable for imbalanced datasets and achieves better detection performance.
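
The subsampling step described in the abstract can be illustrated with a minimal sketch. This is the plain ENN rule applied only to the majority (normal traffic) class, not the authors' improved variant, whose details the abstract does not give. The function name `enn_undersample`, the parameter `k`, the label encoding (0 = normal, 1 = incident), and the toy data are all assumptions made for illustration.

```python
import math
from collections import Counter


def enn_undersample(X, y, majority_label=0, k=3):
    """Plain ENN rule on the majority class (hypothetical helper, not the paper's method).

    A majority-class sample is dropped when the majority vote of its k nearest
    neighbors disagrees with its own label. Minority samples are always kept.
    """
    keep = []
    for i, (xi, yi) in enumerate(zip(X, y)):
        if yi != majority_label:
            keep.append(i)  # never drop minority (incident) samples
            continue
        # Indices of the k nearest other samples by Euclidean distance.
        neighbors = sorted(
            (j for j in range(len(X)) if j != i),
            key=lambda j: math.dist(xi, X[j]),
        )[:k]
        votes = Counter(y[j] for j in neighbors)
        # Keep the sample only if its neighbors' majority label matches its own.
        if votes.most_common(1)[0][0] == yi:
            keep.append(i)
    return [X[i] for i in keep], [y[i] for i in keep]


# Toy data (invented): label 0 = normal traffic, label 1 = incident.
X = [[0, 0], [0, 1], [1, 0], [1, 1], [5, 5], [5, 6], [6, 5], [5.2, 5.2]]
y = [0, 0, 0, 0, 1, 1, 1, 0]

X_res, y_res = enn_undersample(X, y, majority_label=0, k=3)
print(len(X), "->", len(X_res))
```

In this toy set, the normal-traffic sample at (5.2, 5.2) lies among incident samples and is removed, while every incident sample is retained. The reduced training set would then be passed to an SVM classifier (for example scikit-learn's `sklearn.svm.SVC`), which is omitted here to keep the sketch dependency-free. An odd `k` avoids tied votes in the two-class case.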

Details

ISSN :
16627482
Database :
OpenAIRE
Journal :
Applied Mechanics and Materials
Accession number :
edsair.doi...........e477fc45ed286e788e2ffe2a4375bc2d