
A lightweight model combining convolutional neural network and Transformer for driver distraction recognition.

Authors :
Tang, Xuexi
Chen, Yan
Ma, Yifan
Yang, Wenxuan
Zhou, Houpan
Huang, Jingzhou
Source :
Engineering Applications of Artificial Intelligence. Jun2024, Vol. 132, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Driver distraction recognition has been studied by many researchers. However, most studies have failed to balance model efficiency and accuracy. In this study, a lightweight network called CaTNet is proposed. CaTNet is a simplified framework based on the existing ConvNeXt model, with redundant feature layers pruned. A new module, CaT, which contains self-attention, is introduced in tandem, and the two are combined to enhance feature representation. The network captures long-range dependencies while retaining the local inductive bias provided by the convolutional neural network (CNN). The proposed method is verified on the American University in Cairo (AUC) dataset and the State Farm Distracted Driver Detection (SFD3) dataset. CaTNet achieves 94.82% and 99.91% Top-1 accuracy, respectively, with only 2.832M model parameters, and runs at 12.44 frames per second (FPS) on a Jetson Nano. These results are superior to those of other existing models. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0952-1976
Volume :
132
Database :
Academic Search Index
Journal :
Engineering Applications of Artificial Intelligence
Publication Type :
Academic Journal
Accession number :
177088654
Full Text :
https://doi.org/10.1016/j.engappai.2024.107910