Start Over

The inconvenient truth of ground truth errors in automotive datasets and DNN-based detection

Authors :: Pak Hung Chan
Boda Li
Gabriele Baris
Qasim Sadiq
Valentina Donzella
Source :: Data-Centric Engineering, Vol 5 (2024)
Publication Year :: 2024
Publisher :: Cambridge University Press, 2024.
Abstract: Assisted and automated driving functions will rely on machine learning algorithms, given their ability to cope with real-world variations, e.g. vehicles of different shapes, positions, colors, and so forth. Supervised learning needs annotated datasets, and several automotive datasets are available. However, these datasets are tremendous in volume, and labeling accuracy and quality can vary across different datasets and within dataset frames. Accurate and appropriate ground truth is especially important for automotive, as “incomplete” or “incorrect” learning can negatively impact vehicle safety when these neural networks are deployed. This work investigates the ground truth quality of widely adopted automotive datasets, including a detailed analysis of KITTI MoSeg. According to the identified and classified errors in the annotations of different automotive datasets, this article provides three different criteria collections for producing improved annotations. These criteria are enforceable and applicable to a wide variety of datasets. The three annotations sets are created to (i) remove dubious cases; (ii) annotate to the best of human visual system; and (iii) remove clear erroneous BBs. KITTI MoSeg has been reannotated three times according to the specified criteria, and three state-of-the-art deep neural network object detectors are used to evaluate them. The results clearly show that network performance is affected by ground truth variations, and removing clear errors is beneficial for predicting real-world objects only for some networks. The relabeled datasets still present some cases with “arbitrary”/“controversial” annotations, and therefore, this work concludes with some guidelines related to dataset annotation, metadata/sublabels, and specific automotive use cases.

Subjects :: machine learning
automated vehicles
automotive dataset
labeling
Engineering (General). Civil engineering (General)
TA1-2040

Details

Language :: English
ISSN :: 26326736
Volume :: 5
Database :: Directory of Open Access Journals
Journal :: Data-Centric Engineering
Publication Type :: Academic Journal
Accession number :: edsdoj.bbd861b1714c45b5977d7db5a19f931e
Document Type :: article
Full Text :: https://doi.org/10.1017/dce.2024.39

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

The inconvenient truth of ground truth errors in automotive datasets and DNN-based detection

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

The inconvenient truth of ground truth errors in automotive datasets and DNN-based detection

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources