Back to Search Start Over

<inline-formula><tex-math notation="LaTeX">$\text{Offset}^{3}\text{Net}$</tex-math></inline-formula>: Simple Joint 3-D Detection and Tracking With Three-Step Offset Learning

Authors :
Sun, Jing
Ji, Yi-Mu
He, Jing
Wu, Fei
Sun, Yanfei
Source :
IEEE Transactions on Industrial Informatics; February 2024, Vol. 20 Issue: 2 p2284-2294, 11p
Publication Year :
2024

Abstract

Light-detection-and-ranging-based multiobject detection and tracking play fundamental roles in autonomous driving systems. Most existing detection and tracking methods inevitably require complex pairing permutations for object association across frames, making the framework slow. Moreover, the occlusion and viewpoint changes lead to missed and false detection. To solve the abovementioned issues, this article proposes a simple joint 3-D detection and tracking approach with three-step offset learning (&lt;inline-formula&gt;&lt;tex-math notation=&quot;LaTeX&quot;&gt;$\text{Offset}^{3}\text{Net}$&lt;/tex-math&gt;&lt;/inline-formula&gt;). Specifically, &lt;inline-formula&gt;&lt;tex-math notation=&quot;LaTeX&quot;&gt;$\text{Offset}^{3}\text{Net}$&lt;/tex-math&gt;&lt;/inline-formula&gt; incorporates three task-specific output subnetworks to learn three offsets: 1) center offset, 2) motion offset, and 3) association offset. The learning of abovementioned offsets eliminates the complex bipartite matching processing. Specifically, the center offset guides the model to generate precise detections, whereas the motion offset transforms the track from the previous frame to the current frame, and the association offset minimizes the distance between detection and motion-updated track of the same object. Then, a simple read-off operation is conducted for data association on a hybrid-time centerness map, which represents the detections and offset-updated tracks. In addition, we design a detection-feature-enhanced module that captures the temporal coherence of the object motion and appearance information, avoiding the missed and false detection. Experiments on nuScenes have demonstrated the effectiveness of our &lt;inline-formula&gt;&lt;tex-math notation=&quot;LaTeX&quot;&gt;$\text{Offset}^{3}\text{Net}$&lt;/tex-math&gt;&lt;/inline-formula&gt; in terms of accuracy and speed compared with most 3-D detection and tracking methods.

Details

Language :
English
ISSN :
15513203
Volume :
20
Issue :
2
Database :
Supplemental Index
Journal :
IEEE Transactions on Industrial Informatics
Publication Type :
Periodical
Accession number :
ejs65300909
Full Text :
https://doi.org/10.1109/TII.2023.3290184