TRACL: Temporal reconstruction and adaptive consistency loss for semi‐supervised video semantic segmentation

Authors :: Zhixue Liang
Wenyong Dong
Bo Zhang
Source :: IET Image Processing, Vol 18, Iss 2, Pp 348-361 (2024)
Publication Year :: 2024
Publisher :: Wiley, 2024.
Abstract: While existing supervised semantic segmentation methods have shown significant performance improvements, they rely heavily on large‐scale pixel‐level annotated data. To reduce this dependence, recent research has proposed semi‐supervised learning‐based methods that have achieved great success. However, almost all of these works address image semantic segmentation, while semi‐supervised video semantic segmentation (SVSS) has barely been explored. Because video and image data differ significantly, simply adapting semi‐supervised image semantic segmentation approaches to SVSS may neglect the inherent temporal correlations between video frames. This paper presents a novel method (named TRACL) with temporal reconstruction (TR) and adaptive consistency loss (ACL) for SVSS, aiming to fully utilize the temporal relations between the internal frames of a video clip. The authors’ TR method performs reconstruction at both the feature and output levels to narrow the distribution gap between internal video frames. Specifically, considering the underlying data distribution, the authors construct a Gaussian model for each category and use its probability density function to obtain the similarity between different feature maps for temporal feature reconstruction. The authors’ ACL adaptively selects between two pixel‐wise consistency losses, the Flow Consistency Loss and the Reconstruction Consistency Loss, providing stronger supervision signals for unlabelled frames during model training. Additionally, the authors extend their method to unlabelled video, gaining more training data by employing a mean‐teacher structure. Extensive experiments on three datasets, Cityscapes, CamVid and VSPW, demonstrate that the authors’ proposed method outperforms previous state‐of‐the‐art methods.
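
The abstract mentions a mean‐teacher structure without giving details. As a rough illustration of the general idea only (not the authors' implementation), a mean‐teacher setup usually keeps a "teacher" copy of the model whose weights are an exponential moving average (EMA) of the "student" weights. The function name `ema_update`, the `decay` value, and the dictionary‐of‐lists weight layout below are all hypothetical choices made for this sketch.

```python
def ema_update(teacher, student, decay=0.99):
    """Return new teacher weights as an EMA of the student weights.

    Generic mean-teacher update, assumed here for illustration:
        new_teacher = decay * teacher + (1 - decay) * student
    `teacher` and `student` map parameter names to lists of floats
    (a hypothetical stand-in for real model tensors).
    """
    if teacher.keys() != student.keys():
        raise ValueError("teacher and student must share parameter names")
    return {
        name: [
            decay * t + (1.0 - decay) * s
            for t, s in zip(teacher[name], student[name])
        ]
        for name in teacher
    }


# Toy usage: one update step with made-up weights.
teacher = {"w": [0.0, 1.0]}
student = {"w": [1.0, 1.0]}
teacher = ema_update(teacher, student, decay=0.9)
```

In a setup of this kind the teacher typically produces the targets for unlabelled data while only the student is trained by gradient descent; how TRACL combines this with its consistency losses is described in the full text.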

Subjects :: adaptive consistency loss
temporal reconstruction
video semantic segmentation
Photography
TR1-1050
Computer software
QA76.75-76.765

Details

Language :: English
ISSN :: 1751-9667 and 1751-9659
Volume :: 18
Issue :: 2
Database :: Directory of Open Access Journals
Journal :: IET Image Processing
Publication Type :: Academic Journal
Accession number :: edsdoj.6953ddc0472741dc99678d1477f96b66
Document Type :: article
Full Text :: https://doi.org/10.1049/ipr2.12952
