20 results on '"semi-supervised object detection"'
Search Results
2. Semi-supervised Underwater Object Detection Using Grid Marker-Assisted Image Enhancement
- Author
-
Xia, Caixia, Zhou, Wenzhang, Zhang, Caiyu, Fan, Baojie, Goos, Gerhard, Series Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Lan, Xuguang, editor, Mei, Xuesong, editor, Jiang, Caigui, editor, Zhao, Fei, editor, and Tian, Zhiqiang, editor
- Published
- 2025
- Full Text
- View/download PDF
3. DRCO: Dense-Label Refinement and Cross Optimization for Semi-Supervised Object Detection
- Author
-
Yunlong Qin, Yanjun Li, Feifan Ji, Yan Liu, Yu Wang, and Ji Xiang
- Subjects
Semi-supervised object detection ,object detection ,semi-supervised learning ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
In semi-supervised object detection (SSOD), the methods based on dense pseudo-labeling bypass complex post-processing while maintaining competitive performance compared to the methods based on sparse pseudo-labeling. However, there are still relatively few researches focused on the dense pseudo-labeling paradigm. In this work, we first experimentally point out the shortcomings of current dense pseudo-labeling methods: 1) Low-quality sampling: the fixed threshold strategies can result in numerous false negatives and false positives. 2) Inconsistency between classification scores and localization quality: classification scores cannot represent localization quality, resulting in poor quality of predicted bounding boxes for sampled positive samples. 3) Suboptimal training approach: current training methods only utilize its knowledge from the perspective of final dense pseudo-labels, failing to fully exploit the teacher model. To address these issues, we propose a method called Dense-Label Refinement and Cross Optimization (DRCO) based on dense pseudo-labels. Specifically, to tackle the issue of low-quality sampling, we introduce the Adaptive Sampling Approach (ASA), which achieves high-quality sampling at the image level and dynamic sampling ratios without introducing any additional hyperparameters. For the inconsistency between classification scores and localization quality, Comprehensive FRS (cFRS) is proposed to jointly optimize the classification and localization branches more efficiently, thereby obtaining a more comprehensive score. Finally, for the suboptimal training approach, we introduce Cross Prediction Optimization (CPO). CPO efficiently leverages the knowledge of the teacher model through a Cross-Head operation, thereby achieving more effective teacher-student interaction. DRCO achieves 27.31% mAP with only 1% COCO labeled data, which is approximately 1.24% mAP higher than the previous state-of-the-art. Based on MS COCO and PASCAL VOC benchmarks, further comprehensive experiments demonstrate that our method alleviates the aforementioned shortcomings and achieves competitive performance.
- Published
- 2025
- Full Text
- View/download PDF
4. STOD: toward semi-supervised tiny object detection.
- Author
-
Guo, Yanan, Feng, Yuxin, Du, Kangning, and Cao, Lin
- Subjects
- *
MINIATURE objects , *GAUSSIAN mixture models , *DETECTORS - Abstract
Semi-supervised object detection aims to enhance object detectors by utilizing a large number of unlabeled images, which has gained increasing attention in natural scenes. However, when these methods are directly applied to scenes with tiny objects, they face the challenge of selecting pseudo-labels with high localization quality due to the minuscule and blurred characteristics of these objects. To address this issue, we propose a novel method called semi-supervised tiny object detection (STOD). Firstly, to enhance the localization accuracy of pseudo-labels, we design a dense IoU-aware head that evaluates the quality of bounding box localization by incorporating additional predicted overlap values. Secondly, to mine more potential pseudo-labels, we propose a GMM-based multi-threshold pseudo-labels mining module that dynamically generates multiple thresholds using classification scores and overlap values to classify bounding boxes into strong positive and weak positive pseudo-labels. Lastly, we design the localization-aware weighting loss to incorporate the localization quality of both positive and negative samples in order to enhance the accuracy of pseudo-label localization. The experimental results show that STOD achieves comparable performance when compared to both fully and semi-supervised methods. Notably, on the VisDrone-Partial benchmark, STOD achieves outstanding results by outperforming our baseline model with improvements of 3.1 mAP, 5.7 A P 50 , and 3.0 A P 75 . [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Adaptive Adversarial Self-Training for Semi-Supervised Object Detection in Complex Maritime Scenes.
- Author
-
Feng, Junjian, Tian, Lianfang, and Li, Xiangxia
- Subjects
- *
OBJECT recognition (Computer vision) , *DATA augmentation , *DETECTORS , *SPEED - Abstract
Semi-supervised object detection helps to monitor and manage maritime transportation effectively, saving labeling costs. Currently, many semi-supervised object detection methods use a combination of data augmentation and pseudo-label to improve model performance. However, these methods may get into trouble in complex maritime scenes, including occlusion, scale variations and lighting variations, leading to distribution bias between labeled data and unlabeled data and pseudo-label bias. To address these problems, we propose a semi-supervised object detection method in complex maritime scenes based on adaptive adversarial self-training, which provides a teacher–student detection framework to use a robust pseudo-label with data augmentation. The proposed method contains two modules called adversarial distribution discriminator and label adaptive assigner. The adversarial distribution discriminator is proposed to match the distribution between augmented data generated from different data augmentations, while the label adaptive assigner is proposed to reduce the labeling bias for unlabeled data so that the pseudo-label of unlabeled data contributes to the detection performance effectively. Experimental results show that the proposed method achieves a better mean average precision of 91.4%, with only 5% of the labeled samples compared with other semi-supervised object detection methods, and its detection speed is 11.1 frames per second. Experiments also demonstrate that the proposed method improves the detection performance compared with fully supervised detectors. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. SAR-CDSS: A Semi-Supervised Cross-Domain Object Detection from Optical to SAR Domain.
- Author
-
Luo, Cheng, Zhang, Yueting, Guo, Jiayi, Hu, Yuxin, Zhou, Guangyao, You, Hongjian, and Ning, Xia
- Subjects
- *
OBJECT recognition (Computer vision) , *SYNTHETIC aperture radar , *TRANSFORMER models , *CONVOLUTIONAL neural networks , *FEATURE extraction , *SUPERVISED learning , *SPACE-based radar , *KNOWLEDGE transfer - Abstract
The unique imaging modality of synthetic aperture radar (SAR) has posed significant challenges for object detection, making it more complex to acquire and interpret than optical images. Recently, numerous studies have proposed cross-domain adaptive methods based on convolutional neural networks (CNNs) to promote SAR object detection using optical data. However, existing cross-domain methods focus on image features, lack improvement on input data, and ignore the valuable supervision provided by few labeled SAR images. Therefore, we propose a semi-supervised cross-domain object detection framework that uses optical data and few SAR data to achieve knowledge transfer for SAR object detection. Our method focuses on the data processing aspects to gradually reduce the domain shift at the image, instance, and feature levels. First, we propose a data augmentation method of image mixing and instance swapping to generate a mixed domain that is more similar to the SAR domain. This method fully utilizes few SAR annotation information to reduce domain shift at image and instance levels. Second, at the feature level, we propose an adaptive optimization strategy to filter out mixed domain samples that significantly deviate from the SAR feature distribution to train feature extractor. In addition, we employ Vision Transformer (ViT) as feature extractor to handle the global feature extraction of mixed images. We propose a detection head based on normalized Wasserstein distance (NWD) to enhance objects with smaller effective regions in SAR images. The effectiveness of our proposed method is evaluated on public SAR ship and oil tank datasets. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Wheat Teacher: A One-Stage Anchor-Based Semi-Supervised Wheat Head Detector Utilizing Pseudo-Labeling and Consistency Regularization Methods.
- Author
-
Zhang, Rui, Yao, Mingwei, Qiu, Zijie, Zhang, Lizhuo, Li, Wei, and Shen, Yue
- Subjects
OBJECT recognition (Computer vision) ,WHEAT breeding ,TEACHER development ,DETECTORS ,SCHOOL principals ,WHEAT - Abstract
Wheat breeding heavily relies on the observation of various traits during the wheat growth process. Among all traits, wheat head density stands out as a particularly crucial characteristic. Despite the realization of high-throughput phenotypic data collection for wheat, the development of efficient and robust models for extracting traits from raw data remains a significant challenge. Numerous fully supervised target detection algorithms have been employed to address the wheat head detection problem. However, constrained by the exorbitant cost of dataset creation, especially the manual annotation cost, fully supervised target detection algorithms struggle to unleash their full potential. Semi-supervised training methods can leverage unlabeled data to enhance model performance, addressing the issue of insufficient labeled data. This paper introduces a one-stage anchor-based semi-supervised wheat head detector, named "Wheat Teacher", which combines two semi-supervised methods, pseudo-labeling, and consistency regularization. Furthermore, two novel dynamic threshold components, Pseudo-label Dynamic Allocator and Loss Dynamic Threshold, are designed specifically for wheat head detection scenarios to allocate pseudo-labels and filter losses. We conducted detailed experiments on the largest wheat head public dataset, GWHD2021. Compared with various types of detectors, Wheat Teacher achieved a mAP0.5 of 92.8% with only 20% labeled data. This result surpassed the test outcomes of two fully supervised object detection models trained with 100% labeled data, and the difference with the other two fully supervised models trained with 100% labeled data was within 1%. Moreover, Wheat Teacher exhibits improvements of 2.1%, 3.6%, 5.1%, 37.7%, and 25.8% in mAP0.5 under different labeled data usage ratios of 20%, 10%, 5%, 2%, and 1%, respectively, validating the effectiveness of our semi-supervised approach. These experiments demonstrate the significant potential of Wheat Teacher in wheat head detection. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
8. A Semi-Supervised Object Detection Method for Close Range Detection of Spacecraft and Space Debris
- Author
-
Zhang, Huan, Zhang, Yang, Feng, Qingjuan, and Zhang, Kebei
- Published
- 2024
- Full Text
- View/download PDF
9. Semi-Supervised Person Detection in Aerial Images with Instance Segmentation and Maximum Mean Discrepancy Distance.
- Author
-
Zhang, Xiangqing, Feng, Yan, Zhang, Shun, Wang, Nan, Mei, Shaohui, and He, Mingyi
- Subjects
- *
RESCUE work , *IMAGE segmentation , *LOW vision , *DETECTORS , *ENTROPY - Abstract
Detecting sparse, small, lost persons with only a few pixels in high-resolution aerial images was, is, and remains an important and difficult mission, in which a vital role is played by accurate monitoring and intelligent co-rescuing for the search and rescue (SaR) system. However, many problems have not been effectively solved in existing remote-vision-based SaR systems, such as the shortage of person samples in SaR scenarios and the low tolerance of small objects for bounding boxes. To address these issues, a copy-paste mechanism (ISCP) with semi-supervised object detection (SSOD) via instance segmentation and maximum mean discrepancy distance is proposed (MMD), which can provide highly robust, multi-task, and efficient aerial-based person detection for the prototype SaR system. Specifically, numerous pseudo-labels are obtained by accurately segmenting the instances of synthetic ISCP samples to obtain their boundaries. The SSOD trainer then uses soft weights to balance the prediction entropy of the loss function between the ground truth and unreliable labels. Moreover, a novel evaluation metric MMD for anchor-based detectors is proposed to elegantly compute the IoU of the bounding boxes. Extensive experiments and ablation studies on Heridal and optimized public datasets demonstrate that our approach is effective and achieves state-of-the-art person detection performance in aerial images. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Wheat Teacher: A One-Stage Anchor-Based Semi-Supervised Wheat Head Detector Utilizing Pseudo-Labeling and Consistency Regularization Methods
- Author
-
Rui Zhang, Mingwei Yao, Zijie Qiu, Lizhuo Zhang, Wei Li, and Yue Shen
- Subjects
digital agriculture ,deep learning ,semi-supervised object detection ,wheat head detection ,Agriculture (General) ,S1-972 - Abstract
Wheat breeding heavily relies on the observation of various traits during the wheat growth process. Among all traits, wheat head density stands out as a particularly crucial characteristic. Despite the realization of high-throughput phenotypic data collection for wheat, the development of efficient and robust models for extracting traits from raw data remains a significant challenge. Numerous fully supervised target detection algorithms have been employed to address the wheat head detection problem. However, constrained by the exorbitant cost of dataset creation, especially the manual annotation cost, fully supervised target detection algorithms struggle to unleash their full potential. Semi-supervised training methods can leverage unlabeled data to enhance model performance, addressing the issue of insufficient labeled data. This paper introduces a one-stage anchor-based semi-supervised wheat head detector, named “Wheat Teacher”, which combines two semi-supervised methods, pseudo-labeling, and consistency regularization. Furthermore, two novel dynamic threshold components, Pseudo-label Dynamic Allocator and Loss Dynamic Threshold, are designed specifically for wheat head detection scenarios to allocate pseudo-labels and filter losses. We conducted detailed experiments on the largest wheat head public dataset, GWHD2021. Compared with various types of detectors, Wheat Teacher achieved a mAP0.5 of 92.8% with only 20% labeled data. This result surpassed the test outcomes of two fully supervised object detection models trained with 100% labeled data, and the difference with the other two fully supervised models trained with 100% labeled data was within 1%. Moreover, Wheat Teacher exhibits improvements of 2.1%, 3.6%, 5.1%, 37.7%, and 25.8% in mAP0.5 under different labeled data usage ratios of 20%, 10%, 5%, 2%, and 1%, respectively, validating the effectiveness of our semi-supervised approach. These experiments demonstrate the significant potential of Wheat Teacher in wheat head detection.
- Published
- 2024
- Full Text
- View/download PDF
11. Dense Teacher: Dense Pseudo-Labels for Semi-supervised Object Detection
- Author
-
Zhou, Hongyu, Ge, Zheng, Liu, Songtao, Mao, Weixin, Li, Zeming, Yu, Haiyan, Sun, Jian, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Avidan, Shai, editor, Brostow, Gabriel, editor, Cissé, Moustapha, editor, Farinella, Giovanni Maria, editor, and Hassner, Tal, editor
- Published
- 2022
- Full Text
- View/download PDF
12. Diverse Learner: Exploring Diverse Supervision for Semi-supervised Object Detection
- Author
-
Li, Linfeng, Jiang, Minyue, Yu, Yue, Zhang, Wei, Lin, Xiangru, Li, Yingying, Tan, Xiao, Wang, Jingdong, Ding, Errui, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Avidan, Shai, editor, Brostow, Gabriel, editor, Cissé, Moustapha, editor, Farinella, Giovanni Maria, editor, and Hassner, Tal, editor
- Published
- 2022
- Full Text
- View/download PDF
13. Seamless Iterative Semi-supervised Correction of Imperfect Labels in Microscopy Images
- Author
-
Elbatel, Marawan, Bornberg, Christina, Kattel, Manasi, Almar, Enrique, Marrocco, Claudio, Bria, Alessandro, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Kamnitsas, Konstantinos, editor, Koch, Lisa, editor, Islam, Mobarakol, editor, Xu, Ziyue, editor, Cardoso, Jorge, editor, Dou, Qi, editor, Rieke, Nicola, editor, and Tsaftaris, Sotirios, editor
- Published
- 2022
- Full Text
- View/download PDF
14. Improving Localization for Semi-Supervised Object Detection
- Author
-
Rossi, Leonardo, Karimi, Akbar, Prati, Andrea, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Sclaroff, Stan, editor, Distante, Cosimo, editor, Leo, Marco, editor, Farinella, Giovanni M., editor, and Tombari, Federico, editor
- Published
- 2022
- Full Text
- View/download PDF
15. Estimating Difficulty Score of Visual Search in Images for Semi-supervised Object Detection
- Author
-
Ma, Dongliang, Zhang, Haipeng, Wu, Hao, Zhang, Tao, Sun, Jun, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Ohara, Kouzou, editor, and Bai, Quan, editor
- Published
- 2019
- Full Text
- View/download PDF
16. Boosting Semi-Supervised Few-Shot Object Detection with SoftER Teacher
- Author
-
Phi Vu Tran
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,machine learning ,Computer Vision and Pattern Recognition (cs.CV) ,few-shot object detection ,label-efficient object detection ,Computer Science - Computer Vision and Pattern Recognition ,semi-supervised object detection ,deep learning ,object detection ,computer vision ,Machine Learning (cs.LG) - Abstract
Few-shot object detection (FSOD) is an emerging problem aimed at detecting novel concepts from few exemplars. Existing approaches to FSOD assume abundant base labels to adapt to novel objects. This paper studies the task of semi-supervised FSOD by considering a realistic scenario in which both base and novel labels are simultaneously scarce. We explore the utility of unlabeled data and discover its remarkable ability to boost semi-supervised FSOD by way of region proposals. Motivated by this finding, we introduce SoftER Teacher, a robust detector combining pseudo-labeling with representation learning on region proposals, to harness unlabeled data for improved FSOD without relying on abundant labels. Extensive experiments show that SoftER Teacher surpasses the novel performance of a strong supervised detector using only 10% of required base labels, without experiencing catastrophic forgetting observed in prior approaches. Our work also sheds light on a potential relationship between semi-supervised and few-shot detection suggesting that a stronger semi-supervised detector leads to a more effective few-shot detector. The code and models are available at https://github.com/lexisnexis-risk-open-source/ledetection, Technical Report. Project page at https://github.com/lexisnexis-risk-open-source/ledetection
- Published
- 2023
- Full Text
- View/download PDF
17. Temporal self-ensembling teacher for semi-supervised object detection
- Author
-
Chen, C. (Cong), Dong, S. (Shouyang), Tian, Y. (Ye), Cao, K. (Kunlin), Liu, L. (Li), Guo, Y. (Yuanhao), Chen, C. (Cong), Dong, S. (Shouyang), Tian, Y. (Ye), Cao, K. (Kunlin), Liu, L. (Li), and Guo, Y. (Yuanhao)
- Abstract
This paper focuses on the semi-supervised object detection (SSOD) which makes good use of unlabeled data to boost performance. We face the following obstacles when adapting the knowledge distillation (KD) framework in SSOD. (1) The teacher model serves a dual role as a teacher and a student, such that the teacher predictions on unlabeled images may limit the upper bound of the student. (2) The data imbalance issue caused by the large quantity of consistent predictions between the teacher and student hinders an efficient knowledge transfer between them. To mitigate these issues, we propose a novel SSOD model called Temporal Self-Ensembling Teacher (TSET). Our teacher model ensembles its temporal predictions for unlabeled images under stochastic perturbations. Then, our teacher model ensembles its model weights with those of the student model by an exponential moving average. These ensembling strategies ensure data and model diversity, and lead to better teacher predictions for unlabeled images. In addition, we adapt the focal loss to formulate the consistency loss for handling the data imbalance issue. Together with a thresholding method, the focal loss automatically reweights the inconsistent predictions, which preserves the knowledge for difficult objects to detect in the unlabeled images. The mAP of our model reaches 80.73% and 40.52% on the VOC2007 test set and the COCO2014 minival5k set, respectively, and outperforms a strong fully supervised detector by 2.37% and 1.49%, respectively. Furthermore, the mAP of our model (80.73%) sets a new state-of-the-art performance in SSOD on the VOC2007 test set.
- Published
- 2022
18. Semi-Supervised Plant Leaf Detection and Stress Recognition
- Author
-
Antal Csizmadia, Márk and Antal Csizmadia, Márk
- Abstract
One of the main limitations of training deep learning-based object detection models is the availability of large amounts of data annotations. When annotations are scarce, semi-supervised learning provides frameworks to improve object detection performance by utilising unlabelled data. This is particularly useful in plant leaf detection and possible leaf stress recognition, where data annotations are expensive to obtain due to the need for specialised domain knowledge. This project aims to investigate the feasibility of the Unbiased Teacher, a semi-supervised object detection algorithm, for detecting plant leaves and recognising possible leaf stress in experimental settings where few annotations are available during training. We build an annotated data set for this task and implement the Unbiased Teacher algorithm. We optimise the Unbiased Teacher algorithm and compare its performance to that of a baseline model. Finally, we investigate which hyperparameters of the Unbiased Teacher algorithm most significantly affect its performance and its ability to utilise unlabelled images. We find that the Unbiased Teacher algorithm outperforms the baseline model in the experimental settings when limited annotated data are available during training. Amongst the hyperparameters we consider, we identify the confidence threshold as having the most effect on the algorithm’s performance and ability to leverage unlabelled data. Ultimately, we demonstrate the feasibility of improving object detection performance with the Unbiased Teacher algorithm in plant leaf detection and possible stress recognition when few annotations are available. The improved performance reduces the amount of annotated data required for this task, reducing annotation costs and thereby increasing usage for real-world tasks., En av huvudbegränsningarna med att träna djupinlärningsbaserade objektdetekteringsmodeller är tillgången på stora mängder annoterad data. Vid små mängder av tillgänglig data kan semi-övervakad inlärning erbjuda ett ramverk för att förbättra objektdetekteringsprestanda genom att använda icke-annoterad data. Detta är särskilt användbart vid detektering av växtblad och möjlig igenkänning av stressymptom hos bladen, där kostnaden för annotering av data är hög på grund av behovet av specialiserad kunskap inom området. Detta projekt syftar till att undersöka genomförbarheten av Opartiska Läraren (eng. ”Unbiased Teacher”), en semi-övervakad objektdetekteringsalgoritm, för att upptäcka växtblad och känna igen möjliga stressymptom hos blad i experimentella miljöer när endast en liten mängd annoterad data finns tillgänglig under träning. För att åstadkomma detta bygger vi ett annoterat dataset och implementerar Opartiska Läraren. Vi optimerar Opartiska Läraren och jämför dess prestanda med en baslinjemodell. Slutligen undersöker vi de hyperparametrar som mest påverkar Opartiska Lärarens prestanda och dess förmåga att använda icke-annoterade bilder. Vi finner att Opartiska Läraren överträffar baslinjemodellen i de experimentella inställningarna när det finns en begränsad mängd annoterad data under träningen. Bland hyperparametrarna vi överväger identifierar vi konfidensgränsen som har störst effekt på algoritmens prestanda och dess förmåga att utnyttja icke-annoterad data. Vi demonstrerar möjligheten att förbättra objektdetekteringsprestandan med Opartiska Läraren i växtbladsdetektering och möjlig stressigenkänning när få anteckningar finns tillgängliga. Den förbättrade prestandan minskar mängden annoterad data som krävs, vilket minskar anteckningskostnaderna och ökar därmed användbarheten för användning inom mer praktiska områden.
- Published
- 2022
19. Semi-övervakad detektering av växtblad och möjlig stressigenkänning
- Author
-
Antal Csizmadia, Márk
- Subjects
Computer and Information Sciences ,Deep Learning ,Semi-Supervised Object Detection ,Objektdetektering ,Semi-övervakad inlärning ,Semi-Supervised Learning ,Computer Vision ,Object Detection ,Semi-övervakad objektdetektering ,Data- och informationsvetenskap ,datorseende ,Djupinlärning - Abstract
One of the main limitations of training deep learning-based object detection models is the availability of large amounts of data annotations. When annotations are scarce, semi-supervised learning provides frameworks to improve object detection performance by utilising unlabelled data. This is particularly useful in plant leaf detection and possible leaf stress recognition, where data annotations are expensive to obtain due to the need for specialised domain knowledge. This project aims to investigate the feasibility of the Unbiased Teacher, a semi-supervised object detection algorithm, for detecting plant leaves and recognising possible leaf stress in experimental settings where few annotations are available during training. We build an annotated data set for this task and implement the Unbiased Teacher algorithm. We optimise the Unbiased Teacher algorithm and compare its performance to that of a baseline model. Finally, we investigate which hyperparameters of the Unbiased Teacher algorithm most significantly affect its performance and its ability to utilise unlabelled images. We find that the Unbiased Teacher algorithm outperforms the baseline model in the experimental settings when limited annotated data are available during training. Amongst the hyperparameters we consider, we identify the confidence threshold as having the most effect on the algorithm’s performance and ability to leverage unlabelled data. Ultimately, we demonstrate the feasibility of improving object detection performance with the Unbiased Teacher algorithm in plant leaf detection and possible stress recognition when few annotations are available. The improved performance reduces the amount of annotated data required for this task, reducing annotation costs and thereby increasing usage for real-world tasks. En av huvudbegränsningarna med att träna djupinlärningsbaserade objektdetekteringsmodeller är tillgången på stora mängder annoterad data. Vid små mängder av tillgänglig data kan semi-övervakad inlärning erbjuda ett ramverk för att förbättra objektdetekteringsprestanda genom att använda icke-annoterad data. Detta är särskilt användbart vid detektering av växtblad och möjlig igenkänning av stressymptom hos bladen, där kostnaden för annotering av data är hög på grund av behovet av specialiserad kunskap inom området. Detta projekt syftar till att undersöka genomförbarheten av Opartiska Läraren (eng. ”Unbiased Teacher”), en semi-övervakad objektdetekteringsalgoritm, för att upptäcka växtblad och känna igen möjliga stressymptom hos blad i experimentella miljöer när endast en liten mängd annoterad data finns tillgänglig under träning. För att åstadkomma detta bygger vi ett annoterat dataset och implementerar Opartiska Läraren. Vi optimerar Opartiska Läraren och jämför dess prestanda med en baslinjemodell. Slutligen undersöker vi de hyperparametrar som mest påverkar Opartiska Lärarens prestanda och dess förmåga att använda icke-annoterade bilder. Vi finner att Opartiska Läraren överträffar baslinjemodellen i de experimentella inställningarna när det finns en begränsad mängd annoterad data under träningen. Bland hyperparametrarna vi överväger identifierar vi konfidensgränsen som har störst effekt på algoritmens prestanda och dess förmåga att utnyttja icke-annoterad data. Vi demonstrerar möjligheten att förbättra objektdetekteringsprestandan med Opartiska Läraren i växtbladsdetektering och möjlig stressigenkänning när få anteckningar finns tillgängliga. Den förbättrade prestandan minskar mängden annoterad data som krävs, vilket minskar anteckningskostnaderna och ökar därmed användbarheten för användning inom mer praktiska områden.
- Published
- 2022
20. Temporal Self-Ensembling Teacher for Semi-Supervised Object Detection
- Author
-
Shouyang Dong, Kunlin Cao, Yuanhao Guo, Cong Chen, Li Liu, and Ye Tian
- Subjects
FOS: Computer and information sciences ,Source code ,Computer science ,media_common.quotation_subject ,Computer Vision and Pattern Recognition (cs.CV) ,semi-supervised object detection ,Computer Science - Computer Vision and Pattern Recognition ,Machine learning ,computer.software_genre ,Set (abstract data type) ,Consistency (database systems) ,Moving average ,temporal self-ensembling ,Media Technology ,Electrical and Electronic Engineering ,media_common ,Contextual image classification ,business.industry ,Object detection ,Computer Science Applications ,Term (time) ,knowledge distillation ,Test set ,Signal Processing ,Artificial intelligence ,focal loss ,business ,deep convolutional neural networks ,computer - Abstract
This paper focuses on Semi-Supervised Object Detection (SSOD). Knowledge Distillation (KD) has been widely used for semi-supervised image classification. However, adapting these methods for SSOD has the following obstacles. (1) The teacher model serves a dual role as a teacher and a student, such that the teacher predictions on unlabeled images may be very close to those of student, which limits the upper-bound of the student. (2) The class imbalance issue in SSOD hinders an efficient knowledge transfer from teacher to student. To address these problems, we propose a novel method Temporal Self-Ensembling Teacher (TSE-T) for SSOD. Differently from previous KD based methods, we devise a temporally evolved teacher model. First, our teacher model ensembles its temporal predictions for unlabeled images under stochastic perturbations. Second, our teacher model ensembles its temporal model weights with the student model weights by an exponential moving average (EMA) which allows the teacher gradually learn from the student. These self-ensembling strategies increase data and model diversity, thus improving teacher predictions on unlabeled images. Finally, we use focal loss to formulate consistency regularization term to handle the data imbalance problem, which is a more efficient manner to utilize the useful information from unlabeled images than a simple hard-thresholding method which solely preserves confident predictions. Evaluated on the widely used VOC and COCO benchmarks, the mAP of our method has achieved 80.73% and 40.52% on the VOC2007 test set and the COCO2014 minval5k set respectively, which outperforms a strong fully-supervised detector by 2.37% and 1.49%. Furthermore, our method sets the new state-of-the-art in SSOD on VOC2007 test set which outperforms the baseline SSOD method by 1.44%. The source code of this work is publicly available at http://github.com/syangdong/tse-t., 13 papges, 4 figures, preprint for submission
- Published
- 2020
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.