Descriptor: "YOLOv8n" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"YOLOv8n"' showing total 200 results

Start Over Descriptor "YOLOv8n"

200 results on '"YOLOv8n"'

1. GAM-YOLOv8n: enhanced feature extraction and difficult example learning for site distribution box door status detection.

Author: Zhao, Song, Cai, TaiWei, Peng, Bao, Zhang, Teng, and Zhou, XiaoBing
Abstract: The detection of distribution box doors on construction sites is particularly important in site safety management, but the size and posture of distribution boxes vary in different scenarios, and there are still challenges. This article proposes an improved YOLOv8n construction site distribution box door status detection and recognition method. Firstly, Global Attention Mechanism is introduced to reduce information dispersion and enhance global interaction representation, preserving the correlation between spatial and channel information to strengthen the network's feature extraction capability during the detection process. Secondly, to tackle the problem of class imbalance in construction site distribution box door state detection, the Focal_EIoU detection box loss function is used to replace the CIoU loss function, optimizing the model's ability to learn from difficult samples.Lastly,the proposed method is evaluated on a dataset of distribution boxes with different shapes and sizes collected from various construction scenes. Experimental results demonstrate that the improved YOLOv8n algorithm achieves an average precision (mAP) of 82.1% at a speed of 66.7 frames per second, outperforming other classical object detection networks and the original network. This improved method provides an efficient and accurate solution for practical detection tasks in smart chemical sites, especially in enhancing feature extraction and processing difficult sample cases, which has made significant progress. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

2. 基于多重机制优化YOLOv8 的复杂环境下安全帽检测方法.

Author: 肖振久, 严肃, and 曲海成
Abstract: Copyright of Journal of Computer Engineering & Applications is the property of Beijing Journal of Computer Engineering & Applications Journal Co Ltd. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

3. CAMLLA-YOLOv8n: Cow Behavior Recognition Based on Improved YOLOv8n.

Author: Jia, Qingxiang, Yang, Jucheng, Han, Shujie, Du, Zihan, and Liu, Jianzheng
Subjects: *ANIMAL culture, *DATA augmentation, *COMPUTER vision, *DAIRY cattle, *DEEP learning, *ESTRUS, *MILK quality
Abstract: Simple Summary: The daily behaviors of Holstein cows, such as standing, grazing, and lying, as well as abnormal behaviors such as estrus, licking, and fighting, are closely related to their physiological health. Accurately identifying these behaviors is of great significance for monitoring the health of dairy cows. For instance, hoof disease generally causes dairy cows to lie down more, while cows in estrus exhibit mounting behavior. This study employs deep learning technology based on computer vision to detect dairy cow behavior. The experimental results demonstrate that this method effectively meets the need for the accurate and rapid identification of Holstein cow behavior in real agricultural environments, which is crucial for improving the economic benefits of farms. Cow behavior carries important health information. The timely and accurate detection of standing, grazing, lying, estrus, licking, fighting, and other behaviors is crucial for individual cow monitoring and understanding of their health status. In this study, a model called CAMLLA-YOLOv8n is proposed for Holstein cow behavior recognition. We use a hybrid data augmentation method to provide the model with rich Holstein cow behavior features and improve the YOLOV8n model to optimize the Holstein cow behavior detection results under challenging conditions. Specifically, we integrate the Coordinate Attention mechanism into the C2f module to form the C2f-CA module, which strengthens the expression of inter-channel feature information, enabling the model to more accurately identify and understand the spatial relationship between different Holstein cows' positions, thereby improving the sensitivity to key areas and the ability to filter background interference. Secondly, the MLLAttention mechanism is introduced in the P3, P4, and P5 layers of the Neck part of the model to better cope with the challenges of Holstein cow behavior recognition caused by large-scale changes. In addition, we also innovatively improve the SPPF module to form the SPPF-GPE module, which optimizes small target recognition by combining global average pooling and global maximum pooling processing and enhances the model's ability to capture the key parts of Holstein cow behavior in the environment. Given the limitations of traditional IoU loss in cow behavior detection, we replace CIoU loss with Shape–IoU loss, focusing on the shape and scale features of the Bounding Box, thereby improving the matching degree between the Prediction Box and the Ground Truth Box. In order to verify the effectiveness of the proposed CAMLLA-YOLOv8n algorithm, we conducted experiments on a self-constructed dataset containing 23,073 Holstein cow behavior instances. The experimental results show that, compared with models such as YOLOv3-tiny, YOLOv5n, YOLOv5s, YOLOv7-tiny, YOLOv8n, and YOLOv8s, the improved CAMLLA-YOLOv8n model achieved increases in Precision of 8.79%, 7.16%, 6.06%, 2.86%, 2.18%, and 2.69%, respectively, when detecting the states of Holstein cows grazing, standing, lying, licking, estrus, fighting, and empty bedding. Finally, although the Params and FLOPs of the CAMLLA-YOLOv8n model increased slightly compared with the YOLOv8n model, it achieved significant improvements of 2.18%, 1.62%, 1.84%, and 1.77% in the four key performance indicators of Precision, Recall, mAP@0.5, and mAP@0.5:0.95, respectively. This model, named CAMLLA-YOLOv8n, effectively meets the need for the accurate and rapid identification of Holstein cow behavior in actual agricultural environments. This research is significant for improving the economic benefits of farms and promoting the transformation of animal husbandry towards digitalization and intelligence. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

4. Identification of water-cooled wall ash accumulation based on AWGAM-YOLOv8n.

Author: Hao, Yongxing, Wang, Bin, Hao, Yilong, and Cao, Angang
Subjects: *FEATURE extraction, *IMAGE fusion, *IMAGE intensifiers, *LEARNING ability, *INCINERATORS
Abstract: Identifying the ash accumulation generated on the water-cooled walls of the waste incinerator is essential for the cleanup by the robotic arm. This paper improves a new algorithm based on YOLOv8n, which can identify the ash accumulation position on the water-cooled wall quickly and accurately. Firstly, the multi-scale fusion image enhancement algorithm is used to improve the sharpness and contrast of the image and enrich the details of the image. Secondly, the backbone feature extraction network of YOLOv8n is replaced by Mobilenetv3 network, which reduces the parameters in the model greatly. Finally, this paper improves a new attention mechanism AWGAM (Add Weight Global Attention Mechanism) based on GAM (Global Attention Mechanism), which can better integrate the feature information between different dimensions and improve the learning ability of the model. AWGAM is added to the backbone of the model. The experimental results show that compared with the original YOLOv8n model, the improved YOLOv8n model has 59.9% fewer parameters, 4.4% higher precision, 8.8% higher recall, 3.2% higher mAP50 (mean Average Precision) and 8.8% higher mAP50-95. This model has made remarkable progress on the basis of the original algorithm, and has strong competitiveness compared with other advanced target detection models. The lightweight and high accuracy of ash accumulation detection offered by the proposed model presents promising applications in ash accumulation detection tasks of water-cooled walls. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

5. Resilient machine learning for steel surface defect detection based on lightweight convolution.

Author: Liu, Li-Juan, Zhang, Yu, and Karimi, Hamid Reza
Subjects: *SURFACE defects, *STRUCTURAL engineering, *METAL defects, *RELIABILITY in engineering, *METAL detectors
Abstract: Steel, as a crucial material extensively used in various fields, has a critical impact on the determination of the stability and reliability of engineering structures. Nevertheless, because of inevitable factors in manufacturing, transportation, and other processes, steel may exhibit various surface defects during production and handling. To address these defects, the investigation puts forward a resilient machine-learning method for steel surface defect detection based on lightweight convolution. First, to reduce redundant features, complexity, and computational cost, the Spatial and Channel Reconstruction Convolution (ScConv) module is added before the Spatial Pyramid Pooling-Fast (SPPF) within the YOLOv8n's backbone network. Second, in the Neck layer, lightweight convolution GSConv is used to replace the convolutional modules, and the efficient cross-stage partial network (CSP) module, VoV-GSCSP is substituted for the C2f module to alleviate the model burden while maintaining accuracy. Then, to focus on important information related to the current task, the Coordinate Attention module is added to the Neck layer. Finally, the activation function of YOLOv8n has been swapped for the Leaky Rectified Linear Unit (LeakyReLU) to effectively address issues such as gradient vanishing and overfitting. The method achieved a mean Average Precision (mAP) of 77.7% on the NEU-DET dataset, which is an improvement of 4.7% over the original YOLOv8n. Additionally, the frames per second (FPS) reached 17.36 f/s, representing a 5.79 f/s increase compared to the original YOLOv8n. On the GC10-DET dataset, mAP improves by 5.5%, with a FPS of 15.63 f/s. A plethora of experimentation on both datasets illustrates the method's robustness, meeting the precision criteria for detecting metal defects. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

6. YOLOv8n-CSD: A Lightweight Detection Method for Nectarines in Complex Environments.

Author: Zhang, Guohai, Yang, Xiaohui, Lv, Danyang, Zhao, Yuqian, and Liu, Peng
Subjects: *NECTARINE, *IMAGE recognition (Computer vision), *DEEP learning, *FRUIT, *ROBOTS
Abstract: At present, the picking of nectarines mainly relies on manual completion in China, and the process involves high labor intensity during picking and low picking efficiency. Therefore, it is necessary to introduce automated picking. To improve the accuracy of nectarine fruit recognition in complex environments and to increase the efficiency of automatic orchard-picking robots, a lightweight nectarine detection method, YOLOv8n-CSD, is proposed in this study. This model improves on YOLOv8n by first proposing a new structure, C2f-PC, to replace the C2f structure used in the original network, thus reducing the number of model parameters. Second, the SEAM is introduced to improve the model's recognition of the occluded part. Finally, to realize real-time detection of nectarine fruits, the DySample Lightweight Dynamic Upsampling Module is introduced to save computational resources while effectively enhancing the model's anti-interference ability. With a compact size of 4.7 MB, this model achieves 95.1% precision, 84.9% recall, and a mAP@0.5 of 93.2%—the model's volume has been reduced while the evaluation metrics have all been improved over the baseline model. The study shows that the YOLOv8n-CSD model outperforms the current mainstream target detection models, and can recognize nectarines in different environments faster and more accurately, which lays the foundation for the field application of automatic picking technology. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

7. 基于改进YOLOv8n 的织物疵点检测.

Author: 李耀, 徐红伟, 柯海森, 郭殿鹏, and 李孝禄
Subjects: TEXTILE patterns, TEXTILE industry, TEXTILES, SPEED, ALGORITHMS
Abstract: Copyright of Cotton Textile Technology is the property of Cotton Textile Technology Editorial Office and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024

8. 基于改进YOLOv8n的轻量化分心驾驶检测算法.

Author: 朱玉华, 龚晓腾, and 吴宁
Subjects: DEEP learning, SPINE, DISTRACTION, ALGORITHMS, COST
Abstract: Copyright of Automotive Engineer (1674-6546) is the property of Auto Engineering Editorial Office and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

9. YOLOv8n-WSE-Pest: A Lightweight Deep Learning Model Based on YOLOv8n for Pest Identification in Tea Gardens.

Author: Li, Hongxu, Yuan, Wenxia, Xia, Yuxin, Wang, Zejun, He, Junjie, Wang, Qiaomei, Zhang, Shihao, Li, Limei, Yang, Fang, and Wang, Baijuan
Subjects: OBJECT recognition (Computer vision), PEST control, APPLIED sciences, AGRICULTURE, IMAGE recognition (Computer vision), TEA plantations
Abstract: China's Yunnan Province, known for its tea plantations, faces significant challenges in smart pest management due to its ecologically intricate environment. To enable the intelligent monitoring of pests within tea plantations, this study introduces a novel image recognition algorithm, designated as YOLOv8n-WSE-pest. Taking into account the pest image data collected from organic tea gardens in Yunnan, this study utilizes the YOLOv8n network as a foundation and optimizes the original loss function using WIoU-v3 to achieve dynamic gradient allocation and improve the prediction accuracy. The addition of the Spatial and Channel Reconstruction Convolution structure in the Backbone layer reduces redundant spatial and channel features, thereby reducing the model's complexity. The integration of the Efficient Multi-Scale Attention Module with Cross-Spatial Learning enables the model to have more flexible global attention. The research results demonstrate that compared to the original YOLOv8n model, the improved YOLOv8n-WSE-pest model shows increases in the precision, recall, mAP50, and F1 score by 3.12%, 5.65%, 2.18%, and 4.43%, respectively. In external validation, the mAP of the model outperforms other deep learning networks such as Faster-RCNN, SSD, and the original YOLOv8n, with improvements of 14.34%, 8.85%, and 2.18%, respectively. In summary, the intelligent tea garden pest identification model proposed in this study excels at precise the detection of key pests in tea plantations, enhancing the efficiency and accuracy of pest management through the application of advanced techniques in applied science. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

10. GMS-YOLO: an enhanced algorithm for water meter reading recognition in complex environments.

Author: Wang, Yu and Xiang, Xiaodong
Abstract: The disordered arrangement of water-meter pipes and the random rotation angles of their mechanical character wheels frequently result in captured water-meter images exhibiting tilt, blur, and incomplete characters. These issues complicate the detection of water-meter images, rendering traditional OCR (optical character recognition) methods inadequate for current detection requirements. Furthermore, the two-stage detection method, which involves first locating and then recognizing, proves overly cumbersome. In this paper, water-meter reading recognition is approached as an object-detection task, extracting readings using the algorithm’s Predicted Box information, establishing a water-meter dataset, and refining the algorithmic framework to improve the accuracy of recognizing incomplete characters. Utilizing YOLOv8n as the baseline, we propose GMS-YOLO, a novel object-detection algorithm that employs Grouped Multi-Scale Convolution for enhanced performance. First, by substituting the Bottleneck module’s convolution with GMSC (Grouped Multi-Scale Convolution), the model can access various scale receptive fields, thus boosting its feature-extraction prowess. Second, incorporating LSKA (Large Kernel Separable Attention) into the SPPF (Spatial Pyramid Pooling Fast) module improves the perception of fine-grained features. Finally, replacing CIoU (Generalized Intersection over Union) with the ShapeIoU bounding box loss function enhances the model’s ability to localize objects and speeds up its convergence. Evaluating a self-compiled water-meter image dataset, GMS-YOLO attained a mAP@0.5 of 92.4% and a precision of 93.2%, marking a 2.0% and 2.1% enhancement over YOLOv8n, respectively. Despite the increased computational burden, GMS-YOLO maintains an average detection time of 10 ms per image, meeting practical detection needs. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

11. Improved YOLOv8n for Lightweight Ship Detection.

Author: Gao, Zhiguang, Yu, Xiaoyan, Rong, Xianwei, and Wang, Wenqi
Subjects: CONVOLUTIONAL neural networks, TRANSPORTATION management, MARITIME management, SHIP models, FEATURE extraction
Abstract: Automatic ship detection is a crucial task within the domain of maritime transportation management. With the progressive success of convolutional neural networks (CNNs), a number of advanced CNN models have been presented in order to detect ships. Although these detection models have achieved marked performance, several undesired results may occur under complex maritime conditions, such as missed detections, false positives, and low detection accuracy. Moreover, the existing detection models endure large number of parameters and heavy computation cost. To deal with these problems, we suggest a lightweight ship model of detection called DSSM–LightNet based upon the improved YOLOv8n. First, we introduce a lightweight Dual Convolutional (DualConv) into the model to lower both the number of parameters and the computational complexity. The principle is that DualConv combines two types of convolution kernels, 3x3 and 1x1, and utilizes group convolution techniques to effectively reduce computational costs while processing the same input feature map channels. Second, we propose a Slim-neck structure in the neck network, which introduces GSConv and VoVGSCSP modules to construct an efficient feature-fusion layer. This fusion strategy helps the model better capture the features of targets of different sizes. Meanwhile, a spatially enhanced attention module (SEAM) is leveraged to integrate with a Feature Pyramid Network (FPN) and the Slim-neck to achieve simple yet effective feature extraction, minimizing information loss during feature fusion. CIoU may not accurately reflect the relative positional relationship between bounding boxes in some complex scenarios. In contrast, MPDIoU can provide more accurate positional information in bounding-box regression by directly minimizing point distance and considering comprehensive loss. Therefore, we utilize the minimum point distance IoU (MPDIoU) rather than the Complete Intersection over Union (CIoU) Loss to further enhance the detection precision of the suggested model. Comprehensive tests carried out on the publicly accessible SeaShips dataset have demonstrated that our model greatly exceeds other algorithms in relation to their detection accuracy and efficiency, while reserving its lightweight nature. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

12. Lightweight Sewer Pipe Crack Detection Method Based on Amphibious Robot and Improved YOLOv8n.

Author: Lv, Zhenming, Dong, Shaojiang, He, Jingyao, Hu, Bo, Liu, Qingyi, and Wang, Honghang
Subjects: *SEWER pipes, *FEATURE extraction, *SEWAGE, *ROBOTS, *COST effectiveness
Abstract: Aiming at the problem of difficult crack detection in underground urban sewage pipelines, a lightweight sewage pipeline crack detection method based on sewage pipeline robots and improved YOLOv8n is proposed. The method uses pipeline robots as the equipment carrier to move rapidly and collect high-definition data of apparent diseases in sewage pipelines with both water and sludge media. The lightweight RGCSPELAN module is introduced to reduce the number of parameters while ensuring the detection performance. First, we replaced the lightweight detection head Detect_LADH to reduce the number of parameters and improve the feature extraction of modeled cracks. Finally, we added the LSKA module to the SPPF module to improve the robustness of YOLOv8n. Compared with YOLOv5n, YOLOv6n, YOLOv8n, RT-DETRr18, YOLOv9t, and YOLOv10n, the improved YOLOv8n has a smaller number of parameters of only 1.6 M. The FPS index reaches 261, which is good for real-time detection, and at the same time, the model also has a good detection accuracy. The validation of sewage pipe crack detection through real scenarios proves the feasibility of the proposed method, which has good results in targeting both small and long cracks. It shows potential in improving the safety maintenance, detection efficiency, and cost-effectiveness of urban sewage pipes. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

13. Dress Code Monitoring Method in Industrial Scene Based on Improved YOLOv8n and DeepSORT.

Author: Zou, Jiadong, Song, Tao, Cao, Songxiao, Zhou, Bin, and Jiang, Qing
Subjects: *DRESS codes, *FALSE alarms, *DETECTION alarms, *NECK, *HATS
Abstract: Deep learning-based object detection has become a powerful tool in dress code monitoring. However, even state-of-the-art detection models inevitably suffer from false alarms or missed detections, especially when handling small targets such as hats and masks. To overcome these limitations, this paper proposes a novel method for dress code monitoring using an improved YOLOv8n model, the DeepSORT tracking, and a new dress code judgment criterion. We improve the YOLOv8n model through three means: (1) a new neck structure named FPN-PAN-FPN (FPF) is introduced to enhance the model's feature fusion capability, (2) Receptive-Field Attention convolutional operation (RFAConv) is utilized to better capture the difference in information brought by different positions, and a (3) Focused Linear Attention (FLatten) mechanism is added to expand the model's receptive field. This improved YOLOv8n model increases mAP while reducing model size. Next, DeepSORT is integrated to obtain instance information across multi-frames. Finally, we adopt a new judgment criterion to conduct real-scene dress code monitoring. The experimental results show that our method effectively identifies instances of dress violations, reduces false alarms, and improves accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

14. YOLOv8-E: An Improved YOLOv8 Algorithm for Eggplant Disease Detection.

Author: Huang, Yuxi, Zhao, Hong, and Wang, Jie
Subjects: OBJECT recognition (Computer vision), CROP yields, COMPUTATIONAL complexity, ALGORITHMS, DEEP learning, NECK, EGGPLANT
Abstract: During the developmental stages, eggplants are susceptible to diseases, which can impact crop yields and farmers' economic returns. Therefore, timely and effective detection of eggplant diseases is crucial. Deep learning-based object detection algorithms can automatically extract features from images of eggplants affected by diseases. However, eggplant disease images captured in complex farmland environments present challenges such as varying disease sizes, occlusion, overlap, and small target detection, making it difficult for existing deep-learning models to achieve satisfactory detection performance. To address this challenge, this study proposed an optimized eggplant disease detection algorithm, YOLOv8-E, based on You Only Look Once version 8 nano (YOLOv8n). Firstly, we integrate switchable atrous convolution (SAConv) into the C2f module to design the C2f_SAConv module, replacing some of the C2f modules in the backbone network of YOLOv8n, enabling our proposed algorithm to better extract eggplant disease features. Secondly, to facilitate the deployment of the detection model on mobile devices, we reconstruct the Neck network of YOLOv8n using the SlimNeck module, making the model lighter. Additionally, to tackle the issue of missing small targets, we embed the large separable kernel attention (LSKA) module within SlimNeck, enhancing the model's attention to fine-grained information. Lastly, we combined intersection over union with auxiliary bounding box (Inner-IoU) and minimum point distance intersection over union (MPDIoU), introducing the Inner-MPDIoU loss to speed up convergence of the model and raise detection precision of overlapped and occluded targets. Ablation studies demonstrated that, compared to YOLOv8n, the mean average precision (mAP) and F1 score of YOLOv8-E reached 79.4% and 75.7%, respectively, which obtained a 5.5% increment and a 4.5% increase, while also reducing the model size and computational complexity. Furthermore, YOLOv8-E achieved higher detection performance than other mainstream algorithms. YOLOv8-E exhibits significant potential for practical application in eggplant disease detection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

15. YOLO-BGS Optimizes Textile Production Processes: Enhancing YOLOv8n with Bi-Directional Feature Pyramid Network and Global and Shuffle Attention Mechanisms for Efficient Fabric Defect Detection.

Author: Lu, Gege, Xiong, Tian, and Wu, Gaihong
Abstract: Timely detection of fabric defects is crucial for improving fabric quality and reducing production losses for companies. Traditional methods for detecting fabric defects face several challenges, including low detection efficiency, poor accuracy, and limited types of detectable defects. To address these issues, this paper chose the YOLOv8n model for continuous iteration enhancement in order to improve its detection performance. First, multiscale feature fusion was realized by the Bi-directional Feature Pyramid Network (BiFPN). Second, the Shuffle Attention Mechanism (SA) is introduced to optimize feature classification. Finally, the Global Attention Mechanism (GAM) was used to improve global detection accuracy. Empirical findings demonstrated the improved model's efficacy, attaining a test set mean average precision (mAP) value of 96.6%, which is an improvement of 3.6% compared to the original YOLOv8n. This validates that YOLO-BGS excels in detecting textile defects. It effectively locates these defects, minimizes resource waste, and fosters sustainable production practices. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

16. Detection of coal gangue based on MSRCR algorithm and improved lightweight YOLOv8n.

Author: Hong, Yan, Pan, Ruixian, Su, Jingming, and Pang, Rong
Subjects: *IMAGE recognition (Computer vision), *IMAGE intensifiers, *COAL, *RECOGNITION (Psychology), *ALGORITHMS, *HISTOGRAMS
Abstract: Traditional methods for coal gangue sorting exhibit low efficiency, significant safety hazards, and limited applicability. Existing machine vision-based coal gangue image recognition methods struggle to balance model recognition speed and accuracy. In response to these challenges, this paper first utilizes the improved MSRCR algorithm to process images, enhancing the dark areas of coal gangue images while ensuring uniform brightness enhancement and image clarity. Furthermore, a novel lightweight coal gangue recognition method is proposed based on YOLOv8n, aiming to reduce data redundancy and improve recognition accuracy. Experimental results demonstrate that the improved lightweight model has a computational load of 7.1 GFLOPs, representing only 86.6% of the original model. The model detection rate is 73 fps, a 17 fps improvement over the original model. The accuracy, recall rate, and average precision reach 98.1%, 97.6%, and 98.9%, respectively, with improvements of 1.1, 0.9, and 0.5% points over the original model. The missing detection phenomenon is avoided effectively, and the accuracy and portability of the model are improved. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

17. ESFD-YOLOv8n: Early Smoke and Fire Detection Method Based on an Improved YOLOv8n Model.

Author: Mamadaliev, Dilshodjon, Touko, Philippe Lyonel Mbouembe, Kim, Jae-Ho, and Kim, Suk-Chan
Subjects: *OBJECT recognition (Computer vision), *ARTIFICIAL intelligence, *FEATURE extraction, *FIRE prevention, *SMOKE, *FIRE detectors, *DEEP learning
Abstract: Ensuring fire safety is essential to protect life and property, but modern infrastructure and complex settings require advanced fire detection methods. Traditional object detection systems, often reliant on manual feature extraction, may fall short, and while deep learning approaches are powerful, they can be computationally intensive, especially for real-time applications. This paper proposes a novel smoke and fire detection method based on the YOLOv8n model with several key architectural modifications. The standard Complete-IoU (CIoU) box loss function is replaced with the more robust Wise-IoU version 3 (WIoUv3), enhancing predictions through its attention mechanism and dynamic focusing. The model is streamlined by replacing the C2f module with a residual block, enabling targeted feature extraction, accelerating training and inference, and reducing overfitting. Integrating generalized efficient layer aggregation network (GELAN) blocks with C2f modules in the neck of the YOLOv8n model further enhances smoke and fire detection, optimizing gradient paths for efficient learning and high performance. Transfer learning is also applied to enhance robustness. Experiments confirmed the excellent performance of ESFD-YOLOv8n, outperforming the original YOLOv8n by 2%, 2.3%, and 2.7%, with a mean average precision (mAP@0.5) of 79.4%, precision of 80.1%, and recall of 72.7%. Despite its increased complexity, the model outperforms several state-of-the-art algorithms and meets the requirements for real-time fire and smoke detection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

18. Research on Lightweight Rice False Smut Disease Identification Method Based on Improved YOLOv8n Model.

Author: Yang, Lulu, Guo, Fuxu, Zhang, Hongze, Cao, Yingli, and Feng, Shuai
Subjects: *RICE blast disease, *IMAGE fusion, *DIGITAL image processing, *FEATURE extraction, *FOOD security, *PYRAMIDS
Abstract: In order to detect rice false smut quickly and accurately, a lightweight false smut detection model, YOLOv8n-MBS, was proposed in this study. The model introduces the C2f_MSEC module to replace C2f in the backbone network for better extraction of key features of false smut, enhances the feature fusion capability of the neck network for different sizes of false smut by using a weighted bidirectional feature pyramid network, and designs a group-normalized shared convolution lightweight detection head to reduce the number of parameters in the model head to achieve model lightweight. The experimental results show that YOLOv8n-MBS has an average accuracy of 93.9%, a parameter count of 1.4 M, and a model size of 3.3 MB. Compared with the SSD model, the average accuracy of the model in this study increased by 4%, the number of parameters decreased by 89.8%, and the model size decreased by 86.9%; compared with the YOLO series of YOLOv7-tiny, YOLOv5n, YOLOv5s, and YOLOv8n models, the YOLOv8n-MBS model showed outstanding performance in terms of model accuracy and model performance detection; compared to the latest YOLOv9t and YOLOv10n models, the average model accuracy increased by 2.8% and 2.2%, the number of model parameters decreased by 30% and 39.1%, and the model size decreased by 29.8% and 43.1%, respectively. This method enables more accurate and lighter-weight detection of false smut, which provides the basis for intelligent management of rice blast disease in the field and thus promotes food security. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

19. Infrared Image Object Detection Algorithm for Substation Equipment Based on Improved YOLOv8.

Author: Xiang, Siyu, Chang, Zhengwei, Liu, Xueyuan, Luo, Lei, Mao, Yang, Du, Xiying, Li, Bing, and Zhao, Zhenbing
Subjects: *OBJECT recognition (Computer vision), *INFRARED imaging, *FAULT diagnosis, *INFRARED equipment, *MULTISCALE modeling
Abstract: Substations play a crucial role in the proper operation of power systems. Online fault diagnosis of substation equipment is critical for improving the safety and intelligence of power systems. Detecting the target equipment from an infrared image of substation equipment constitutes a pivotal step in online fault diagnosis. To address the challenges of missed detection, false detection, and low detection accuracy in the infrared image object detection in substation equipment, this paper proposes an infrared image object detection algorithm for substation equipment based on an improved YOLOv8n. Firstly, the DCNC2f module is built by combining deformable convolution with the C2f module, and the C2f module in the backbone is replaced by the DCNC2f module to enhance the ability of the model to extract relevant equipment features. Subsequently, the multi-scale convolutional attention module is introduced to improve the ability of the model to capture multi-scale information and enhance detection accuracy. The experimental results on the infrared image dataset of the substation equipment demonstrate that the improved YOLOv8n model achieves mAP@0.5 and mAP@0.5:0.95 of 92.7% and 68.5%, respectively, representing a 2.6% and 3.9% improvement over the baseline model. The improved model significantly enhances object detection accuracy and exhibits superior performance in infrared image object detection in substation equipment. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

20. 基于改进 YOLOv8n 的井下人员安全帽佩戴检测.

Author: 王琦, 夏鲁飞, 陈天明, 韩鸿胤, and 王亮
Abstract: Copyright of Journal of Mine Automation is the property of Industry & Mine Automation Editorial Department and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

21. PAL-YOLOv8: A Lightweight Algorithm for Insulator Defect Detection.

Author: Zhang, Du, Cao, Kerang, Han, Kai, Kim, Changsu, and Jung, Hoekyung
Subjects: ALGORITHMS, NECK
Abstract: To address the challenges of high model complexity and low accuracy in detecting small targets in insulator defect detection using UAV aerial imagery, we propose a lightweight algorithm, PAL-YOLOv8. Firstly, the baseline model, YOLOv8n, is enhanced by incorporating the PKI Block from PKINet to improve the C2f module, effectively reducing the model complexity and enhancing feature extraction capabilities. Secondly, Adown from YOLOv9 is employed in the backbone and neck for downsampling, which retains more feature information while reducing the feature map size, thus improving the detection accuracy. Additionally, Focaler-SIoU is used as the bounding-box regression loss function to improve model performance by focusing on different regression samples. Finally, pruning is applied to the improved model to further reduce its size. The experimental results show that PAL-YOLOv8 achieves an mAP50 of 95.0%, which represents increases of 5.5% and 2.6% over YOLOv8n and YOLOv9t, respectively. Furthermore, GFLOPs is only 3.9, the model size is just 2.7 MB, and the parameter count is only 1.24 × 106. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

22. An Improved Lightweight YOLOv8 Network for Early Small Flame Target Detection.

Author: Du, Hubin, Li, Qiuyu, Guan, Ziqian, Zhang, Hengyuan, and Liu, Yongtao
Subjects: FLAME, REGRESSION analysis, ALGORITHMS, WOUNDS & injuries
Abstract: The efficacy of early fire detection hinges on its swift response and precision, which allows for the issuance of timely alerts in the nascent stages of a fire, thereby minimizing losses and injuries. To enhance the precision and swiftness of identifying minute early flame targets, as well as the ease of deployment at the edge end, an optimized early flame target detection algorithm for YOLOv8 is proposed. The original feature fusion module, an FPN (feature pyramid network) of YOLOv8n, has been enhanced to become the BiFPN (bidirectional feature pyramid network) module. This modification enables the network to more efficiently and rapidly perform multi-scale fusion, thereby enhancing its capacity for integrating features across different scales. Secondly, the efficient multi-scale attention (EMA) mechanism is introduced to ensure the effective retention of information on each channel and reduce the computational overhead, thereby improving the model's detection accuracy while reducing the number of model parameters. Subsequently, the NWD (normalized Wasserstein distance) loss function is employed as the bounding box loss function, which enhances the model's regression performance and robustness. The experimental results demonstrate that the size of the enhanced model is 4.8 M, a reduction of 22.5% compared to the original YOLOv8n. Additionally, the mAP0.5 metric exhibits a 2.7% improvement over the original YOLOv8n, indicating a more robust detection capability and a more compact model size. This makes it an ideal candidate for deployment in edge devices. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

23. PRE-YOLO: A Lightweight Model for Detecting Helmet-Wearing of Electric Vehicle Riders on Complex Traffic Roads.

Author: Yang, Xiang, Wang, Zhen, and Dong, Minggang
Subjects: TRAFFIC accidents, ELECTRICAL injuries, FEATURE extraction, ELECTRIC vehicles, BOOSTING algorithms, HELMETS
Abstract: Electric vehicle accidents on the road occur frequently, and head injuries are often the cause of serious casualties. However, most electric vehicle riders seldom wear helmets. Therefore, combining target detection algorithms with road cameras to intelligently monitor helmet-wearing has extremely important research significance. Therefore, a helmet-wearing detection algorithm based on the improved YOLOv8n model, PRE-YOLO, is proposed. First, we add small target detection layers and prune large target detection layers. The sophisticated algorithm considerably boosts the effectiveness of data manipulation while significantly reducing model parameters and size. Secondly, we introduce a convolutional module that integrates receptive field attention convolution and CA mechanisms into the backbone network, enhancing feature extraction capabilities by enhancing attention weights within both channel and spatial aspects. Lastly, we incorporate an EMA mechanism into the C2f module, which strengthens feature perception and captures more characteristic information while maintaining the same model parameter size. The experimental outcomes indicate that in comparison to the original model, the proposed PRE-YOLO model in this paper has improved by 1.3%, 1.7%, 2.2%, and 2.6% in terms of precision P, recall R, mAP@0.5, and mAP@0.5:0.95, respectively. At the same time, the number of model parameters has been reduced by 33.3%, and the model size has been reduced by 1.8 MB. Generalization experiments are conducted on the TWHD and EBHD datasets to further verify the versatility of the model. The research findings provide solutions for further improving the accuracy and efficiency of helmet-wearing detection on complex traffic roads, offering references for enhancing safety and intelligence in traffic. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

24. YOLOv8n-Enhanced PCB Defect Detection: A Lightweight Method Integrating Spatial–Channel Reconstruction and Adaptive Feature Selection.

Author: An, Jiayang and Shi, Zhichao
Subjects: FEATURE selection, PRINTED circuits, COMPUTATIONAL complexity, GENERALIZATION, ALGORITHMS, PRINTED circuit design
Abstract: In response to the challenges of small-size defects and low recognition rates in Printed Circuit Boards (PCBs), as well as the need for lightweight detection models that can be embedded in portable devices, this paper proposes an improved defect detection method based on a lightweight shared convolutional head using YOLOv8n. Firstly, the Spatial and Channel reconstruction Convolution (SCConv) is embedded into the Cross Stage Partial with Convolutional Layer Fusion (C2f) structure of the backbone network, which reduces redundant computations and enhances the model's learning capacity. Secondly, an adaptive feature selection module is integrated to improve the network's ability to recognize small targets. Subsequently, a Shared Lightweight Convolutional Detection (SLCD) Head replaces the original Decoupled Head, reducing the model's computational complexity while increasing detection accuracy. Finally, the Weighted Intersection over Union (WIoU) loss function is introduced to provide more precise evaluation results and improve generalization capability. Comparative experiments conducted on a public PCB dataset demonstrate that the improved algorithm achieves a mean Average Precision (mAP) of 98.6% and an accuracy of 99.8%, representing improvements of 3.8% and 3.1%, respectively, over the original model. The model size is 4.1 M, and its FPS is 144.1, meeting the requirements for real-time and lightweight portable deployment. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

25. Insulator defect detection algorithm based on improved YOLOv8 for electric power.

Author: Su, Jun, Yuan, Yongqi, Przystupa, Krzysztof, and Kochan, Orest
Abstract: Insulator defect detection plays a critical role in ensuring electrical equipment's safe and stable operation, meeting the public's demand for electricity consumption. However, extracting features of insulator defects poses challenges due to complex backgrounds, variations in target sizes leading to potential oversights, and low detection accuracy. We propose an improved YOLOv8n-based insulator defect detection model to achieve timely and precise real-time detection. Firstly, the TripletAttention Module is introduced to enhance the network's ability to extract insulator defect features and reduce background interference in detection. Secondly, SCConv (Spatial and Channel Reconstruction Convolution) is utilized to redesign the detection head, proposing a more lightweight SC-Detect to replace the original one, thereby restricting feature redundancy and enhancing feature representation capability. Finally, Slim-neck based on GSConv is employed to reconstruct the neck structure, enabling the network to achieve lightweight while possessing relatively stronger feature extraction and perceptual capabilities. Experimental results demonstrate that the improved insulator defect detection network achieves an accuracy of 96.1%, a recall rate of 94.8%, a mAP@0.5 of 97.2%, and a mAP@0.5 - 0.95 of 72%, representing increases of 1.5%, 4.2%, 2.5%, and 6%, respectively. Additionally, the parameter count decreases by 22%, and computational load reduces by 39%, thereby meeting the high-precision and real-time requirements for outdoor insulator defect detection tasks. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

26. FE-YOLO: A Lightweight Model for Construction Waste Detection Based on Improved YOLOv8 Model.

Author: Yang, Yizhong, Li, Yexue, and Tao, Maohu
Subjects: CONSTRUCTION & demolition debris, CONSTRUCTION management, LIGHTWEIGHT construction, WASTE management, COMPUTATIONAL complexity
Abstract: Construction waste detection under complex scenarios poses significant challenges due to low detection accuracy, high computational complexity, and large parameter volume in existing models. These challenges are critical as accurate and efficient detection is essential for effective waste management in the construction industry, which is increasingly focused on sustainability and resource optimization. This paper aims to address the low accuracy of detection, high computational complexity, and large parameter volume in the models of construction waste detection under complex scenarios. For this purpose, an improved YOLOv8-based algorithm called FE-YOLO is proposed in this paper. This algorithm replaces the C2f module in the backbone with the Faster_C2f module and integrates the ECA attention mechanism into the bottleneck layer. Also, a custom multi-class construction waste dataset is created for evaluation. FE-YOLO achieves an mAP@50 of 92.7% on this dataset, up by 3% compared to YOLOv8n. Meanwhile, the parameter count and floating-point operations are scaled down by 12% and 13%, respectively. Finally, a test is conducted on a publicly available construction waste dataset. The test results demonstrate the excellent performance of this algorithm in generalization and robustness. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

27. SN-CNN: A Lightweight and Accurate Line Extraction Algorithm for Seedling Navigation in Ridge-Planted Vegetables.

Author: Zhang, Tengfei, Zhou, Jinhao, Liu, Wei, Yue, Rencai, Shi, Jiawei, Zhou, Chunjian, and Hu, Jianping
Subjects: CONVOLUTIONAL neural networks, AGRICULTURAL robots, STANDARD deviations, AGRICULTURAL resources, AUTONOMOUS robots
Abstract: In precision agriculture, after vegetable transplanters plant the seedlings, field management during the seedling stage is necessary to optimize the vegetable yield. Accurately identifying and extracting the centerlines of crop rows during the seedling stage is crucial for achieving the autonomous navigation of robots. However, the transplanted ridges often experience missing seedling rows. Additionally, due to the limited computational resources of field agricultural robots, a more lightweight navigation line fitting algorithm is required. To address these issues, this study focuses on mid-to-high ridges planted with double-row vegetables and develops a seedling band-based navigation line extraction model, a Seedling Navigation Convolutional Neural Network (SN-CNN). Firstly, we proposed the C2f_UIB module, which effectively reduces redundant computations by integrating Network Architecture Search (NAS) technologies, thus improving the model's efficiency. Additionally, the model incorporates the Simplified Attention Mechanism (SimAM) in the neck section, enhancing the focus on hard-to-recognize samples. The experimental results demonstrate that the proposed SN-CNN model outperforms YOLOv5s, YOLOv7-tiny, YOLOv8n, and YOLOv8s in terms of the model parameters and accuracy. The SN-CNN model has a parameter count of only 2.37 M and achieves an mAP@0.5 of 94.6%. Compared to the baseline model, the parameter count is reduced by 28.4%, and the accuracy is improved by 2%. Finally, for practical deployment, the SN-CNN algorithm was implemented on the NVIDIA Jetson AGX Xavier, an embedded computing platform, to evaluate its real-time performance in navigation line fitting. We compared two fitting methods: Random Sample Consensus (RANSAC) and least squares (LS), using 100 images (50 test images and 50 field-collected images) to assess the accuracy and processing speed. The RANSAC method achieved a root mean square error (RMSE) of 5.7 pixels and a processing time of 25 milliseconds per image, demonstrating a superior fitting accuracy, while meeting the real-time requirements for navigation line detection. This performance highlights the potential of the SN-CNN model as an effective solution for autonomous navigation in field cross-ridge walking robots. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. An enhanced YOLOv8n object detector for synthetic diamond quality evaluation

Author: Shixiong Zhang, Ang Li, Jianxin Ren, and Xingchong Li
Subjects: Synthetic diamonds, Quality evaluation, YOLOv8n, ConvNeXtV2, Dynamic head, Medicine, Science
Abstract: Abstract To address the need for automated sorting of synthetic diamonds based on quality in manufacturing enterprises, this study developed a dedicated dataset and an enhanced YOLOv8n model for synthetic diamonds detection and quality evaluation, named YOLOv8n-adamas. We redesigned the backbone network to improve feature extraction capabilities and introduced a dynamic detection head based on attention mechanisms to further enhance model performance. Experimental results show that on synthetic diamonds dataset, YOLOv8n-adamas achieved a 4.0% improvement in precision (P), a 2.7% increase in recall (R), and improvements of 1.5% and 1.3% in mean average precisions at 50% and 95% Intersection over Union (IoU) thresholds (mAP50 and mAP95) compared to YOLOv8. Furthermore, YOLOv8n-adamas also outperforms other commonly used, high-performing models in various metrics on this dataset, offering effective technical support for the automated quality-based sorting of synthetic diamonds.
Published: 2024
Full Text: View/download PDF

29. Identification of water-cooled wall ash accumulation based on AWGAM-YOLOv8n

Author: Yongxing Hao, Bin Wang, Yilong Hao, and Angang Cao
Subjects: Identifying the ash accumulation, Mobilenetv3, YOLOv8n, AWGAM, Medicine, Science
Abstract: Abstract Identifying the ash accumulation generated on the water-cooled walls of the waste incinerator is essential for the cleanup by the robotic arm. This paper improves a new algorithm based on YOLOv8n, which can identify the ash accumulation position on the water-cooled wall quickly and accurately. Firstly, the multi-scale fusion image enhancement algorithm is used to improve the sharpness and contrast of the image and enrich the details of the image. Secondly, the backbone feature extraction network of YOLOv8n is replaced by Mobilenetv3 network, which reduces the parameters in the model greatly. Finally, this paper improves a new attention mechanism AWGAM (Add Weight Global Attention Mechanism) based on GAM (Global Attention Mechanism), which can better integrate the feature information between different dimensions and improve the learning ability of the model. AWGAM is added to the backbone of the model. The experimental results show that compared with the original YOLOv8n model, the improved YOLOv8n model has 59.9% fewer parameters, 4.4% higher precision, 8.8% higher recall, 3.2% higher mAP50 (mean Average Precision) and 8.8% higher mAP50-95. This model has made remarkable progress on the basis of the original algorithm, and has strong competitiveness compared with other advanced target detection models. The lightweight and high accuracy of ash accumulation detection offered by the proposed model presents promising applications in ash accumulation detection tasks of water-cooled walls.
Published: 2024
Full Text: View/download PDF

30. Helmet Net: An Improved YOLOv8 Algorithm for Helmet Wearing Detection.

Author: Deng, Li, Zhou, Jin, and Liu, Quanyi
Subjects: TRAFFIC safety, ROAD safety measures, DEEP learning, ALGORITHMS, HELMETS, CYCLISTS
Abstract: It is of profound significance to detect whether cyclists wear helmets to protect their personal safety and maintain road traffic safety. Due to the limitations of space, distance and cyclist movement, it is challenging to detect helmet-wearing accurately and quickly. In view of this, a novel You Only Look Once (YOLOv8) algorithm for helmet-wearing detection is suggested in this paper. Firstly, YOLOv8n with the best performance test results is selected as the baseline model from several advanced object detection algorithms. Secondly, improvement measures are taken for YOLOv8n. The squeeze-and-excitation networks (SENet) is integrated at the C2f of the neck to improve the network representation ability. Part of conv modules in the backbone is replaced with the lightweight convolution (LConv) to improve the computational efficiency and the generalization ability of the model. The loss function is changed with the Wise-IoU (WIoU), to enhance the overall performance of the model. Additionally, the reasoning method is replaced by the slicing aided hyper inference (SAHI), which aims to lower the rate of missed detections for smaller objects and strengthen the accuracy of their detection. Through the above improvement methods, a new helmet-wearing detection algorithm is formed, called helmet net. Furthermore, in comparison to the YOLOv8n, the proposed algorism demonstrated an increase in precision, recall, and mean average precision (mAP) by 5, 7.3, and 6.4%, respectively, for helmet-wearing detection. At the same time, the speed reaches 111.1 fps, which can contribute to the real-time detection of helmet-wearing. After adding SAHI, the detection results show that the model can detect more small objects, which further enhances the competence of model for helmet-wearing detection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

31. Small object detection method for mining face based on improved YOLOv8n

Author: XUE Xiaoyong, HE Xinyu, YAO Chaoxiu, JIANG Ze, and PAN Hongguang
Subjects: mining face, small object detection, yolov8n, safety protection equipment testing, multi scale object recognition, Mining engineering. Metallurgy, TN1-997
Abstract: In order to effectively detect and recognize whether the personnel on the mining face in coal mines are wearing safety protection devices, a small object detection method based on improved YOLOv8n is proposed. It is applied in situations such as poor underground lighting conditions, small object sizes of safety protection device, and similar colors to the background. The method integrates Dynamic Snake Convolution (DSConv) into the C2f module of YOLOv8n backbone network to construct a C2f DSConv module, in order to enhance the model's capability to extract multi-scale features. The method introduces polarized self-attention (PSA) mechanism in the Neck layer to reduce information loss and improve feature expression capability. The method adds one detection head specifically designed for small objects at the Head layer, forming a four detection head structure to expand the detection range of the model. The experimental results show that the improved YOLOv8n model has an average precision of 98.3%, 95.8%, 89.9%, 87.2%, and 90.8% for detecting underground personnel and their safety helmets, mining lights, masks, and self rescue devices, respectively. The average precision is 92.4%, which is better than Faster R-CNN, YOLOv5s, YOLOv7, and YOLOv8n models. The detection speed reaches 208 frames per second, meeting the requirements of object detection precision and real-time performance in coal mines.
Published: 2024
Full Text: View/download PDF

32. GEB-YOLO: a novel algorithm for enhanced and efficient detection of foreign objects in power transmission lines

Author: Jiangpeng Zheng, Hao Liu, Qiuting He, and Jinfu Hu
Subjects: Foreign objects detection, GhostConv, Feature fusion, BiFPN, YOLOv8n, Medicine, Science
Abstract: Abstract Detecting foreign objects in power transmission lines is essential for mitigating safety risks and maintaining line stability. Practical detection, however, presents challenges including varied target sizes, intricate backgrounds, and large model weights. To address these issues, this study introduces an innovative GEB-YOLO model, which balances detection performance and quantification. Firstly, the algorithm features a lightweight architecture, achieved by merging the GhostConv network with the advanced YOLOv8 model. This integration considerably lowers computational demands and parameters through streamlined linear operations. Secondly, this paper proposes a novel EC2f mechanism, a groundbreaking feature that bolsters the model’s information extraction capabilities. It enhances the relationship between weights and channels via one-dimensional convolution. Lastly, the BiFPN mechanism is employed to improve the model’s processing efficiency for targets of different sizes, utilizing bidirectional connections and swift feature fusion for normalization. Experimental results indicate the model’s superiority over existing models in precision and mAP, showing improvements of 3.7 and 6.8%, respectively. Crucially, the model’s parameters and FLOPs have been reduced by 10.0 and 7.4%, leading to a model that is both lighter and more efficient. These advancements offer invaluable insights for applying laser technology in detecting foreign objects, contributing significantly to both theory and practice.
Published: 2024
Full Text: View/download PDF

33. Complex Scene Occluded Object Detection with Fusion of Mixed Local Channel Attention and Multi-Detection Layer Anchor-Free Optimization

Author: Qinghua Su and Jianhong Mu
Subjects: autonomous driving, occluded object detection, mixed local channel attention, YOLOv8n, Soft-NMS, Technology (General), T1-995
Abstract: The field of object detection has widespread applicability in many areas. Despite the multitude of object detection methods that are already established, complex scenes with occlusions still prove challenging due to the loss of information and dynamic changes that reduce the distinguishable features between the target and its background, resulting in lower detection accuracy. Addressing the shortcomings in detecting obscured objects in complex scenes with existing models, a novel approach has been proposed on the YOLOv8n architecture. First, the enhancement begins with the addition of a small object detection head atop the YOLOv8n architecture to keenly detect and pinpoint small objects. Then, a blended mixed local channel attention mechanism is integrated within YOLOv8n, which leverages the visible segment features of the target to refine the feature extraction hampered by occlusion impacts. Subsequently, Soft-NMS is introduced to optimize the candidate bounding boxes, solving the issue of missed detection under overlapping similar targets. Lastly, using universal object detection evaluation metrics, a series of ablation experiments on public datasets (CityPersons) were conducted alongside comparison trials with other models, followed by testing on various datasets. The results showed an average precision (map@0.5) reaching 0.676, marking a 6.7% improvement over the official YOLOv8 under identical experimental conditions, a 7.9% increase compared to Gold-YOLO, and a 7.1% rise over RTDETR, also demonstrating commendable performance across other datasets. Although the computational load increased with the addition of detection layers, the frames per second (FPS) still reached 192, which meets the real-time requirements for the vast majority of scenarios. Such findings indicate that the refined method not only significantly enhances performance on occluded datasets but can also be transferred to other models to boost their performance capabilities.
Published: 2024
Full Text: View/download PDF

34. YOLOv8-CML: a lightweight target detection method for color-changing melon ripening in intelligent agriculture

Author: Guojun Chen, Yongjie Hou, Tao Cui, Huihui Li, Fengyang Shangguan, and Lei Cao
Subjects: Attention mechanisms, Color-changing melon dataset, Intelligent agriculture, Target detection, YOLOv8n, Medicine, Science
Abstract: Abstract Color-changing melon is an ornamental and edible fruit. Aiming at the problems of slow detection speed and high deployment cost for Color-changing melon in intelligent agriculture equipment, this study proposes a lightweight detection model YOLOv8-CML.Firstly, a lightweight Faster-Block is introduced to reduce the number of memory accesses while reducing redundant computation, and a lighter C2f structure is obtained. Then, the lightweight C2f module fusing EMA module is constructed in Backbone to collect multi-scale spatial information more efficiently and reduce the interference of complex background on the recognition effect. Next, the idea of shared parameters is utilized to redesign the detection head to simplify the model further. Finally, the α-IoU loss function is adopted better to measure the overlap between the predicted and real frames using the α hyperparameter, improving the recognition accuracy. The experimental results show that compared to the YOLOv8n model, the parametric and computational ratios of the improved YOLOv8-CML model decreased by 42.9% and 51.8%, respectively. In addition, the model size is only 3.7 MB, and the inference speed is improved by 6.9%, while mAP@0.5, accuracy, and FPS are also improved. Our proposed model provides a vital reference for deploying Color-changing melon picking robots.
Published: 2024
Full Text: View/download PDF

35. Field cabbage detection and positioning system based on improved YOLOv8n

Author: Ping Jiang, Aolin Qi, Jiao Zhong, Yahui Luo, Wenwu Hu, Yixin Shi, and Tianyu Liu
Subjects: Cabbage, Object detection, YOLOv8n, Swin transformer, Large kernel convolutions, Plant culture, SB1-1110, Biology (General), QH301-705.5
Abstract: Abstract Background Pesticide efficacy directly affects crop yield and quality, making targeted spraying a more environmentally friendly and effective method of pesticide application. Common targeted cabbage spraying methods often involve object detection networks. However, complex natural and lighting conditions pose challenges in the accurate detection and positioning of cabbage. Results In this study, a cabbage detection algorithm based on the YOLOv8n neural network (YOLOv8-cabbage) combined with a positioning system constructed using a Realsense depth camera is proposed. Initially, four of the currently available high-performance object detection models were compared, and YOLOv8n was selected as the transfer learning model for field cabbage detection. Data augmentation and expansion methods were applied to extensively train the model, a large kernel convolution method was proposed to improve the bottleneck section, the Swin transformer module was combined with the convolutional neural network (CNN) to expand the perceptual field of feature extraction and improve edge detection effectiveness, and a nonlocal attention mechanism was added to enhance feature extraction. Ablation experiments were conducted on the same dataset under the same experimental conditions, and the improved model increased the mean average precision (mAP) from 88.8% to 93.9%. Subsequently, depth maps and colour maps were aligned pixelwise to obtain the three-dimensional coordinates of the cabbages via coordinate system conversion. The positioning error of the three-dimensional coordinate cabbage identification and positioning system was (11.2 mm, 10.225 mm, 25.3 mm), which meets the usage requirements. Conclusions We have achieved accurate cabbage positioning. The object detection system proposed here can detect cabbage in real time in complex field environments, providing technical support for targeted spraying applications and positioning.
Published: 2024
Full Text: View/download PDF

36. Distracted Driving Behavior Detection Algorithm Based on Lightweight StarDL-YOLO.

Author: Shen, Qian, Zhang, Lei, Zhang, Yuxiang, Li, Yi, Liu, Shihao, and Xu, Yin
Subjects: DISTRACTED driving, FEATURE extraction, COMPUTATIONAL complexity, MOTOR vehicle driving, GENERALIZATION, DEEP learning
Abstract: Distracted driving is one of the major factors leading drivers to ignore potential road hazards. In response to the challenges of high computational complexity, limited generalization capacity, and suboptimal detection accuracy in existing deep learning-based detection algorithms, this paper introduces a novel approach called StarDL-YOLO (StarNet-detectlscd-yolo), which leverages an enhanced version of YOLOv8n. Initially, the StarNet integrated into the backbone of YOLOv8n significantly improves the feature extraction capability of the model with remarkable reduction in computational complexity. Subsequently, the Star Block is incorporated into the neck network, forming a C2f-Star module that offers lower computational cost. Additionally, shared convolution is introduced in the detection head to further reduce computational burden and parameter size. Finally, the Wise-Focaler-MPDIoU loss function is proposed to strengthen detection accuracy. The experimental results demonstrate that StarDL-YOLO significantly improves the efficiency of the distracted driving behavior detection, achieving an accuracy of 99.6% on the StateFarm dataset. Moreover, the parameter count of the model is minimized by 56.4%, and its computational load is decreased by 45.1%. Additionally, generalization experiments are performed on the 100-Driver dataset, revealing that the proposed scheme enhances generalization effectiveness compared to YOLOv8n. Therefore, this algorithm significantly reduces computational load while maintaining high reliability and generalization capability. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

37. YOLOv8-G: An Improved YOLOv8 Model for Major Disease Detection in Dragon Fruit Stems.

Author: Huang, Luobin, Chen, Mingxia, and Peng, Zihao
Subjects: *PITAHAYAS, *FEATURE extraction, *FRUIT yield, *RESEARCH personnel, *ALGORITHMS
Abstract: Dragon fruit stem disease significantly affects both the quality and yield of dragon fruit. Therefore, there is an urgent need for an efficient, high-precision intelligent detection method to address the challenge of disease detection. To address the limitations of traditional methods, including slow detection and weak micro-integration capability, this paper proposes an improved YOLOv8-G algorithm. The algorithm reduces computational redundancy by introducing the C2f-Faster module. The loss function was modified to the structured intersection over union (SIoU), and the coordinate attention (CA) and content-aware reorganization feature extraction (CARAFE) modules were incorporated. These enhancements increased the model's stability and improved its accuracy in recognizing small targets. Experimental results showed that the YOLOv8-G algorithm achieved a mean average precision (mAP) of 83.1% and mAP50:95 of 48.3%, representing improvements of 3.3% and 2.3%, respectively, compared to the original model. The model size and floating point operations per second (FLOPS) were reduced to 4.9 MB and 6.9 G, respectively, indicating reductions of 20% and 14.8%. The improved model achieves higher accuracy in disease detection while maintaining a lighter weight, serving as a valuable reference for researchers in the field of dragon fruit stem disease detection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

38. An Improved Fire and Smoke Detection Method Based on YOLOv8n for Smart Factories.

Author: Zhang, Ziyang, Tan, Lingye, and Robert, Tiong Lee Kong
Subjects: *DEEP learning, *SOCIAL development, *SMOKE, *BANDWIDTHS, *FACTORIES, *FIRE detectors
Abstract: Factories play a crucial role in economic and social development. However, fire disasters in factories greatly threaten both human lives and properties. Previous studies about fire detection using deep learning mostly focused on wildfire detection and ignored the fires that happened in factories. In addition, lots of studies focus on fire detection, while smoke, the important derivative of a fire disaster, is not detected by such algorithms. To better help smart factories monitor fire disasters, this paper proposes an improved fire and smoke detection method based on YOLOv8n. To ensure the quality of the algorithm and training process, a self-made dataset including more than 5000 images and their corresponding labels is created. Then, nine advanced algorithms are selected and tested on the dataset. YOLOv8n exhibits the best detection results in terms of accuracy and detection speed. ConNeXtV2 is then inserted into the backbone to enhance inter-channel feature competition. RepBlock and SimConv are selected to replace the original Conv and improve computational ability and memory bandwidth. For the loss function, CIoU is replaced by MPDIoU to ensure an efficient and accurate bounding box. Ablation tests show that our improved algorithm achieves better performance in all four metrics reflecting accuracy: precision, recall, F1, and mAP@50. Compared with the original model, whose four metrics are approximately 90%, the modified algorithm achieves above 95%. mAP@50 in particular reaches 95.6%, exhibiting an improvement of approximately 4.5%. Although complexity improves, the requirements of real-time fire and smoke monitoring are satisfied. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

39. 基于改进 YOLOv8n 的采掘工作面小目标检测方法.

Author: 薛小勇, 何新宇, 姚超修, 蒋泽, and 潘红光
Subjects: SAFETY hats, COAL mining, TESTING equipment, MINES & mineral resources, SNAKES
Abstract: Copyright of Journal of Mine Automation is the property of Industry & Mine Automation Editorial Department and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024
Full Text: View/download PDF

40. A Lightweight YOLOv8 Model for Apple Leaf Disease Detection.

Author: Gao, Lijun, Zhao, Xing, Yue, Xishen, Yue, Yawei, Wang, Xiaoqiang, Wu, Huanhuan, and Zhang, Xuedong
Subjects: MOBILE apps, APPLE growing, PLANT diseases, COMPUTATIONAL complexity, ALGORITHMS
Abstract: China holds the top position globally in apple production and consumption. Detecting diseases during the planting process is crucial for increasing yields and promoting the rapid development of the apple industry. This study proposes a lightweight algorithm for apple leaf disease detection in natural environments, which is conducive to application on mobile and embedded devices. Our approach modifies the YOLOv8n framework to improve accuracy and efficiency. Key improvements include replacing conventional Conv layers with GhostConv and parts of the C2f structure with C3Ghost, reducing the model's parameter count, and enhancing performance. Additionally, we integrate a Global attention mechanism (GAM) to improve lesion detection by more accurately identifying affected areas. An improved Bi-Directional Feature Pyramid Network (BiFPN) is also incorporated for better feature fusion, enabling more effective detection of small lesions in complex environments. Experimental results show a 32.9% reduction in computational complexity and a 39.7% reduction in model size to 3.8 M, with performance metrics improving by 3.4% to a mAP@0.5 of 86.9%. Comparisons with popular models like YOLOv7-Tiny, YOLOv6, YOLOv5s, and YOLOv3-Tiny demonstrate that our YOLOv8n–GGi model offers superior detection accuracy, the smallest size, and the best overall performance for identifying critical apple diseases. It can serve as a guide for implementing real-time crop disease detection on mobile and embedded devices. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

41. A Detection Algorithm for Citrus Huanglongbing Disease Based on an Improved YOLOv8n.

Author: Xie, Wu, Feng, Feihong, and Zhang, Huimin
Subjects: *CITRUS greening disease, *OBJECT recognition (Computer vision), *CITRUS, *FEATURE extraction, *ALGORITHMS, *ORCHARD management, *ORCHARDS
Abstract: Given the severe impact of Citrus Huanglongbing on orchard production, accurate detection of the disease is crucial in orchard management. In the natural environments, due to factors such as varying light intensities, mutual occlusion of citrus leaves, the extremely small size of Huanglongbing leaves, and the high similarity between Huanglongbing and other citrus diseases, there remains an issue of low detection accuracy when using existing mainstream object detection models for the detection of citrus Huanglongbing. To address this issue, we propose YOLO-EAF (You Only Look Once–Efficient Asymptotic Fusion), an improved model based on YOLOv8n. Firstly, the Efficient Multi-Scale Attention Module with cross-spatial learning (EMA) is integrated into the backbone feature extraction network to enhance the feature extraction and integration capabilities of the model. Secondly, the adaptive spatial feature fusion (ASFF) module is used to enhance the feature fusion ability of different levels of the model so as to improve the generalization ability of the model. Finally, the focal and efficient intersection over union (Focal–EIOU) is utilized as the loss function, which accelerates the convergence process of the model and improves the regression precision and robustness of the model. In order to verify the performance of the YOLO-EAF method, we tested it on the self-built citrus Huanglongbing image dataset. The experimental results showed that YOLO-EAF achieved an 8.4% higher precision than YOLOv8n on the self-built dataset, reaching 82.7%. The F1-score increased by 3.33% to 77.83%, and the mAP (0.5) increased by 3.3% to 84.7%. Through experimental comparisons, the YOLO-EAF model proposed in this paper offers a new technical route for the monitoring and management of Huanglongbing in smart orange orchards. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

42. An Improved YOLOv8n Used for Fish Detection in Natural Water Environments.

Author: Zhang, Zehao, Qu, Yi, Wang, Tan, Rao, Yuan, Jiang, Dan, Li, Shaowen, and Wang, Yating
Subjects: *FISHERY resources, *FISHERIES, *UNDERWATER photography, *PROBLEM solving
Abstract: Simple Summary: Underwater fish species are an important direction in fishery resource surveys. Rapidly determining species of underwater fish can improve the efficiency of fishery resource surveys. Therefore, this study proposes an effective method for underwater fish measurement, which can quickly acquire underwater fish species. The experimental results demonstrate the accuracy and superiority of our method. The proposed method improves the efficiency of fishery resource surveys and provides crucial data support for the precise management of fishery resources. To improve detection efficiency and reduce cost consumption in fishery surveys, target detection methods based on computer vision have become a new method for fishery resource surveys. However, the specialty and complexity of underwater photography result in low detection accuracy, limiting its use in fishery resource surveys. To solve these problems, this study proposed an accurate method named BSSFISH-YOLOv8 for fish detection in natural underwater environments. First, replacing the original convolutional module with the SPD-Conv module allows the model to lose less fine-grained information. Next, the backbone network is supplemented with a dynamic sparse attention technique, BiFormer, which enhances the model's attention to crucial information in the input features while also optimizing detection efficiency. Finally, adding a 160 × 160 small target detection layer (STDL) improves sensitivity for smaller targets. The model scored 88.3% and 58.3% in the two indicators of mAP@50 and mAP@50:95, respectively, which is 2.0% and 3.3% higher than the YOLOv8n model. The results of this research can be applied to fishery resource surveys, reducing measurement costs, improving detection efficiency, and bringing environmental and economic benefits. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

43. Lightweight Road Damage Detection Method Based on Improved YOLOv8.

Author: XU Tiefeng, HUANG He, ZHANG Hongmin, and NIU Xiaofu
Subjects: COMPUTATIONAL complexity, SPEED, MEMORY, PERCENTILES
Abstract: Aiming at the problems of large memory space occupation, high computational complexity, and difficult to meet the real- time target detection requirements of the road damage detection model in complex scenes, a lightweight road damage detection model DGE-YOLO-P is proposed for the complex natural scenes. Firstly, the C2f fusion deformable convolutional design C2f_DCNv3 module in the network is enhanced to enhance the modelling capability of object deformation and the input feature information is dimensionality reduced to effectively reduce the number of parameters and the computational complexity. The input feature information is dimensionality reduced to effectively reduce the number of model parameters and computational complexity. Then, the GS-Decoupled head detection module is designed to reduce the parameters of the detection head while realising the effective aggregation of global information. At the same time, the E-Slide Loss weight function is designed to assign higher weights to the difficult samples, fully learn the difficult sample data in road damage, and further improve the model detection accuracy. Finally, channel pruning is used to reduce the redundant channels of the model, which effectively compresses the model volume and improves the detection speed. The experimental results show that the mAP of the DGE-YOLO-P model is increased by 2.4 percentage points compared with the YOLOv8n model, while the number of model parameters, computational volume and model size are reduced by 58.1%, 66.7% and 55.5%, respectively. The detection speed FPS is increased from 34 frame/s to 51 frame/s. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

44. Detection Based on Semantics and a Detail Infusion Feature Pyramid Network and a Coordinate Adaptive Spatial Feature Fusion Mechanism Remote Sensing Small Object Detector.

Author: Zhou, Shilong and Zhou, Haijin
Subjects: *REMOTE sensing, *AERIAL photography, *DETECTORS, *SEMANTICS, *DRONE aircraft
Abstract: In response to the challenges of remote sensing imagery, such as unmanned aerial vehicle (UAV) aerial imagery, including differences in target dimensions, the dominance of small targets, and dense clutter and occlusion in complex environments, this paper optimizes the YOLOv8n model and proposes an innovative small-object-detection model called DDSC-YOLO. First, a DualC2f structure is introduced to improve the feature-extraction capabilities of the model. This structure uses dual-convolutions and group convolution techniques to effectively address the issues of cross-channel communication and preserving information in the original input feature mappings. Next, a new attention mechanism, DCNv3LKA, was developed. This mechanism uses adaptive and fine-grained information-extraction methods to simulate receptive fields similar to self-attention, allowing adaptation to a wide range of target size variations. To address the problem of false and missed detection of small targets in aerial photography, we designed a Semantics and Detail Infusion Feature Pyramid Network (SDI-FPN) and added a dedicated detection scale specifically for small targets, effectively mitigating the loss of contextual information in the model. In addition, the coordinate adaptive spatial feature fusion (CASFF) mechanism is used to optimize the original detection head, effectively overcoming multi-scale information conflicts while significantly improving small target localization accuracy and long-range dependency perception. Testing on the VisDrone2019 dataset shows that the DDSC-YOLO model improves the mAP0.5 by 9.3% over YOLOv8n, and its performance on the SSDD and RSOD datasets also confirms its superior generalization capabilities. These results confirm the effectiveness and significant progress of our novel approach to small target detection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

45. Identification of Insect Pests on Soybean Leaves Based on SP-YOLO.

Author: Qin, Kebei, Zhang, Jie, and Hu, Yue
Subjects: *SOYBEAN diseases & pests, *CROP yields, *INSECT pests, *LEAF anatomy, *FEATURE extraction
Abstract: Soybean insect pests can seriously affect soybean yield, so efficient and accurate detection of soybean insect pests is crucial for soybean production. However, pest detection in complex environments suffers from the problems of small pest targets, large inter-class feature similarity, and background interference with feature extraction. To address the above problems, this study proposes the detection algorithm SP-YOLO for soybean pests based on YOLOv8n. The model utilizes FasterNet to replace the backbone of YOLOv8n, which reduces redundant features and improves the model's ability to extract effective features. Second, we propose the PConvGLU architecture, which enhances the capture and representation of image details while reducing computation and memory requirements. In addition, this study proposes a lightweight shared detection header, which enables the model parameter amount computation to be reduced and the model accuracy to be further improved by shared convolution and GroupNorm. The improved model achieves 80.8% precision, 66.4% recall, and 73% average precision, which is 6%, 5.4%, and 5.2%, respectively, compared to YOLOv8n. The FPS reaches 256.4, and the final model size is only 6.2 M, while the number of computational quantities of covariates is basically comparable to that of the original model. The detection capability of SP-YOLO is significantly enhanced compared to that of the existing methods, which provides a good solution for soybean pest detection. SP-YOLO provides an effective technical support for soybean pest detection. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

46. 基于改进YOLOv8n 的轻量化红花识别方法.

Author: 张新月, 胡广锐, 李浦航, 曹晓明, 张　浩, 陈　军, and 杨亮亮
Subjects: *MANUAL labor, *HARVESTING equipment, *SAFFLOWER, *CASH crops, *IMAGE recognition (Computer vision)
Abstract: Safflower is one of the most important cash crops in China. Its production area is concentrated in Xinjiang, Gansu and Ningxia. However, the harvesting of safflower relies mainly on manual labour at present. Particularly, the operating environment is easily affected by weather factors. Fortunately, intelligent harvesting can be expected to improve the efficiency of safflower harvesting with labour cost savings. Previous research often focused on pneumatic, pulling, combing, and cutting harvesting. However, it is still required for manual work during harvesting. The autonomous operation can be realized by combining target detection and navigation in the harvesting robots. However, the complex working environment in the field has limited the accurate recognition and localization in the harvesting process. This study aims to promote the performance of safflower recognition under the complex environment in the field during intelligent harvesting. A lightweight safflower recognition was also proposed using an improved YOLOv8n. The computational resources of the device were then deployed to the model on the mobile for detection. A dataset of 2309 images was created to categorize into two classes: picked and no picked. The safflower blooming was categorized into four stages, namely the bud, first flowering, prime bloom, and septum stage. The prime bloom stage was the best picking time in the most economically beneficial period of safflower. Therefore, the safflower only in the prime bloom stage was picked rather than the bud, first flowering, and septum stage. The improvement procedures were as follows. Firstly, the Vanillanet lightweight network structure was applied to substitute for the Backbone of YOLOv8n, in order to reduce the complex structure of the model. Secondly, the large separable kernel attention (LSKA) module was introduced into the Neck, in order to reduce the amount of storage and computational resource consumption. Thirdly, The YOLOv8n's loss function was revised from the center intersection of union (CIoU) to the wise intersection of union (WIoU), in order to improve the overall performance of the detector. Finally, the stochastic gradient descent (SGD) was chosen to train the model for robustness. The experimental results showed that the frames per second (FPS) of the improved lightweight model increased by 7.41%, while the weight file was only 50.17% of the original one. The precision (P) and the mean average precision (mAP) values reached 93.10% and 96.40%, respectively. Furthermore, the FPS was improved by 25.93% and 19.76%, the weight file was reduced by 21.90% and 25.86%, respectively, compared with the YOLOv5s and YOLOv7-tiny models. Meanwhile, better robustness was achieved in the improved model. The Jetson Orin NX flatform was then selected to deploy for testing. The single-image detection time of YOLOv8n and YOLOv8n-VLWS was 0.38s and 0.27s, which was 28.95% shorter than the original model. The high precision and lightweight of real-time detection was realized for the safflower in the field. The findings can provide the technical support to develop intelligent harvesting equipment for safflower. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

47. YOLOv8n-LSLW: a lightweight method for real-time detection of wild fishing behavior.

Author: Yan, Pengcheng, Wang, Wenchang, Li, Guodong, Zhao, Yuting, Wang, Jingbao, and Wen, Ziming
Abstract: With the development of the electric power industry, the laying of transmission lines covers various waters, which poses a great threat to the life safety of fishermen who intrude into high-voltage areas. To address this issue, this paper proposes a lightweight unmanned aerial vehicle (UAV) inspection algorithm. Firstly, in view of the diversity and complexity of the field environment, a complex targeted data augmentation method and an adaptive histogram equalization (AHE) method were designed. Secondly, the YOLOv8n algorithm is employed, with the design of C2f-Ghost modules and GhostConv modules to construct the light-back bone layer, aimed at improving detection accuracy and speed. Subsequently, improvements are made to the small target detection layer, and a lightweight Light Bi-directional Feature Pyramid Network (Light-BiFPN) structure is proposed. Finally, the Wise Intersection over Union (WIoU) loss function is introduced to enhance the quality of model anchor boxes. Experimental results demonstrate that the improved algorithm achieves good detection accuracy with smaller weight files, making it suitable for deployment on mobile devices and also performs well on the VisDrone2019 dataset. This algorithm plays a proactive role in safeguarding the safety of fishermen and ensuring the stable operation of power systems. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

48. SES-YOLOv8n: automatic driving object detection algorithm based on improved YOLOv8.

Author: Sun, Yang, Zhang, Yuhang, Wang, Haiyang, Guo, Jianhua, Zheng, Jiushuai, and Ning, Haonan
Abstract: The perception system in autonomous driving mainly uses object detection algorithms to obtain the distribution of obstacles for recognition and analysis. Current object detection algorithms have rapidly developed, but it is challenging to balance the requirements of real-time detection and high detection accuracy in actual application scenarios. To solve the above problems, this paper uses YOLOv8n as the baseline model and proposes an object detection network named SES-YOLOv8n. Firstly, the SPPF module in the network was replaced by the SPPCSPC module to enhance further the model's fusion ability under feature maps of different scales. The efficient multi-scale attention module EMA is introduced into the C2F module of the backbone network, which improves the perception ability in critical areas and the efficiency of feature extraction. Finally, the SPD-Conv module is used to replace part of the convolution modules in the backbone network to replace the downsampling operation, which can more effectively retain the feature information and improve the network's accuracy and learning ability. Experimental results on the KITTI dataset and BDD100K dataset show that the average accuracy of the improved network model reaches 92.7% and 41.9%, which is 3.4% and 5.0% higher than that of the baseline model and is significantly better than the baseline model. This model can realize real-time image processing in general scenes based on ensuring high detection accuracy. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

49. CTM-YOLOv8n: A Lightweight Pedestrian Traffic-Sign Detection and Recognition Model with Advanced Optimization.

Author: Chen, Qiang, Dai, Zhongmou, Xu, Yi, and Gao, Yuezhen
Subjects: TRAFFIC signs & signals, PEDESTRIANS, COLLECTIONS
Abstract: Traffic-sign detection and recognition (TSDR) is crucial to avoiding harm to pedestrians, especially children, from intelligent connected vehicles and has become a research hotspot. However, due to motion blurring, partial occlusion, and smaller sign sizes, pedestrian TSDR faces increasingly significant challenges. To overcome these difficulties, a CTM-YOLOv8n model is proposed based on the YOLOv8n model. With the aim of extracting spatial features more efficiently and making the network faster, the C2f Faster module is constructed to replace the C2f module in the head, which applies filters to only a few input channels while leaving the remaining ones untouched. To enhance small-sign detection, a tiny-object-detection (TOD) layer is designed and added to the first C2f layer in the backbone. Meanwhile, the seventh Conv layer, eighth C2f layer, and connected detection head are deleted to reduce the quantity of model parameters. Eventually, the original CIoU is replaced by the MPDIoU, which is better for training deep models. During experiments, the dataset is augmented, which contains the choice of categories 'w55' and 'w57' in the TT100K dataset and a collection of two types of traffic signs around the schools in Tianjin. Empirical results demonstrate the efficacy of our model, showing enhancements of 5.2% in precision, 10.8% in recall, 7.0% in F1 score, and 4.8% in mAP@0.50. However, the number of parameters is reduced to 0.89M, which is only 30% of the YOLOv8n model. Furthermore, the proposed CTM-YOLOv8n model shows superior performance when tested against other advanced TSDR models. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

50. Fusion of Target and Keypoint Detection for Automated Measurement of Mongolian Horse Body Measurements.

Author: Su, Lide, Li, Minghuang, Zhang, Yong, Zong, Zheying, and Gong, Caili
Subjects: CONVOLUTIONAL neural networks, STATURE, HORSE industry, BODY size, HORSES
Abstract: Accurate and efficient access to Mongolian horse body size information is an important component in the modernization of the equine industry. Aiming at the shortcomings of manual measurement methods, such as low efficiency and high risk, this study converts the traditional horse body measure measurement problem into a measurement keypoint localization problem and proposes a top-down automatic Mongolian horse body measure measurement method by integrating the target detection algorithm and keypoint detection algorithm. Firstly, the SimAM parameter-free attention mechanism is added to the YOLOv8n backbone network to constitute the SimAM–YOLOv8n algorithm, which provides the base image for the subsequent accurate keypoint detection; secondly, the coordinate regression-based RTMPose keypoint detection algorithm is used for model training to realize the keypoint localization of the Mongolian horse. Lastly, the cosine annealing method was employed to dynamically adjust the learning rate throughout the entire training process, and subsequently conduct body measurements based on the information of each keypoint. The experimental results show that the average accuracy of the SimAM–YOLOv8n algorithm proposed in this study was 90.1%, and the average accuracy of the RTMPose algorithm was 91.4%. Compared with the manual measurements, the shoulder height, chest depth, body height, body length, croup height, angle of shoulder and angle of croup had mean relative errors (MRE) of 3.86%, 4.72%, 3.98%, 2.74%, 2.89%, 4.59% and 5.28%, respectively. The method proposed in this study can provide technical support to realize accurate and efficient Mongolian horse measurements. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

200 results on '"YOLOv8n"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources