Tea Bud Detection Based on Hybrid Attention Mechanism
Author:
Affiliation:

1.College of Computer and Information Sciences Fujian Agriculture and Forestry University;2.School 3.of 4.Mathematics 5.and 6.Computer 7.Science Wuyi 8.University, 9.Wuyishan

  • Article
  • | |
  • Metrics
  • |
  • Reference [26]
  • | |
  • Cited by
  • | |
  • Comments
    Abstract:

    Accurately detecting tea buds is the key to achieving automation and intelligence in tea bud harvesting. However, to accurately identify tea bud images, it is necessary to overcome the problem of tea bud colors being similar to the background and the target size being too small. Therefore, this article studies a YOLOv5s model based on hybrid attention mechanism and applies it to tea bud detection. This article makes optimizations in the following two aspects: firstly, a hybrid attention mechanism (HAM) is proposed and added to the YOLOv5s backbone network, which enables the network to focus on the target area, extract features more fully, and improve the accuracy of object recognition by the model. Secondly, by introducing normalized Wasserstein distance (NWD) as a new metric and combining it with the existing CIoU loss function. The NWD loss function calculates the similarity between the bounding boxes based on their corresponding Gaussian distributions, thereby improving the model's accuracy in detecting small targets in images. The experimental results show that compared with the original YOLOv5s model, the improved model mAP0.5 increased by 0.9%, mAP0.5:0.95 increased by 1.3%, while the number of parameters only increased by 0.044×106. These results confirm the effectiveness of the proposed method in achieving precise tea picking recognition, providing technical reference for intelligent tea picking in practical scenarios.

    Reference
    Shao Peidi,Wu Minghui,Wang Xianwei,et al.Research on the tea bud recognition based on improved k-means algorithm[C],2018.
    Li Wang,Chen Rong,Gao Yuan-yuan.Automatic Recognition of Tea Bud Image Based on Support Vector Machine[C],2020.
    Zhang Lei,Zou Lang,Wu Chuanyu,et al.Method of famous tea sprout identification and segmentation based on improved watershed algorithm[J].COMPUTERS AND ELECTRONICS IN AGRICULTURE,2021,184.
    Chen Chunlin,Lu Jinzhu,Zhou Mingchuan,et al.A YOLOv3-based computer vision system for identification of tea buds and the picking point[J].COMPUTERS AND ELECTRONICS IN AGRICULTURE,2022,198.
    Cheng Yifan,Li Yang,Zhang Rentian,et al.Locating Tea Bud Keypoints by Keypoint Detection Method Based on Convolutional Neural Network[J].SUSTAINABILITY,2023,15(8):6898-6898.
    Li Jie,Li Jiehao,Zhao Xin,et al.Lightweight detection networks for tea bud on complex agricultural environment via improved YOLO v4[J].COMPUTERS AND ELECTRONICS IN AGRICULTURE,2023,211.
    Xie Shuang,Sun Hongwei.Tea-YOLOv8s: A Tea Bud Detection Model Based on Deep Learning and Computer Vision[J].SENSORS,2023,23(14):6576.
    李孟浩,袁三男.基于改进YOLOv5s的交通标识检测算法[J].南京信息工程大学学报(自然科学版),2024,16(1):11-19
    LI Menghao, YUAN Sannan. Traffic sign detection based on improved YOLOv5s[J]. Journal of Nanjing University of Information Science & Technology, 2024,16(1):11-19
    Chen Zhiwei,Chen Jianneng,Li Yang,et al.Tea Bud Detection and 3D Pose Estimation in the Field with a Depth Camera Based on Improved YOLOv5 and the Optimal Pose-Vertices Search Method[J].AGRICULTURE-BASEL,2023,13(7).
    Glenn, J., 2020 yolov5. Git code. https://github.com/ultralytics/yolov5.
    Bochkovskiy Alexey,Wang Chien-Yao,Liao Hong-Yuan Mark.YOLOv4: Optimal Speed and Accuracy of Object Detection[J].arXiv,2020.
    Liu Shu,Qi Lu,Qin Haifang,et al.Path Aggregation Network for Instance Segmentation[C],2018.
    Lin Tsung-Yi,Dollar Piotr,Girshick Ross,et al.Feature Pyramid Networks for Object Detection[C],2017.
    Lee Jiyoon,Kim Heegwang,Park Chanyeong,et al.Small Object Detection in Infrared Images Using Attention Mechanism and Sigmoid Function[C],2024.
    Woo Sanghyun,Park Jongchan,Lee, Joon-Young,et al.CBAM: Convolutional Block Attention Module[C],2018.
    庄建军,徐子恒,张若愚.基于改进的YOLOv5模型和射线法的车辆违停检测[J].南京信息工程大学学报(自然科学版),2024,16(3):341-351
    ZHUANG Jianjun, XU Ziheng, ZHANG Ruoyu. Illegal parking detection based on improved YOLOv5 model and ray method[J]. Journal of Nanjing University of Information Science & Technology, 2024,16(3):341-351
    Wang Qilong,Wu Banggu,Zhu Pengfei,et al.ECA-Net: Efficient channel attention for deep convolutional neural networks[J].arXiv,2019.
    Hu Jie,Shen Li,Albanie Samuel,et al.Squeeze-and-Excitation Networks[J].IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2020,42(8):2011-2023.
    Hou Qibin,Zhou Daquan,Feng Jiashi.Coordinate attention for efficient mobile network design[J].arXiv,2021.
    Zheng Zhaohui,Wang Ping,Liu Wei,et al.Distance-IoU loss: Faster and better learning for bounding box regression[J].arXiv,2019.
    Wang Jinwang,Xu Chang,Yang Wen,et al.A Normalized GaussianWasserstein Distance for Tiny Object Detection[J].arXiv,2021.
    Li Chuyi,Li Lulu,Jiang Hongliang,et al.YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications[J].arXiv,2022.
    Wang Chien-Yao,Bochkovskiy Alexey,Liao Hong-Yuan Mark.YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[J].arXiv,2022.
    Glenn, J. YOLOv8. Git Code. 2023. Available online: https://github.com/ultralytics/ultralytics.
    Related
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation
Share
Article Metrics
  • Abstract:19
  • PDF: 0
  • HTML: 0
  • Cited by: 0
History
  • Received:July 29,2024
  • Revised:September 21,2024
  • Adopted:September 23,2024
Article QR Code

Address:No. 219, Ningliu Road, Nanjing, Jiangsu Province

Postcode:210044

Phone:025-58731025