Tea Bud Detection Based on Hybrid Attention Mechanism

2025-5-4- 17

Tea Bud Detection Based on Hybrid Attention Mechanism
DOI:
                        
                    
CSTR:
                        
                    
Author:
                        wangzhou1wangzhou
College of Computer and Information Sciences Fujian Agriculture and Forestry University
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
zhouqi1zhouqi
College of Computer and Information Sciences Fujian Agriculture and Forestry University
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
wanglijing1wanglijing
College of Computer and Information Sciences Fujian Agriculture and Forestry University
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site
wuqingshou2,3,4,5,6,7,8,9wuqingshou
School of Mathematics and Computer Science Wuyi University, Wuyishan
Find this author on All Journals
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:1.College of Computer and Information Sciences Fujian Agriculture and Forestry University;2.School 3.of 4.Mathematics 5.and 6.Computer 7.Science Wuyi 8.University, 9.Wuyishan
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference [26]

Cited by

Materials

Comments

Abstract:

Accurately detecting tea buds is the key to achieving automation and intelligence in tea bud harvesting. However, to accurately identify tea bud images, it is necessary to overcome the problem of tea bud colors being similar to the background and the target size being too small. Therefore, this article studies a YOLOv5s model based on hybrid attention mechanism and applies it to tea bud detection. This article makes optimizations in the following two aspects: firstly, a hybrid attention mechanism (HAM) is proposed and added to the YOLOv5s backbone network, which enables the network to focus on the target area, extract features more fully, and improve the accuracy of object recognition by the model. Secondly, by introducing normalized Wasserstein distance (NWD) as a new metric and combining it with the existing CIoU loss function. The NWD loss function calculates the similarity between the bounding boxes based on their corresponding Gaussian distributions, thereby improving the model's accuracy in detecting small targets in images. The experimental results show that compared with the original YOLOv5s model, the improved model mAP0.5 increased by 0.9%, mAP0.5:0.95 increased by 1.3%, while the number of parameters only increased by 0.044×106. These results confirm the effectiveness of the proposed method in achieving precise tea picking recognition, providing technical reference for intelligent tea picking in practical scenarios.

Key words:tea bud detection; YOLOv5s; attention mechanism; loss function

Reference

Shao Peidi,Wu Minghui,Wang Xianwei,et al.Research on the tea bud recognition based on improved k-means algorithm[C],2018.

Li Wang,Chen Rong,Gao Yuan-yuan.Automatic Recognition of Tea Bud Image Based on Support Vector Machine[C],2020.

Zhang Lei,Zou Lang,Wu Chuanyu,et al.Method of famous tea sprout identification and segmentation based on improved watershed algorithm[J].COMPUTERS AND ELECTRONICS IN AGRICULTURE,2021,184.

Chen Chunlin,Lu Jinzhu,Zhou Mingchuan,et al.A YOLOv3-based computer vision system for identification of tea buds and the picking point[J].COMPUTERS AND ELECTRONICS IN AGRICULTURE,2022,198.

Cheng Yifan,Li Yang,Zhang Rentian,et al.Locating Tea Bud Keypoints by Keypoint Detection Method Based on Convolutional Neural Network[J].SUSTAINABILITY,2023,15(8):6898-6898.

Li Jie,Li Jiehao,Zhao Xin,et al.Lightweight detection networks for tea bud on complex agricultural environment via improved YOLO v4[J].COMPUTERS AND ELECTRONICS IN AGRICULTURE,2023,211.

Xie Shuang,Sun Hongwei.Tea-YOLOv8s: A Tea Bud Detection Model Based on Deep Learning and Computer Vision[J].SENSORS,2023,23(14):6576.

李孟浩,袁三男.基于改进YOLOv5s的交通标识检测算法[J].南京信息工程大学学报(自然科学版),2024,16(1):11-19

LI Menghao, YUAN Sannan. Traffic sign detection based on improved YOLOv5s[J]. Journal of Nanjing University of Information Science & Technology, 2024,16(1):11-19

Chen Zhiwei,Chen Jianneng,Li Yang,et al.Tea Bud Detection and 3D Pose Estimation in the Field with a Depth Camera Based on Improved YOLOv5 and the Optimal Pose-Vertices Search Method[J].AGRICULTURE-BASEL,2023,13(7).

Glenn, J., 2020 yolov5. Git code. https://github.com/ultralytics/yolov5.

Bochkovskiy Alexey,Wang Chien-Yao,Liao Hong-Yuan Mark.YOLOv4: Optimal Speed and Accuracy of Object Detection[J].arXiv,2020.

Liu Shu,Qi Lu,Qin Haifang,et al.Path Aggregation Network for Instance Segmentation[C],2018.

Lin Tsung-Yi,Dollar Piotr,Girshick Ross,et al.Feature Pyramid Networks for Object Detection[C],2017.

Lee Jiyoon,Kim Heegwang,Park Chanyeong,et al.Small Object Detection in Infrared Images Using Attention Mechanism and Sigmoid Function[C],2024.

Woo Sanghyun,Park Jongchan,Lee, Joon-Young,et al.CBAM: Convolutional Block Attention Module[C],2018.

庄建军,徐子恒,张若愚.基于改进的YOLOv5模型和射线法的车辆违停检测[J].南京信息工程大学学报(自然科学版),2024,16(3):341-351

ZHUANG Jianjun, XU Ziheng, ZHANG Ruoyu. Illegal parking detection based on improved YOLOv5 model and ray method[J]. Journal of Nanjing University of Information Science & Technology, 2024,16(3):341-351

Wang Qilong,Wu Banggu,Zhu Pengfei,et al.ECA-Net: Efficient channel attention for deep convolutional neural networks[J].arXiv,2019.

Hu Jie,Shen Li,Albanie Samuel,et al.Squeeze-and-Excitation Networks[J].IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2020,42(8):2011-2023.

Hou Qibin,Zhou Daquan,Feng Jiashi.Coordinate attention for efficient mobile network design[J].arXiv,2021.

Zheng Zhaohui,Wang Ping,Liu Wei,et al.Distance-IoU loss: Faster and better learning for bounding box regression[J].arXiv,2019.

Wang Jinwang,Xu Chang,Yang Wen,et al.A Normalized GaussianWasserstein Distance for Tiny Object Detection[J].arXiv,2021.

Li Chuyi,Li Lulu,Jiang Hongliang,et al.YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications[J].arXiv,2022.

Wang Chien-Yao,Bochkovskiy Alexey,Liao Hong-Yuan Mark.YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors[J].arXiv,2022.

Glenn, J. YOLOv8. Git Code. 2023. Available online: https://github.com/ultralytics/ultralytics.

Get Citation

Copy

Article Metrics

Abstract:19
PDF: 0
HTML: 0
Cited by: 0

History

Received:July 29,2024
Revised:September 21,2024
Adopted:September 23,2024
Online:
Published:

Article QR Code

Address：No. 219, Ningliu Road, Nanjing, Jiangsu Province

Postcode：210044

Phone：025-58731025

Get Citation

Share

Article Metrics

History

Article QR Code