VRD-YOLO:基于YOLOv5s改进的实时车载影像道路病害检测模型
作者:
中图分类号:

U418.6;TP391.41

基金项目:

国家自然科学基金(42371447);江苏省研究生科研与实践创新计划(KYCX24_1492)


VRD-YOLO:real-time vehicle-mounted image road damage detection based on improved YOLOv5s
Author:
  • 摘要
  • | |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • | |
  • 文章评论
    摘要:

    针对车载影像中的道路病害尺寸差异大,小尺度病害多,导致检测精度低的问题,本文提出一种基于YOLOv5s改进的实时车载影像道路病害检测模型VRD-YOLO(Vehicle-mounted image Road Damage Detection-YOLO).首先,提出通道混合滑动Transformer模块,增强模型全局上下文建模能力,强化细粒度道路病害语义特征信息提取;其次,引入具有跨层融合和跨尺度融合特性的广义特征金字塔,扩大网络感受野,强化多尺度病害特征融合;再次,设计动态检测头,实现尺度感知、空间感知和任务感知,优化模型特征响应,进一步提升模型的检测性能;最后,构建车载影像道路病害数据集VIRDD(Vehicle-mounted Image Road Damage Dataset),扩充现有道路病害数据集数量及类型,并基于该数据集进行消融和对比实验.实验结果表明:VRD-YOLO在VIRDD数据集上的平均精度均值(mAP@0.5)为74.45%,检测速度(FPS)可达到28.56帧/s,与YOLOv5s模型相比,精确度、召回率、F1分数和mAP分别提升2.79、2.32、2.54和3.19个百分点.同时,通过与其他6种经典及主流目标检测模型比较,VRD-YOLO以最少的模型参数量(9.68×106)获得了最佳的检测精度,验证了本文方法的有效性和优越性.

    Abstract:

    To accurately detect road damages with large size differences and small scales in vehicle-mounted images,this paper presents a real-time road damage detection model based on improved YOLOv5s,termed as VRD-YOLO (Vehicle-mounted image Road Damage Detection-YOLO).Firstly,a Channel Mix Slide Transformer (CMST) module is proposed to enhance the model's global context modeling capability and strengthen the extraction of fine-grained road damage semantic feature information.Secondly,a generalized feature pyramid with cross-layer fusion and cross-scale fusion is introduced to expand the network receptive field and strengthen the fusion of multi-scale damage features.Thirdly,to optimize the model's feature response and further improve detection performance,a dynamic detection head is designed to achieve scale perception,spatial perception,and task perception.Finally,a Vehicle-mounted Image Road Damage Dataset (VIRDD) is constructed to expand the quantity and types of existing road damage datasets,and ablation and comparative experiments are conducted based on this dataset.Experimental results show that the VRD-YOLO achieves a mean Average Precision (mAP@0.5) of 74.45% on the VIRDD dataset,with a detection speed reaching 28.56 frames per second.Compared to the YOLOv5s model,VRD-YOLO improves the precision,recall,F1 score,and mAP by 2.79,2.32,2.54,and 3.19 percentage points,respectively.Additionally,compared with six other classical and mainstream object detection models,the proposed VRD-YOLO attains the highest detection accuracy with the smallest model parameter count of 9.68 million,verifying its effectiveness and superiority.

    参考文献
    相似文献
    引证文献
引用本文

黄文龙,赵好好,康健,支晓栋,王东川,周维勋,倪欢,管海燕. VRD-YOLO:基于YOLOv5s改进的实时车载影像道路病害检测模型[J].南京信息工程大学学报(自然科学版),2025,17(2):151-164
HUANG Wenlong, ZHAO Haohao, KANG Jian, ZHI Xiaodong, WANG Dongchuan, ZHOU Weixun, NI Huan, GUAN Haiyan. VRD-YOLO:real-time vehicle-mounted image road damage detection based on improved YOLOv5s[J]. Journal of Nanjing University of Information Science & Technology, 2025,17(2):151-164

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-10-09
  • 在线发布日期: 2025-04-16
  • 出版日期: 2025-03-28

地址:江苏省南京市宁六路219号    邮编:210044

联系电话:025-58731025    E-mail:nxdxb@nuist.edu.cn

南京信息工程大学学报 ® 2025 版权所有  技术支持:北京勤云科技发展有限公司