VRD-YOLO:基于YOLOv5s改进的实时车载影像道路病害检测模型

doi:10.13878/j.cnki.jnuist.20241009001

2025年5月31日 9:10 星期六

首页 > 过刊浏览>2025年第17卷第2期 >151-164. DOI:10.13878/j.cnki.jnuist.20241009001

VRD-YOLO:基于YOLOv5s改进的实时车载影像道路病害检测模型
DOI:
                        10.13878/j.cnki.jnuist.20241009001
                    
作者:
                        黄文龙黄文龙
南京信息工程大学 遥感与测绘工程学院, 南京, 210044
在期刊界中查找
在百度中查找
在本站中查找
赵好好赵好好
南京信息工程大学 遥感与测绘工程学院, 南京, 210044
在期刊界中查找
在百度中查找
在本站中查找
康健康健
南京信息工程大学 遥感与测绘工程学院, 南京, 210044
在期刊界中查找
在百度中查找
在本站中查找
支晓栋支晓栋
北京中科鹏宇科技有限公司, 北京, 1000800
在期刊界中查找
在百度中查找
在本站中查找
王东川王东川
北京中科鹏宇科技有限公司, 北京, 1000800
在期刊界中查找
在百度中查找
在本站中查找
周维勋周维勋
南京信息工程大学 遥感与测绘工程学院, 南京, 210044
在期刊界中查找
在百度中查找
在本站中查找
倪欢倪欢
南京信息工程大学 遥感与测绘工程学院, 南京, 210044
在期刊界中查找
在百度中查找
在本站中查找
管海燕管海燕
南京信息工程大学 遥感与测绘工程学院, 南京, 210044
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:U418.6;TP391.41
基金项目:国家自然科学基金(42371447);江苏省研究生科研与实践创新计划(KYCX24_1492)

VRD-YOLO:real-time vehicle-mounted image road damage detection based on improved YOLOv5s

Author:

HUANG Wenlong
HUANG Wenlong
School of Remote Sensing & Geomatics Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China
在期刊界中查找
在百度中查找
在本站中查找
ZHAO Haohao
ZHAO Haohao
School of Remote Sensing & Geomatics Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China
在期刊界中查找
在百度中查找
在本站中查找
KANG Jian
KANG Jian
School of Remote Sensing & Geomatics Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China
在期刊界中查找
在百度中查找
在本站中查找
ZHI Xiaodong
ZHI Xiaodong
Beijing Zhongke Pengyu Technology Co., Ltd., Beijing 100080, China
在期刊界中查找
在百度中查找
在本站中查找
WANG Dongchuan
WANG Dongchuan
Beijing Zhongke Pengyu Technology Co., Ltd., Beijing 100080, China
在期刊界中查找
在百度中查找
在本站中查找
ZHOU Weixun
ZHOU Weixun
School of Remote Sensing & Geomatics Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China
在期刊界中查找
在百度中查找
在本站中查找
NI Huan
NI Huan
School of Remote Sensing & Geomatics Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China
在期刊界中查找
在百度中查找
在本站中查找
GUAN Haiyan
GUAN Haiyan
School of Remote Sensing & Geomatics Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

针对车载影像中的道路病害尺寸差异大,小尺度病害多,导致检测精度低的问题,本文提出一种基于YOLOv5s改进的实时车载影像道路病害检测模型VRD-YOLO(Vehicle-mounted image Road Damage Detection-YOLO).首先,提出通道混合滑动Transformer模块,增强模型全局上下文建模能力,强化细粒度道路病害语义特征信息提取;其次,引入具有跨层融合和跨尺度融合特性的广义特征金字塔,扩大网络感受野,强化多尺度病害特征融合;再次,设计动态检测头,实现尺度感知、空间感知和任务感知,优化模型特征响应,进一步提升模型的检测性能;最后,构建车载影像道路病害数据集VIRDD(Vehicle-mounted Image Road Damage Dataset),扩充现有道路病害数据集数量及类型,并基于该数据集进行消融和对比实验.实验结果表明:VRD-YOLO在VIRDD数据集上的平均精度均值(mAP@0.5)为74.45%,检测速度(FPS)可达到28.56帧/s,与YOLOv5s模型相比,精确度、召回率、F1分数和mAP分别提升2.79、2.32、2.54和3.19个百分点.同时,通过与其他6种经典及主流目标检测模型比较,VRD-YOLO以最少的模型参数量(9.68×10⁶)获得了最佳的检测精度,验证了本文方法的有效性和优越性.

关键词:YOLOv5s;道路病害检测;车载影像;Transformer;广义特征金字塔;动态检测头

Abstract:

To accurately detect road damages with large size differences and small scales in vehicle-mounted images,this paper presents a real-time road damage detection model based on improved YOLOv5s,termed as VRD-YOLO (Vehicle-mounted image Road Damage Detection-YOLO).Firstly,a Channel Mix Slide Transformer (CMST) module is proposed to enhance the model's global context modeling capability and strengthen the extraction of fine-grained road damage semantic feature information.Secondly,a generalized feature pyramid with cross-layer fusion and cross-scale fusion is introduced to expand the network receptive field and strengthen the fusion of multi-scale damage features.Thirdly,to optimize the model's feature response and further improve detection performance,a dynamic detection head is designed to achieve scale perception,spatial perception,and task perception.Finally,a Vehicle-mounted Image Road Damage Dataset (VIRDD) is constructed to expand the quantity and types of existing road damage datasets,and ablation and comparative experiments are conducted based on this dataset.Experimental results show that the VRD-YOLO achieves a mean Average Precision (mAP@0.5) of 74.45% on the VIRDD dataset,with a detection speed reaching 28.56 frames per second.Compared to the YOLOv5s model,VRD-YOLO improves the precision,recall,F1 score,and mAP by 2.79,2.32,2.54,and 3.19 percentage points,respectively.Additionally,compared with six other classical and mainstream object detection models,the proposed VRD-YOLO attains the highest detection accuracy with the smallest model parameter count of 9.68 million,verifying its effectiveness and superiority.

Key words:YOLOv5s;road damage detection;vehicle-mounted images;Transformer;generalized feature pyramid;dynamic head

引用本文

黄文龙,赵好好,康健,支晓栋,王东川,周维勋,倪欢,管海燕. VRD-YOLO:基于YOLOv5s改进的实时车载影像道路病害检测模型[J].南京信息工程大学学报(自然科学版),2025,17(2):151-164
HUANG Wenlong, ZHAO Haohao, KANG Jian, ZHI Xiaodong, WANG Dongchuan, ZHOU Weixun, NI Huan, GUAN Haiyan. VRD-YOLO:real-time vehicle-mounted image road damage detection based on improved YOLOv5s[J]. Journal of Nanjing University of Information Science & Technology, 2025,17(2):151-164

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-10-09
最后修改日期:
录用日期:
在线发布日期: 2025-04-16
出版日期: 2025-03-28

地址：江苏省南京市宁六路219号邮编：210044

联系电话：025-58731025 E-mail：nxdxb@nuist.edu.cn

引用本文

分享

文章指标

历史