节点文献

基于改进YOLOv5的超分辨率和多尺度融合目标检测算法

Super-resolution and multi-scale fusion target detection algorithm based on improved YOLOv5

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 姚珊珊王静宇郝斌张飞高鹭任晓颖

【Author】 YAO Shanshan;WANG Jingyu;HAO Bin;ZHANG Fei;GAO Lu;REN Xiaoying;School of Information Engineering, Inner Mongolia University of Science and Technology;

【通讯作者】 郝斌;

【机构】 内蒙古科技大学信息工程学院

【摘要】 为了提升目标检测算法在多尺度学习方面的能力,尤其是对小目标的检测能力,本文提出了一种基于改进YOLOv5的超分辨率和多尺度融合目标检测算法。首先,该算法使用子像素卷积代替原YOLOv5模型的上采样操作,提高图像的分辨率,并尽可能保留小目标的信息。其次,使用并行快速多尺度融合(parallel fast multi-scale fusion, PFMF)模块实现深层特征和浅层特征的双向融合,将原YOLOv5算法的3尺度预测升级为4尺度预测,以此提高模型多尺度特征学习能力和对小目标的检测效果。实验结果表明,与YOLOv5s相比,改进后的模型在PASCAL VOC数据集中,mAP@0.5提高了2.8个百分点,mAP@0.5∶0.95提高了3.5个百分点;在MS COCO数据集中,mAP@0.5提高了4.3个百分点,mAP@0.5∶0.95提高了5.2个百分点。改进后的YOLOv5模型在多尺度检测,尤其是小目标的检测效果方面得到了提升,并具有一定的应用价值。

【Abstract】 To enhance the multi-scale learning capacity of target detection algorithms, particularly for small targets, this paper proposes a super-resolution and multi-scale fusion target detection algorithm based on an improved YOLOv5 framework.Firstly, instead of the up-sampling operation of the original YOLOv5 model, the algorithm utilizes sub-pixel convolution to enhance the image resolution and preserve the information of small targets to the greatest extent possible.Secondly, the algorithm utilizes the parallel fast multi-scale fusion(PFMF) module to achieve two-way fusion of deep and shallow features.This upgrade from the original YOLOv5 algorithm′s 3-scale prediction to 4-scale prediction improves the model′s ability to learn multi-scale features and detect small targets.The experimental results demonstrate that compared with YOLOv5s, the improved model achieves a 2.8% and 3.5% increase in mAP@0.5 and mAP@0.5∶0.95,respectively, on the PASCAL VOC dataset.Similarly, on the MS COCO dataset, the improved model achieves a 4.3% and 5.2% increase in mAP@0.5 and mAP@0.5∶0.95,respectively.The experiments demonstrate the improved YOLOv5 model’s enhanced capability in multi-scale detection, particularly for small targets, and indicate its potential practical value.

【基金】 内蒙古自治区科技计划项目(2021GG0046)和内蒙古自治区科技计划项目(2021GG0048)资助项目
  • 【文献出处】 光电子·激光 ,Journal of Optoelectronics·Laser , 编辑部邮箱 ,2024年08期
  • 【分类号】TP391.41;TP183
  • 【下载频次】291
节点文献中: 

本文链接的文献网络图示:

本文的引文网络