Real-Time Drivers’ Violation Behaviors Detection Based on Improved YOLOv3-tiny Algorithm Based on Model Pruning and Half-Precision Acceleration

【Authors】 YAO Wei-Wei; ZHANG Jie

【Corresponding Author】 ZHANG Jie

【Affiliation】 School of Mechanical Engineering, Southwest Jiaotong University

【Abstract】 To enable real-time, high-accuracy monitoring of drivers' safe-driving behavior on embedded devices, this study starts from YOLOv3-tiny, a classic deep-learning network for object detection, and applies channel pruning to compress the model for the object detection task. Pruning reduces the network's total computation and parameter count while keeping accuracy unchanged. Layer fusion and half-precision acceleration are then performed with TensorRT, NVIDIA's inference framework, and the accelerated model is deployed. Experimental results show that the accelerated model's inference speed is about twice that of the original model, its parameter size is halved, and its accuracy is not degraded, achieving real-time detection at high accuracy.
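The two compression ideas named in the abstract, channel pruning and half-precision storage, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not state the pruning criterion, so ranking channels by the magnitude of their batch-normalization scale factor is an assumption here, and the names `prune_channels`, `keep_ratio`, `conv_w`, and `bn_gamma` are hypothetical. The float16 cast only shows the storage effect of half precision; it does not use TensorRT.

```python
import numpy as np

def prune_channels(conv_w, bn_gamma, keep_ratio=0.5):
    """Keep the output channels with the largest |gamma|.

    Assumption: channel importance is judged by the batch-norm scale
    factor; the source abstract does not specify the criterion.
    """
    n_keep = max(1, int(round(conv_w.shape[0] * keep_ratio)))
    # Indices of the n_keep largest |gamma|, restored to ascending channel order.
    keep = np.sort(np.argsort(-np.abs(bn_gamma), kind="stable")[:n_keep])
    return conv_w[keep], bn_gamma[keep], keep

rng = np.random.default_rng(0)
# Toy convolution weights with layout (out_channels, in_channels, kH, kW).
conv_w = rng.standard_normal((16, 8, 3, 3)).astype(np.float32)
bn_gamma = rng.standard_normal(16).astype(np.float32)

pruned_w, pruned_gamma, kept = prune_channels(conv_w, bn_gamma, keep_ratio=0.5)

# Half-precision copy of the pruned weights: 2 bytes per value instead of 4.
half_w = pruned_w.astype(np.float16)

print(conv_w.nbytes, pruned_w.nbytes, half_w.nbytes)  # 4608 2304 1152
```

In a full network, removing an output channel also requires removing the matching input channel of the following layer, and a pruned model is normally fine-tuned afterwards; both steps are omitted from this sketch.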

【Funding】 National Natural Science Foundation of China (51775449, 51205323)
  • 【Source】 Computer Systems & Applications (计算机系统应用), 2020, Issue 04
  • 【Classification Codes】 U298; TP391.41
  • 【Citation Count】 18
  • 【Download Count】 362