节点文献

改进掩码自编码器的工业缺陷检测方法

Industrial defect detection method with improved masked autoencoder

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 邓凯丽魏伟波潘振宽

【Author】 DENG Kaili;WEI Weibo;PAN Zhenkuan;College of Computer Science & Technology, Qingdao University;

【通讯作者】 魏伟波;

【机构】 青岛大学计算机科学技术学院

【摘要】 针对目前只需正常样本即可实现缺陷检测的方法存在漏检或过度检测的问题,构建一种改进掩码自编码器与改进Unet结合的方法实现像素级缺陷检测。首先,采用拟合缺陷模块生成缺陷掩码图像及正常图像对应的缺陷图像;其次,对缺陷图像随机掩码,去除缺陷图像大部分的缺陷信息,激励Transformer结构的自编码器从未掩码的正常区域学习表示并依据上下文修复缺陷图像,为了提高模型对细节的修复能力,设计了新的损失函数;最后,将缺陷图像与修复图像拼接后输入拥有通道方向交叉融合Transformer结构的Unet,实现像素级缺陷检测。实验结果表明,在MVTec AD数据集上,所提方法平均的基于图像的和基于像素的接受者操作特征曲线下的面积值(ROC AUC)分别达到了0.984和0.982,与DRAEM(Discriminatively trained Reconstruction Anomaly Embedding Model)相比分别提高了2.9和3.2个百分点;与CFLOW-AD(Anomaly Detection via Conditional normalizing FLOWs)相比分别提高了3.1和0.8个百分点,证明所提方法具有较高的识别率和检测精度。

【Abstract】 Considering the problem of missed detection or over detection in the existing defect detection methods that only need normal samples, an method that combined an improved masked autoencoder with an improved Unet was constructed to achieve pixel-level defect detection. Firstly, a defect fitting module was used to generate the defect mask image and the defect image corresponding to the normal image. Secondly, the defect image was randomly masked to remove most of the defect information from the defect image. The autoencoder with Transformer structure was stimulated to learn the representations from unmasked normal regions and to repair the defect image based on context. In order to improve the model’s ability to repair details of the image, a new loss function was designed. Finally, in order to achieve pixel-level defect detection, the defect image and the repaired image were concatenated and input into the Unet with the channel crossfusion Transformer structure. Experimental results on MVTec AD dataset show that the average image-based and pixel-based Area Under the Receiver Operating Characteristic Curve(ROC AUC) of the proposed method reached 0. 984 and 0. 982respectively; compared with DRAEM(Discriminatively trained Reconstruction Anomaly Embedding Model), it was increased by 2. 9 and 3. 2 percentage points; compared with CFLOW-AD(Anomaly Detection via Conditional normalizing FLOWs), it was increased by 3. 1 and 0. 8 percentage points. It verifies that the proposed method has high recognition rate and detection accuracy.

【基金】 山东省自然科学基金资助项目(ZR2020QF033)~~
  • 【文献出处】 计算机应用 ,Journal of Computer Applications , 编辑部邮箱 ,2024年08期
  • 【分类号】TP391.41
  • 【下载频次】9
节点文献中: 

本文链接的文献网络图示:

本文的引文网络