Publications (Papers, etc.)

Basic Information

Name 永田 寅臣
Name (Kana) ナガタ フサオミ
Name (English) NAGATA Fusaomi

Paper Title

Improvements of Detection Accuracy and Its Confidence of Defective Areas by YOLOv2 Using a Dataset Augmentation Method

Authors

Koki Arima (M2), Fusaomi Nagata, Tatsuki Shimizu (M2), Akimasa Otuka, Hirohisa Kato, Keigo Watanabe, Maki K. Habib

Journal

Artificial Life and Robotics, Springer

Publication Date

2023/09

Vol. 28

No. 3

First Page

625

Last Page

631

Publisher (Japanese)

Springer

Publisher (English)

Springer

Publication Format

Print, Online

Abstract

Recently, CNNs (Convolutional Neural Networks) and Grad-CAM (Gradient-weighted Class Activation Mapping) have been applied to various kinds of defect detection and position recognition for industrial products. However, training a CNN model requires a large amount of image data to acquire the desired generalization ability. Also, it is not easy for Grad-CAM to clearly identify the defect area that serves as the basis of a classification result. Moreover, when they are deployed in an actual production line, the two calculation processes for the CNN and Grad-CAM have to be called sequentially for defect detection and position recognition, so processing time becomes a concern. In this paper, the authors apply YOLOv2 (You Only Look Once) to defect detection and its visualization so that both are processed at once. Generally, a YOLOv2 model can be built with fewer training images; however, a complicated labeling process is required to prepare ground truth data for training. A dataset for training a YOLOv2 model has to be composed of image files and the corresponding ground truth data file, named gTruth. The gTruth file contains the names of all the image files and their label information, such as label names and box dimensions. Therefore, YOLOv2 requires complex dataset augmentation of not only the images but also the gTruth data. The target products dealt with in this paper are produced in many varieties and small quantities, and defects occur infrequently. Moreover, because the indoor production line is fixed, the only valid image augmentation is the horizontal flip. In this paper, a dataset augmentation method is proposed to efficiently generate training data for YOLOv2 even in such a production situation and, consequently, to enhance the performance of defect detection and its visualization. The effectiveness of the method is shown through experiments.
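The abstract notes that augmenting a YOLOv2 dataset means transforming the box labels together with the images. The following is a minimal Python sketch of that idea for the horizontal flip only; it is not the authors' method or their gTruth file format. It assumes an image stored as a list of pixel rows and boxes stored as (x, y, width, height) tuples with a 0-based left/top corner, and the function names are hypothetical.

```python
from typing import List, Sequence, Tuple

# Assumed box layout (not taken from the paper): (x, y, width, height) in pixels,
# where (x, y) is the 0-based top-left corner of the box.
Box = Tuple[int, int, int, int]
Image = List[List[int]]


def flip_image_horizontal(image: Sequence[Sequence[int]]) -> Image:
    """Mirror the image left-to-right by reversing every pixel row."""
    return [list(row[::-1]) for row in image]


def flip_boxes_horizontal(boxes: Sequence[Box], image_width: int) -> List[Box]:
    """Mirror each box so it still covers the same pixels in the flipped image."""
    flipped = []
    for x, y, w, h in boxes:
        if x < 0 or w <= 0 or x + w > image_width:
            raise ValueError(f"box {(x, y, w, h)} lies outside the image width")
        # The old right edge (x + w) becomes the new left edge, measured
        # from the opposite side of the image.
        flipped.append((image_width - x - w, y, w, h))
    return flipped


def augment_sample(image: Sequence[Sequence[int]],
                   boxes: Sequence[Box]) -> Tuple[Image, List[Box]]:
    """Return the flipped image together with its matching flipped boxes."""
    image_width = len(image[0])
    return flip_image_horizontal(image), flip_boxes_horizontal(boxes, image_width)


if __name__ == "__main__":
    # Toy 2x4 image whose "defect" pixels (value 1) sit in the two left columns.
    toy_image = [[1, 1, 0, 0],
                 [0, 0, 0, 0]]
    toy_boxes = [(0, 0, 2, 1)]
    print(augment_sample(toy_image, toy_boxes))
```

Only the x coordinate changes under a horizontal flip; y, width, and height are untouched. Label stores that use 1-based pixel coordinates would need a slightly different offset.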