Abstrakti
We present a new method for detecting and locating anomalies in textured-type images using transformer-based autoencoders. In this approach, a rectangular patch of an image is masked by setting its value to gray and then fetched into a pre-trained autoencoder with several blocks of transformer encoders and decoders in order to reconstruct the unknown part. It is shown that the pre-trained model is not able to reconstruct the defective parts properly when they are inside the masked patch. In this regard, the combination of the Structural Similarity Index Measure and absolute error between the reconstructed image and the original one can be used to define a new anomaly map to find and locate anomalies. In the experiment with the textured images of the MVTec dataset, we discover that not only can this approach find anomalous samples properly, but also the anomaly map itself can specify the exact locations of defects correctly at the same time. Moreover, not only is our method computatio nally efficient, as it utilizes a pre-trained model and does not require any training, but also it has a better performance compared to previous autoencoders and other reconstruction-based methods. Due to these reasons, one can use this method as a base approach to find and locate irregularities in real-world applications.
Alkuperäiskieli | Englanti |
---|---|
Otsikko | Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP |
Toimittajat | Petia Radeva, Antonino Furnari, Kadi Bouatouch, A. Augusto Sousa |
Kustantaja | Science and Technology Publications (SciTePress) |
Sivut | 191-200 |
Vuosikerta | 2 |
ISBN (elektroninen) | 978-989-758-679-8 |
DOI - pysyväislinkit | |
Tila | Julkaistu - 2024 |
OKM-julkaisutyyppi | A4 Artikkeli konferenssijulkaisussa |
Tapahtuma | International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Rome Kesto: 27 helmik. 2024 → 29 helmik. 2024 Konferenssinumero: 19 |
Julkaisusarja
Nimi | VISIGRAPP |
---|---|
ISSN (painettu) | 2184-5921 |
ISSN (elektroninen) | 2184-4321 |
Conference
Conference | International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
---|---|
Kaupunki | Rome |
Ajanjakso | 27/02/24 → 29/02/24 |
Julkaisufoorumi-taso
- Jufo-taso 1