Enhanced Data-Recalibration: Utilizing Validation Data to Mitigate Instance-Dependent Noise in Classification

Saeed Bakhshi Germi, Esa Rahtu

Research output: Conference article, Scientific, peer-reviewed

1 Citation (Scopus)
10 Downloads (Pure)

Abstract

This paper proposes a practical approach to deal with instance-dependent noise in classification. Supervised learning with noisy labels is one of the major research topics in the deep learning community. While earlier works typically assume class-conditional, instance-independent noise, recent works provide theoretical and empirical evidence that the noise in real-world cases is instance-dependent. Current state-of-the-art methods for dealing with instance-dependent noise focus on data-recalibration strategies that iteratively correct labels while training the network. While some methods provide theoretical analysis to prove that each iteration results in a cleaner dataset and a better-performing network, their limiting assumptions and their dependency on knowledge about the noise for hyperparameter tuning are often at odds with these claims. The proposed method is a two-stage data-recalibration algorithm that utilizes validation data to correct noisy labels and refine the model iteratively. The algorithm trains the network on the latest cleansed training set to obtain better performance on a small, clean validation set, and uses the best-performing model to cleanse the training set for the next iteration. The intuition behind the method is that a network with decent performance on the clean validation set can be utilized as an oracle network to generate less noisy labels for the training set. While there is no theoretical guarantee attached, the method’s effectiveness is demonstrated with extensive experiments on synthetic and real-world benchmark datasets. The empirical evaluation suggests that the proposed method performs better than the current state-of-the-art works. The implementation is available at https://github.com/Sbakhshigermi/EDR.
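
The loop described in the abstract (train on the latest cleansed training set, keep the model that scores best on the clean validation set, use that model as an oracle to relabel the training set) can be illustrated with a small sketch. This is not the authors' implementation, which is available at the repository linked above. Everything here is an assumption made for illustration: a nearest-centroid classifier stands in for the network, the synthetic two-blob data and its noise model are invented, the function names (`fit_centroids`, `predict`, `recalibrate`) are hypothetical, and the relabeling rule (overwrite every training label with the oracle's prediction) is a simplification that may differ from the rule used in the paper.

```python
import numpy as np


def fit_centroids(X, y, n_classes):
    """Toy stand-in for network training: one mean vector per class.

    Assumes every class has at least one sample in y.
    """
    return np.stack([X[y == c].mean(axis=0) for c in range(n_classes)])


def predict(centroids, X):
    """Assign each sample to the class of its nearest centroid."""
    dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)


def recalibrate(X_train, y_noisy, X_val, y_val, n_classes, n_iters=5):
    """Hypothetical two-stage loop: train, then relabel with the best model."""
    labels = y_noisy.copy()
    best_acc, best_model = -1.0, None
    for _ in range(n_iters):
        # Stage 1: train on the latest cleansed labels, score on clean validation data.
        model = fit_centroids(X_train, labels, n_classes)
        acc = float((predict(model, X_val) == y_val).mean())
        if acc > best_acc:
            best_acc, best_model = acc, model
        # Stage 2: the best model so far acts as the oracle that relabels the training set.
        # Assumption: every label is overwritten; the paper's rule may be more selective.
        labels = predict(best_model, X_train)
    return labels, best_model, best_acc


def make_blobs(rng, n_per_class):
    """Two Gaussian blobs in 2D, centred at (-2, 0) and (2, 0)."""
    X0 = rng.normal(loc=[-2.0, 0.0], scale=1.0, size=(n_per_class, 2))
    X1 = rng.normal(loc=[2.0, 0.0], scale=1.0, size=(n_per_class, 2))
    return np.vstack([X0, X1]), np.repeat([0, 1], n_per_class)


rng = np.random.default_rng(0)
X_train, y_true = make_blobs(rng, 200)
X_val, y_val = make_blobs(rng, 20)  # small, clean validation set

# Instance-dependent noise (invented for this demo): samples close to the
# class boundary are flipped far more often than samples far away from it.
flip_prob = np.where(np.abs(X_train[:, 0]) < 2.0, 0.45, 0.10)
flip = rng.random(len(y_true)) < flip_prob
y_noisy = np.where(flip, 1 - y_true, y_true)

y_clean, oracle, val_acc = recalibrate(X_train, y_noisy, X_val, y_val, n_classes=2)

noisy_acc = float((y_noisy == y_true).mean())
clean_acc = float((y_clean == y_true).mean())
print(f"label accuracy before: {noisy_acc:.3f}, after: {clean_acc:.3f}")
print(f"best validation accuracy: {val_acc:.3f}")
```

In this toy setting the true labels are known, so the fraction of correct training labels can be compared before and after recalibration. On real data only the validation accuracy is observable, which is why the sketch selects the oracle by validation accuracy alone.
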
Original language: English
Title of host publication: Image Analysis and Processing – ICIAP 2022 - 21st International Conference, 2022, Proceedings
Publisher: Springer
Pages: 621-632
Number of pages: 12
ISBN (electronic): 978-3-031-06427-2
ISBN (print): 978-3-031-06426-5
DOIs
Publication status: Published - 15 May 2022
OKM publication type: A4 Article in conference proceedings
Event: International Conference on Image Analysis and Processing - Lecce, Italy
Duration: 23 May 2022 to 27 May 2022

Publication series

Name: Lecture Notes in Computer Science
Volume: 13231 LNCS
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: International Conference on Image Analysis and Processing
Country/Territory: Italy
City: Lecce
Period: 23/05/22 to 27/05/22

Publication forum classification

  • Jufo level 1
