Skip to main navigation Skip to search Skip to main content

Enhanced Data-Recalibration: Utilizing Validation Data to Mitigate Instance-Dependent Noise in Classification

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

1 Citation (Scopus)
24 Downloads (Pure)

Abstract

This paper proposes a practical approach to deal with instance-dependent noise in classification. Supervised learning with noisy labels is one of the major research topics in the deep learning community. While old works typically assume class conditional and instance-independent noise, recent works provide theoretical and empirical proof to show that the noise in real-world cases is instance-dependent. Current state-of-the-art methods for dealing with instance-dependent noise focus on data-recalibrating strategies to iteratively correct labels while training the network. While some methods provide theoretical analysis to prove that each iteration results in a cleaner dataset and a better-performing network, the limiting assumptions and dependency on knowledge about noise for hyperparameter tuning often contrast their claims. The proposed method in this paper is a two-stage data-recalibration algorithm that utilizes validation data to correct noisy labels and refine the model iteratively. The algorithm works by training the network on the latest cleansed training Set to obtain better performance on a small, clean validation set while using the best performing model to cleanse the training set for the next iteration. The intuition behind the method is that a network with decent performance on the clean validation set can be utilized as an oracle network to generate less noisy labels for the training set. While there is no theoretical guarantee attached, the method’s effectiveness is demonstrated with extensive experiments on synthetic and real-world benchmark datasets. The empirical evaluation suggests that the proposed method has a better performance compared to the current state-of-the-art works. The implementation is available at https://github.com/Sbakhshigermi/EDR.
Original languageEnglish
Title of host publicationImage Analysis and Processing – ICIAP 2022 - 21st International Conference, 2022, Proceedings
PublisherSpringer
Pages621-632
Number of pages12
ISBN (Electronic)978-3-031-06427-2
ISBN (Print)978-3-031-06426-5
DOIs
Publication statusPublished - 15 May 2022
Publication typeA4 Article in conference proceedings
EventInternational Conference on Image Analysis and Processing - Lecce, Italy
Duration: 23 May 202227 May 2022

Publication series

NameLecture Notes in Computer Science
Volume13231 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Conference on Image Analysis and Processing
Country/TerritoryItaly
CityLecce
Period23/05/2227/05/22

Publication forum classification

  • Publication forum level 1

Fingerprint

Dive into the research topics of 'Enhanced Data-Recalibration: Utilizing Validation Data to Mitigate Instance-Dependent Noise in Classification'. Together they form a unique fingerprint.

Cite this