A Noise-Resilient Auto-Labeling Framework With Transition Matrix

Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Recently, an auto-labeling framework has been applied in a lot of applications across various industries. Pseudo-labeling is the most common auto-labeling method, which involves converting unlabeled data into labeled data by assigning pseudo-labels. Unless we have a perfect model for pseudo-labeling, the additional labeled data we get from unlabeled data always includes noisy labels. However, this problem has not been studied by many researchers yet. Addressing this problem, we propose a noise-resilient auto-labeling framework using a transition matrix to mitigate the impact of label noise. The framework consists of three main stages: generating pseudo-labels for unlabeled data, identifying noisy samples based on the KL-divergence between estimated transition vectors and model outputs, and using noisy samples as unlabeled data and clean samples as labeled data in semi-supervised learning for training the final model. We also show how much noise is added through pseudo-labeling depending on the initial model's accuracy. Our experiments demonstrate that the proposed method outperforms the state-of-the-art methods for handling noisy labels on both standard classification benchmarks (e.g., CIFAR-10 and CIFAR-100) and real-world datasets (e.g., Clothing100K, Food-101).

키워드

NoiseNoise measurementData modelsVectorsTrainingLabelingPredictive modelsEntropyAccuracySupervised learningNoisy labelsauto-labelingactive learningpseudo-labelingdeep learningnoise handling
제목
A Noise-Resilient Auto-Labeling Framework With Transition Matrix
저자
Lee, WonheeHur, Youngbum
DOI
10.1109/ACCESS.2025.3626158
발행일
2025
유형
Article
저널명
IEEE Access
13
페이지
185790 ~ 185801