Mixture separability loss in a deep convolutional network for image classification

  • Trung Dung Do
  • Jin, Cheng-Bin
  • Van Huan Nguyen
  • Kim, Hakil
Citations

WEB OF SCIENCE

1
Citations

SCOPUS

2

초록

In machine learning, the cost function is crucial because it measures how good or bad a system is. In image classification, well-known networks only consider modifying the network structures and applying cross-entropy loss at the end of the network. However, using only cross-entropy loss causes a network to stop updating weights when all training images are correctly classified. This is the problem of early saturation. This study proposes a novel cost function, called mixture separability loss (MSL), which updates the weights of the network even when most of the training images are accurately predicted. MSL consists of between-class and within-class loss. Between-class loss maximises the differences between inter-class images, whereas within-class loss minimises the similarities between intra-class images. They designed the proposed loss function to attach to different convolutional layers in the network in order to utilise intermediate feature maps. Experiments show that a network with MSL deepens the learning process and obtains promising results with some public datasets, such as Street View House Number, Canadian Institute for Advanced Research, and the authors' self-collected Inha Computer Vision Lab gender dataset.

키워드

image representationimage classificationentropycomputer visionlearning (artificial intelligence)feedforward neural netsintra-class imagesloss functionMSLdeep convolutional networkimage classificationmachine learningwell-known networksnetwork structuresapplying cross-entropy losstraining imagesnovel cost functionbetween-class lossinter-class imagesconvolutional layersmixture separability losswithin-class lossstreet view house numberCanadian institute for advanced researchself-collected Inha computer vision lab gender dataset
제목
Mixture separability loss in a deep convolutional network for image classification
저자
Trung Dung DoJin, Cheng-BinVan Huan NguyenKim, Hakil
DOI
10.1049/iet-ipr.2018.5613
발행일
2019-01
유형
Article
저널명
IET Image Processing
13
1
페이지
135 ~ 141