Generating a New Dataset for Korean Scene Text Recognition with Augmentation Techniques

Kim, Mincheol; Choi, Wonik

doi:10.1007/978-981-10-6520-0_26

상세 보기

Generating a New Dataset for Korean Scene Text Recognition with Augmentation Techniques

Kim, Mincheol;
Choi, Wonik

Citations

WEB OF SCIENCE

1

Citations

SCOPUS

2

초록

Korean text recognition in a natural scene is a challenging task due to the complexity of character shapes and the lack of dataset comparing to English or other languages. In this paper, we present a new dataset with the goal of improving the recognition of Korean natural scene text. Our dataset is generated by data augmentation techniques without losing a reality. The number of augmented images is 3 million and these images are made up of about 30 non-commercial fonts and 511,000 words from a standard Korean language dictionary. This enormous amount of data offers new possibilities for training deeper neural networks. In our extensive experiments, results show that our dataset effectively trains convolutional recurrent neural networks that achieve state-of-the-art performance on the Korea Advanced Institute of Science & Technology (KAIST) scene text database with very few data-acquisition costs.

키워드

Scene text recognition; Data augmentation; Neural network

제목: Generating a New Dataset for Korean Scene Text Recognition with Augmentation Techniques

저자: Kim, Mincheol; Choi, Wonik

DOI: 10.1007/978-981-10-6520-0_26

발행일: 2018

유형: Proceedings Paper

저널명: Lecture Notes in Electrical Engineering

권: 461

페이지: 247 ~ 252