Morpheme-based Korean text cohesion analyzer

Abstract

The fundamental difference between Korean and English text analysis lies in morpheme analysis. Existing Korean text analysis often relies on tools designed for English, and it yields inaccurate results because morpheme analysis is difficult. The primary reason is that existing morpheme analyzers depend on eojeol tokens (space-delimited word units), which makes it hard to capture the characteristics of Korean. We therefore introduce a Transformer-based morpheme analyzer that uses morpheme tokens to capture the inherent features of Korean sentences. We then integrate this morpheme analyzer into our Korean text analysis tool and offer it as a web service for efficient use.
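To make the distinction between eojeol tokens and morpheme tokens concrete, the following is a minimal sketch, not the analyzer described in the article. The names TOY_LEXICON, eojeol_tokens, and morpheme_tokens are hypothetical, and the segmentations and Sejong-style part-of-speech tags are hand-written for one example sentence rather than predicted by any model.

```python
# Toy illustration only: the segmentations below are hand-written,
# not produced by the article's Transformer-based analyzer.
TOY_LEXICON = {
    "나는": [("나", "NP"), ("는", "JX")],               # pronoun + particle
    "학교에": [("학교", "NNG"), ("에", "JKB")],          # noun + adverbial particle
    "갔다": [("가", "VV"), ("았", "EP"), ("다", "EF")],  # verb stem + past tense + ending
}


def eojeol_tokens(sentence):
    """Split a sentence into eojeols (space-delimited units)."""
    return sentence.split()


def morpheme_tokens(sentence):
    """Expand each eojeol into (morpheme, tag) pairs using the toy lexicon."""
    tokens = []
    for eojeol in eojeol_tokens(sentence):
        # Eojeols missing from the toy lexicon are kept whole and tagged UNK.
        tokens.extend(TOY_LEXICON.get(eojeol, [(eojeol, "UNK")]))
    return tokens


sentence = "나는 학교에 갔다"  # "I went to school"
print(eojeol_tokens(sentence))
print(morpheme_tokens(sentence))
```

In this sketch the three eojeols expand into seven morphemes, which shows how particles and verb endings that an eojeol-level tokenizer leaves fused become separate units at the morpheme level.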

Keywords

Korean text analysis; Morpheme analysis; Transformer
Title
Morpheme-based Korean text cohesion analyzer
Authors
Kim, Dong-Hyun; Ahn, Seokho; Lee, Euijong; Seo, Young-Duk
DOI
10.1016/j.softx.2024.101659
Publication date
2024-05
Type
Article
Journal
SoftwareX
Volume
26