Scalable Vertical Mining for Big Data Analytics of Frequent Itemsets

Leung, Carson K.; Zhang, Hao; Souza, Joglas; Lee, Wookey

doi:10.1007/978-3-319-98809-2_1

상세 보기

Scalable Vertical Mining for Big Data Analytics of Frequent Itemsets

Leung, Carson K.;
Zhang, Hao;
Souza, Joglas;
Lee, Wookey

Citations

WEB OF SCIENCE

17

Citations

SCOPUS

23

초록

Advances in technology and the increasing growth of popularity on Internet of Things (IoT) for many applications have produced huge volume of data at a high velocity. These valuable big data can be of a wide variety or different veracity. Embedded in these big data are useful information and valuable knowledge. This leads to data science, which aims to apply big data analytics to mine implicit, previously unknown and potentially useful information from big data. As a popular data analytic task, frequent itemset mining discovers knowledge about sets of frequently co-occurring items in the big data. Such a task has drawn attention in both academia and industry partially due to its practicality in various real-life applications. Existing mining approaches mostly use serial, distributed or parallel algorithms to mine the data horizontally (i.e., on a transaction basis). In this paper, we present an alternative big data analytic approach. Specifically, our scalable algorithm uses the MapReduce programming model that runs in a Spark environment to mine the data vertically (i.e., on an item basis). Evaluation results show the effectiveness of our algorithm in big data analytics of frequent itemsets.

키워드

Data mining; Knowledge discovery; Frequent patterns; Vertical mining; Big data; Spark; VISUAL ANALYTICS; PATTERNS

제목: Scalable Vertical Mining for Big Data Analytics of Frequent Itemsets

저자: Leung, Carson K.; Zhang, Hao; Souza, Joglas; Lee, Wookey

DOI: 10.1007/978-3-319-98809-2_1

발행일: 2018

유형: Proceedings Paper

저널명: Lecture Notes in Computer Science

권: 11029

페이지: 3 ~ 17