TY - JOUR
T1 - A dynamic classification unit for online segmentation of big data via small data buffers
AU - Khalemsky, Anna
AU - Gelbard, Roy
N1 - Publisher Copyright:
© 2019
PY - 2020/1
Y1 - 2020/1
N2 - In many segmentation processes, we assign new cases according to a model that was built on the basis of past cases. As long as the new cases are “similar enough” to the past cases, segmentation proceeds normally. However, when a new case is substantially different from the known cases, a reexamination of the previously created segments is required. The reexamination may result in the creation of new segments or in the updating of the existing ones. In this paper, we assume that in big and dynamic data environments it is not possible to reexamine all past data and, therefore, we suggest using small groups of selected cases, stored in small data buffers, as an alternative to the collection of all past data. We present an incremental dynamic classifier that supports real-time unsupervised segmentation in big and dynamic data environments. In order to reduce the computational effort of unsupervised clustering in such environments, the suggested model performs calculations only on the relevant data buffers that store the relevant representative cases. In addition, the suggested model can serve as a dynamic classification unit (DCU) that can act as an autonomous agent, as well as collaborate with other DCUs. The evaluation is presented by comparing three approaches: static, dynamic, and incremental dynamic.
AB - In many segmentation processes, we assign new cases according to a model that was built on the basis of past cases. As long as the new cases are “similar enough” to the past cases, segmentation proceeds normally. However, when a new case is substantially different from the known cases, a reexamination of the previously created segments is required. The reexamination may result in the creation of new segments or in the updating of the existing ones. In this paper, we assume that in big and dynamic data environments it is not possible to reexamine all past data and, therefore, we suggest using small groups of selected cases, stored in small data buffers, as an alternative to the collection of all past data. We present an incremental dynamic classifier that supports real-time unsupervised segmentation in big and dynamic data environments. In order to reduce the computational effort of unsupervised clustering in such environments, the suggested model performs calculations only on the relevant data buffers that store the relevant representative cases. In addition, the suggested model can serve as a dynamic classification unit (DCU) that can act as an autonomous agent, as well as collaborate with other DCUs. The evaluation is presented by comparing three approaches: static, dynamic, and incremental dynamic.
KW - Big data
KW - Classification
KW - Cluster analysis
KW - Dynamic segmentation
KW - Incremental data analysis
KW - Incremental dynamic classifier
UR - http://www.scopus.com/inward/record.url?scp=85072167273&partnerID=8YFLogxK
U2 - 10.1016/j.dss.2019.113157
DO - 10.1016/j.dss.2019.113157
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85072167273
SN - 0167-9236
VL - 128
JO - Decision Support Systems
JF - Decision Support Systems
M1 - 113157
ER -