TY - JOUR
T1 - Processing of multichannel recordings for data-mining algorithms
AU - Shmiel, Oren
AU - Shmiel, Tomer
AU - Dagan, Yaron
AU - Teicher, Mina
PY - 2007/3
Y1 - 2007/3
N2 - Data Mining, or knowledge discovery, is the computer-assisted process of digging through and analyzing large quantity of data in order to extract meaningful knowledge. Data mining methods are used in many studies to identify phenomena quicker and better than human experts. One class of these methods was designed for dealing with time series data. However, when several channels of data are collected simultaneously, data mining algorithms encounter numerous difficulties since channels may be measured in different units, may be recorded at different sampling-rates, or may have completely different characteristics. Furthermore, as the size of these data increases, the amount of irrelevant data usually increases as well and the process becomes impractical. Hence, in such cases, the analyst must be capable of focusing on the informational parts while ignoring the noise data. These kinds of difficulties complicate the analysis of multichannel data as compared to the analysis of single-channel data. This paper presents a useful technique for preprocessing multi channel data. Our technique supplies tools for coping with all the above-mentioned difficulties, and prepares the data for further analysis (using common algorithms, especially from the data mining field). The paper is divided as follows. After the introduction (Section I) we describe the state of the art (Section II), follows by the main section - methodology (Section III) which is divided to four steps (3.2-3.5). The results are described in a separate section (Section IV). Then, a discussion and conclusions of the proposed methodology are given in (Sections V and VI). Acknowledgements and the references follow.
AB - Data Mining, or knowledge discovery, is the computer-assisted process of digging through and analyzing large quantity of data in order to extract meaningful knowledge. Data mining methods are used in many studies to identify phenomena quicker and better than human experts. One class of these methods was designed for dealing with time series data. However, when several channels of data are collected simultaneously, data mining algorithms encounter numerous difficulties since channels may be measured in different units, may be recorded at different sampling-rates, or may have completely different characteristics. Furthermore, as the size of these data increases, the amount of irrelevant data usually increases as well and the process becomes impractical. Hence, in such cases, the analyst must be capable of focusing on the informational parts while ignoring the noise data. These kinds of difficulties complicate the analysis of multichannel data as compared to the analysis of single-channel data. This paper presents a useful technique for preprocessing multi channel data. Our technique supplies tools for coping with all the above-mentioned difficulties, and prepares the data for further analysis (using common algorithms, especially from the data mining field). The paper is divided as follows. After the introduction (Section I) we describe the state of the art (Section II), follows by the main section - methodology (Section III) which is divided to four steps (3.2-3.5). The results are described in a separate section (Section IV). Then, a discussion and conclusions of the proposed methodology are given in (Sections V and VI). Acknowledgements and the references follow.
KW - Data mining
KW - Multi-channel
KW - Multichannel
KW - Multivariable
KW - Recordings
KW - Signal discretization
KW - Signal quantization
UR - http://www.scopus.com/inward/record.url?scp=33847735104&partnerID=8YFLogxK
U2 - 10.1109/tbme.2006.888826
DO - 10.1109/tbme.2006.888826
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 17355056
AN - SCOPUS:33847735104
SN - 0018-9294
VL - 54
SP - 444
EP - 453
JO - IEEE Transactions on Biomedical Engineering
JF - IEEE Transactions on Biomedical Engineering
IS - 3
M1 - 13
ER -