Abstract
The use of data mining has led to many significant medical discoveries. However, many challenges remain in using these methods for knowledge discovery in this field, because the large amounts of data that medical practitioners collect often create a curse of dimensionality. To address this challenge, attribute selection approaches have been developed. However, current approaches typically put equal weight on all values within an attribute. We claim that, especially within medical domains, these approaches might miss attributes in which only a small subset of attribute values contains a strong indication for one of the target values, and which should therefore still be selected. To quantify this idea, we present MIAT, an algorithm that defines Minority Interesting Attribute Thresholds to find these important attribute values. As we developed MIAT to help better diagnose upper gastrointestinal cancer, we present how we use the attributes selected through this approach to build a predictive model for this cancer. To demonstrate MIAT's generality, we also applied it to a canonical Hungarian Heart Disease Dataset. In both datasets we found that MIAT yields significantly better accuracy and sensitivity than traditional attribute selection approaches.
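The abstract does not state how the Minority Interesting Attribute Thresholds are defined, so the following is only an illustrative sketch of the general idea (flagging attribute values that are rare yet strongly associated with one target value), not the authors' algorithm. The function name, the three thresholds (`min_count`, `min_rate`, `max_share`), and the toy data are all assumptions made for this example.

```python
from collections import Counter


def minority_indicative_values(rows, attribute, target, positive,
                               min_count=3, min_rate=0.8, max_share=0.2):
    """Return {value: (count, positive_rate)} for attribute values that are
    rare (share <= max_share), sufficiently supported (count >= min_count),
    and strongly tied to the positive target (rate >= min_rate).

    All thresholds are hypothetical; they are not taken from the paper.
    """
    totals = Counter(row[attribute] for row in rows)
    positives = Counter(row[attribute] for row in rows
                        if row[target] == positive)
    n = len(rows)
    flagged = {}
    for value, count in totals.items():
        rate = positives[value] / count   # fraction of positives for this value
        share = count / n                 # how common this value is overall
        if count >= min_count and share <= max_share and rate >= min_rate:
            flagged[value] = (count, rate)
    return flagged


# Invented toy data: "severe" is rare but almost always positive, while the
# two common values carry little signal, so an attribute-level score that
# weights all values equally could rank this attribute low.
rows = (
    [{"symptom": "none", "cancer": "no"}] * 40
    + [{"symptom": "none", "cancer": "yes"}] * 6
    + [{"symptom": "mild", "cancer": "no"}] * 40
    + [{"symptom": "mild", "cancer": "yes"}] * 6
    + [{"symptom": "severe", "cancer": "yes"}] * 7
    + [{"symptom": "severe", "cancer": "no"}] * 1
)

result = minority_indicative_values(rows, "symptom", "cancer", "yes")
print(result)  # {'severe': (8, 0.875)}
```

In this toy setting only `severe` is flagged: it covers 8 of 100 records, and 7 of those 8 are positive. An attribute with at least one flagged value would then be a candidate for selection even if its overall score is unremarkable.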
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015 |
| Editors | Eric Gaussier, Longbing Cao, Patrick Gallinari, James Kwok, Gabriella Pasi, Osmar Zaiane |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781467382731 |
| State | Published - 2 Dec 2015 |
| Externally published | Yes |
| Event | 2nd IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015, Paris, France (19 Oct 2015 → 21 Oct 2015) |
Publication series
| Name | Proceedings of the 2015 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015 |
|---|---|
Conference
| Conference | 2nd IEEE International Conference on Data Science and Advanced Analytics, DSAA 2015 |
|---|---|
| Country/Territory | France |
| City | Paris |
| Period | 19/10/15 → 21/10/15 |
Bibliographical note
Publisher Copyright: © 2015 IEEE.
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs):
- SDG 3: Good Health and Well-being
Title: MIAT: A novel attribute selection approach to better predict upper gastrointestinal cancer