A fast implementation of the isodata clustering algorithm

Nargess Memarsadeghi, David M. Mount, Nathan S. Netanyahu, Jacqueline Le Moigne

Research output: Contribution to journalArticlepeer-review

158 Scopus citations

Abstract

Clustering is central to many image processing and remote sensing applications. ISODATA is one of the most popular and widely used clustering methods in geoscience applications, but it can run slowly, particularly with large data sets. We present a more efficient approach to ISODATA clustering, which achieves better running times by storing the points in a kd-tree and through a modification of the way in which the algorithm estimates the dispersion of each cluster. We also present an approximate version of the algorithm which allows the user to further improve the running time, at the expense of lower fidelity in computing the nearest cluster center to each point. We provide both theoretical and empirical justification that our modified approach produces clusterings that are very similar to those produced by the standard ISODATA approach. We also provide empirical studies on both synthetic data and remotely sensed Landsat and MODIS images that show that our approach has significantly lower running times.

Original languageEnglish
Pages (from-to)71-103
Number of pages33
JournalInternational Journal of Computational Geometry and Applications
Volume17
Issue number1
DOIs
StatePublished - Feb 2007

Bibliographical note

Funding Information:
Clustering is central to many image processing and remote sensing applications, ISODATA is one of the most popular and widely used clustering methods in geoseienee applications, *A preliminary version of this paper appeared in Proceedings of the IEEE International Geoseienee and Remote Sensing Symposium (IGARSS'03), Toulouse, France, 2003, Vol. Ill, 2057-2059. tThe work of this author was supported by the National Science Foundation under grant CCR-0098151. tThe author was previously with the Applied Information Sciences Branch, NASA Goddard Space Flight Center.

Funding

Clustering is central to many image processing and remote sensing applications, ISODATA is one of the most popular and widely used clustering methods in geoseienee applications, *A preliminary version of this paper appeared in Proceedings of the IEEE International Geoseienee and Remote Sensing Symposium (IGARSS'03), Toulouse, France, 2003, Vol. Ill, 2057-2059. tThe work of this author was supported by the National Science Foundation under grant CCR-0098151. tThe author was previously with the Applied Information Sciences Branch, NASA Goddard Space Flight Center.

FundersFunder number
National Science FoundationCCR-0098151

    Keywords

    • Approximation
    • Clustering
    • Filtering algorithm
    • ISODATA
    • k-means
    • kd-trees

    Fingerprint

    Dive into the research topics of 'A fast implementation of the isodata clustering algorithm'. Together they form a unique fingerprint.

    Cite this