Visualizing Class Structure of Multidimensional Data

Inderjit Dhillon, Dharmendra Modha, W. Spangler

Abstract:   We consider the problem of visualizing multidimensional data that has been categorized into classes. Our goal in visualizing is to quickly absorb inter- and intra-class relationships. Towards this end, we introduce class-preserving projections of the multidimensional data onto two-dimensional planes which can then be displayed on a computer screen. These class-preserving projections maintain the high-dimensional class structure, and are closely related to Fisher’s linear discriminants. By displaying sequences of such two-dimensional projections and by moving continuously from one projection to the next, we can create illusions of smooth motion through a multidimensional display. Such sequences are termed class tours. We illustrate the proposed ideas by various computer simulations on the classical Iris plant dataset and a text corpus of book reviews.

Download: pdf

Citation

  • Visualizing Class Structure of Multidimensional Data (pdf, software)
    I. Dhillon, D. Modha, W. Spangler.
    Symposium on the Interface: Computing Science and Statistics, pp. 488-493, May 1998.

    Bibtex: