A Generalized Maximum Entropy Approach to Bregman Co-clustering and Matrix Approximations

Abstract: An important task in unsupervised learning is maximum likelihood mixture estimation (MLME) for exponential families. In this paper, we prove a mathematical equivalence between this MLME problem and the rate distortion problem for Bregman divergences. We also present new theoretical results in rate distortion theory for Bregman divergences. Further, an analysis of the problems as a trade-off between compression and preservation of information is presented that yields the information bottleneck method as an interesting special case.

Download: pdf

Citation

A Generalized Maximum Entropy Approach to Bregman Co-clustering and Matrix Approximations (pdf, software)
A. Banerjee, I. Dhillon, J. Ghosh, S. Merugu.
In International Conference on Machine Learning (ICML), pp. 57-64, July 2004.

Bibtex:
@inproceedings{banerjee2004agenerali, author = "Arindam Banerjee AND Inderjit S. Dhillon AND Joydeep Ghosh AND Srujana Merugu", title = "A Generalized Maximum Entropy Approach to Bregman Co-clustering and Matrix Approximations", booktitle = "International Conference on Machine Learning (ICML)", page = "57–64", year = "2004", month = "jul", abstract = "An important task in unsupervised learning is maximum likelihood mixture estimation (MLME) for exponential families. In this paper, we prove a mathematical equivalence between this MLME problem and the rate distortion problem for Bregman divergences. We also present new theoretical results in rate distortion theory for Bregman divergences. Further, an analysis of the problems as a trade-off between compression and preservation of information is presented that yields the information bottleneck method as an interesting special case." }

Center for Big Data Analytics

A Generalized Maximum Entropy Approach to Bregman Co-clustering and Matrix Approximations

Arindam Banerjee, Inderjit Dhillon, Joydeep Ghosh, Srujana Merugu

Download: pdf

Citation