On a Zero-Finding Problem involving the Matrix Exponential

Abstract: An important step in the solution of a matrix nearness problem that arises in certain machine learning applications is finding the zero of f(alpha) = z^T exp(logX + alpha zz^T )z − b. The matrix valued exponential and logarithm in f(alpha) arises from the use of the von Neumann matrix divergence tr(X logX − X log Y − X + Y ) to measure the 6 nearness between the positive definite matrices X and Y . A key step of an iterative algorithm used to solve the 7 underlying matrix nearness problem requires the zero of f(alpha) to be repeatedly computed. In this paper we propose 8 zero-finding algorithms that gain their advantage by exploiting the special structure of the objective function. We 9 show how to efficiently compute the derivative of f, thereby allowing the use of Newton-type methods. In numerical 10 experiments we establish the advantage of our algorithms.

Download: pdf

Citation

On a Zero-Finding Problem involving the Matrix Exponential (pdf, software)
M. Sustik, I. Dhillon.
SIAM Journal of Matrix Analysis and Applications (SIMAX) 33(4), pp. 1237-1249, 2012.

Bibtex:
@article{sustik2012onazero, author = "Mátyás Sustik AND Inderjit S. Dhillon", title = "On a Zero-Finding Problem involving the Matrix Exponential", journal = "SIAM Journal of Matrix Analysis and Applications (SIMAX)", page = "1237–1249", volume = "33", issue = "4", year = "2012", abstract = "An important step in the solution of a matrix nearness problem that arises in certain machine learning applications is finding the zero of f(alpha) = z^T exp(logX + alpha zz^T )z − b. The matrix valued exponential and logarithm in f(alpha) arises from the use of the von Neumann matrix divergence tr(X logX − X log Y − X + Y ) to measure the 6 nearness between the positive definite matrices X and Y . A key step of an iterative algorithm used to solve the 7 underlying matrix nearness problem requires the zero of f(alpha) to be repeatedly computed. In this paper we propose 8 zero-finding algorithms that gain their advantage by exploiting the special structure of the objective function. We 9 show how to efficiently compute the derivative of f, thereby allowing the use of Newton-type methods. In numerical 10 experiments we establish the advantage of our algorithms." }

Center for Big Data Analytics

On a Zero-Finding Problem involving the Matrix Exponential

Mátyás Sustik, Inderjit Dhillon

Download: pdf

Citation