Abstract
Have you ever wanted to multiply an n × d matrix X, with n
d, on the left by an m × n matrix
of i.i.d. Gaussian random variables, but could not afford to do it because it was too slow? In this work we propose a new randomized m × n matrix T , for which one can compute T · X in only
time, for which the total variation distance between the distributions T · X and
· X is as small as desired, i.e., less than any positive constant. Here nnz(X) denotes the number of non-zero entries of X. Assuming
this is a significant savings over the na¨ıve O(nnz(X)m) time to compute
· X. Moreover, since the total variation distance is small, we can provably use T · X in place of
· X in any application and have the same guarantees as if we were using
· X, up to a small positive constant in erro probability. We apply this transform to nonnegative matrix factorization (NMF) and support vector machines (SVM).