Abstract
Matrix and tensor factorization methods are often used
for finding underlying low-dimensional patterns from noisy
data. In this paper, we study non-linear tensor factorization methods based on deep variational autoencoders. Our
approach is well-suited for settings where the relationship
between the latent representation to be learned and the raw
data representation is highly complex. We apply our approach to a large dataset of facial expressions of moviewatching audiences (over 16 million faces). Our experiments show that compared to conventional linear factorization methods, our method achieves better reconstruction of
the data, and further discovers interpretable latent factors.