资源论文MKPLS: Manifold Kernel Partial Least Squaresfor Lipreading and Speaker Identification

MKPLS: Manifold Kernel Partial Least Squaresfor Lipreading and Speaker Identification

2019-11-28 | |  55 |   39 |   0

Abstract Visual speech recognition is a challenging problem, due to confusion between visual speech features. The speaker identifification problem is usually coupled with speech recognition. Moreover, speaker identifification is important to several applications, such as automatic access control, biometrics, authentication, and personal privacy issues. In this paper, we propose a novel approach for lipreading and speaker identifification. We propose a new approach for manifold parameterization in a low-dimensional latent space, where each manifold is represented as a point in that space. We initially parameterize each instance manifold using a nonlinear mapping from a unifified manifold representation. We then factorize the parameter space using Kernel Partial Least Squares (KPLS) to achieve a low-dimension manifold latent space. We use two-way projections to achieve two manifold latent spaces, one for the speech content and one for the speaker. We apply our approach on two public databases: AVLetters and OuluVS. We show the results for three different settings of lipreading: speaker independent, speaker dependent, and speaker semi-dependent. Our approach outperforms for the speaker semi-dependent setting by at least 15% of the baseline, and competes in the other two settings.

上一篇:Multi-resolution Shape Analysis via Non-Euclidean Wavelets: Applications toMesh Segmentation and Surface Alignment Problems

下一篇:Correlation Filters for Object Alignment

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...