Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework

2019-12-31

Abstract

Learning multilingual representations of text has proven to be a successful method for many cross-lingual transfer learning tasks. There are two main paradigms for learning such representations: (1) alignment, which maps different independently trained monolingual representations into a shared space, and (2) joint training, which directly learns unified multilingual representations using monolingual and cross-lingual objectives jointly. In this paper, we first conduct direct comparisons of representations learned using both of these methods across diverse cross-lingual tasks. Our empirical results reveal a set of pros and cons for both methods, and show that the relative performance of alignment versus joint training is task-dependent. Stemming from this analysis, we propose a simple and novel framework that combines these two previously mutually exclusive approaches. Extensive experiments on various tasks demonstrate that our proposed framework alleviates limitations of both approaches, and outperforms existing methods on the MUSE bilingual lexicon induction (BLI) benchmark. We further show that our proposed framework can generalize to contextualized representations and achieves state-of-the-art results on the CoNLL cross-lingual NER benchmark.
