资源论文Improving Low-Resource Cross-lingual Document Retrieval by Reranking with Deep Bilingual Representations

Improving Low-Resource Cross-lingual Document Retrieval by Reranking with Deep Bilingual Representations

2019-09-19 | |  102 |   32 |   0 0 0
Abstract In this paper, we propose to boost lowresource cross-lingual document retrieval performance with deep bilingual query-document representations. We match queries and documents in both source and target languages with four components, each of which is implemented as a term interaction-based deep neural network with cross-lingual word embeddings as input. By including query likelihood scores as extra features, our model effectively learns to rerank the retrieved documents by using a small number of relevance labels for low-resource language pairs. Due to the shared cross-lingual word embedding space, the model can also be directly applied to another language pair without any training label. Experimental results on the MATERIAL dataset show that our model outperforms the competitive translation-based baselines on English-Swahili, English-Tagalog, and English-Somali cross-lingual information retrieval tasks.

上一篇:Generating Logical Forms from Graph Representations of Text and Entities

下一篇:Is Word Segmentation Necessary for Deep Learning of Chinese Representations?

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...