资源论文Encouraging Paragraph Embeddings to Remember Sentence Identity Improves Classification

Encouraging Paragraph Embeddings to Remember Sentence Identity Improves Classification

2019-09-24 | |  88 |   41 |   0

 Abstract While paragraph embedding models are remarkably effective for downstream classifification tasks, what they learn and encode into a single vector remains opaque. In this paper, we investigate a state-of-the-art paragraph embedding method proposed by Zhang et al. (2017) and discover that it cannot reliably tell whether a given sentence occurs in the input paragraph or not. We formulate a sentence content task to probe for this basic linguistic property and fifind that even a much simpler bag-of-words method has no trouble solving it. This result motivates us to replace the reconstructionbased objective of Zhang et al. (2017) with our sentence content probe objective in a semisupervised setting. Despite its simplicity, our objective improves over paragraph reconstruction in terms of (1) downstream classifification accuracies on benchmark datasets, (2) faster training, and (3) better generalization ability

上一篇:Combating Adversarial Misspellings with Robust Word Recognition

下一篇:Figurative Usage Detection of Symptom Words to Improve Personal Health Mention Detection

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...