Encouraging Paragraph Embeddings to Remember Sentence Identity Improves Classification

资源分类

2019-09-24 |

88 |

41 |

Abstract While paragraph embedding models are remarkably effective for downstream classifification tasks, what they learn and encode into a single vector remains opaque. In this paper, we investigate a state-of-the-art paragraph embedding method proposed by Zhang et al. (2017) and discover that it cannot reliably tell whether a given sentence occurs in the input paragraph or not. We formulate a sentence content task to probe for this basic linguistic property and fifind that even a much simpler bag-of-words method has no trouble solving it. This result motivates us to replace the reconstructionbased objective of Zhang et al. (2017) with our sentence content probe objective in a semisupervised setting. Despite its simplicity, our objective improves over paragraph reconstruction in terms of (1) downstream classifification accuracies on benchmark datasets, (2) faster training, and (3) better generalization ability

上一篇：Combating Adversarial Misspellings with Robust Word Recognition

下一篇：Figurative Usage Detection of Symptom Words to Improve Personal Health Mention Detection

用户评价

全部评价

还没有评论，说两句吧！

热门资源

The Variational S...

Unlike traditional images which do not offer in...
Learning to Predi...

Much of model-based reinforcement learning invo...
Stratified Strate...

In this paper we introduce Stratified Strategy ...
A Mathematical Mo...

Direct democracy, where each voter casts one vo...
Rating-Boosted La...

The performance of a recommendation system reli...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com