资源论文Large-Scale Multi-Label Text Classification on EU Legislation

Large-Scale Multi-Label Text Classification on EU Legislation

2019-09-24 | |  87 |   38 |   0

 Abstract We consider Large-Scale Multi-Label Text Classifification (LMTC) in the legal domain. We release a new dataset of 57k legislative documents from EUR-LEX, annotated with 4.3k EUROVOC labels, which is suitable for LMTC, few- and zero-shot learning. Experimenting with several neural classififiers, we show that BIGRUs with label-wise attention perform better than other current state of the art methods. Domain-specifific WORD2VEC and context-sensitive ELMO embeddings further improve performance. We also fifind that considering only particular zones of the documents is suffificient. This allows us to bypass BERT’s maximum text length limit and fifinetune BERT, obtaining the best results in all but zero-shot learning cases

上一篇:Incorporating Priors with Feature Attribution on Text Classification

下一篇:Multi-Level Matching and Aggregation Network for Few-Shot Relation Classification

用户评价
全部评价

热门资源

  • The Variational S...

    Unlike traditional images which do not offer in...

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...