Implementation of Nested Named Entity Recognition

Some files are part of NeuroNLP2.

Requirements

We tested this library with the following libraries:

Python (3.7)
PyTorch (1.3.0)
Numpy (1.17.3)
AdaBound (0.0.5)
StanfordNLP (0.2.0) for accessing the Java Stanford CoreNLP Server (3.9.2)
Transformers (2.1.1)

Running experiments

Testing this library with a sample data

Run the gen_data.py to generate the processed data files for training, and they will be placed at the "./data/" directory
```
python gen_data.py
```
Run the train.py to start training
```
python train.py
```

Reproducing our experiment on the ACE-2004 dataset

Put the corpus ACE-2004 into the "../ACE2004/" directory
Put this .tgz file into the "../" and extract it
Run the parse_ace2004.py to extract sentences for training, and they will be placed at the "./data/ace2004/"
```
python parse_ace2004.py
```
Run the gen_data_for_ace2004.py to prepare the processed data files for training, and they will be placed at the "./data/" directory
```
python gen_data_for_ace2004.py
```
Run the train.py to start training
```
python train.py
```

Reproducing our experiment on the ACE-2005 dataset

Put the corpus ACE-2005 into the "../ACE2005/" directory
Put this .tgz file into the "../" and extract it
Run the parse_ace2005.py to extract sentences for training, and they will be placed at the "./data/ace2005/"
```
python parse_ace2005.py
```
Run the gen_data_for_ace2005.py to prepare the processed data files for training, and they will be placed at the "./data/" directory
```
python gen_data_for_ace2005.py
```
Run the train.py to start training
```
python train.py
```

Reproducing our experiment on the GENIA dataset

Put the corpus GENIA into the "../GENIA/" directory
Run the parse_genia.py to extract sentences for training, and they will be placed at the "./data/genia/"
```
python parse_genia.py
```
Run the gen_data_for_genia.py to prepare the processed data files for training, and they will be placed at the "./data/" directory
```
python gen_data_for_genia.py
```
Run the train.py to start training
```
python train.py
```

Configuration

Configurations of the model and training are in config.py

Citation

Please cite our arXiv paper:

@article{shibuya2019nested,
  title={Nested Named Entity Recognition via Second-best Sequence Learning and Decoding},
  author={Shibuya, Takashi and Hovy, Eduard},
  journal={arXiv preprint arXiv:1909.02250},
  year={2019}
}

上一篇：BERT-NER-CLI

下一篇：Chinese-NER-With-Bert

用户评价

全部评价

还没有评论，说两句吧！

热门资源

TensorFlow-Course

This repository aims to provide simple and read...
seetafaceJNI

项目介绍基于中科院seetaface2进行封装的JAVA...
mxnet_VanillaCNN

This is a mxnet implementation of the Vanilla C...
tensorflow-sketch...

Discrlaimer: This is not an official Google pro...
vsepp_tensorflow

Improving Visual-Semantic Embeddings with Hard ...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com