资源数据集conll2013

conll2013

2019-11-02 | |  136 |   0 |   0

This zip should contain 4 files: - README.txt (this file) - doc2Dep20MWU57k_1000concat2000.tab - doc2Dep20MWU57k_1000concat2000.txt - doc2Dep20MWU57k_1000concat2000.mat ****doc2Dep20MWU57k_1000concat2000.tab**** This file contains the 54975 word-units with POS tags. The order of the words in this file corresponds to the order of the rows in doc2Dep20MWU57k_1000concat2000.tab ****doc2Dep20MWU57k_1000concat2000.tab**** This tab-separated-value file contains the concatenated SVD matrices as created described in "Documents and Dependencies: an Exploration of Vector Space Models for Semantic Composition"(Fyshe 2013). The size of the matrix is 54975x2000. The first 1000 dimensions are Document dimensions, the second 1000 (1001-2000) are Dependency dimensions. The rows appear in the same order as the word-units in doc2Dep20MWU57k_1000concat2000.txt ****doc2Dep20MWU57k_1000concat2000.mat**** For convenience, this is the data contained in doc2Dep20MWU57k_1000concat2000.tab & doc2Dep20MWU57k_1000concat2000.txt saved into two matlab variables. count_matrix is the concatenated SVD matrices (tab file), words are the words (txt file). Questions may be directed to Alona Fyshe, afyshe at cs dot cmu dot edu.

上一篇:OpenMIIR-RawEEG 数据集

下一篇:Forex 历史中心各货币对外汇交易数据

用户评价
全部评价

热门资源

  • GRAZ 图像分类数据

    GRAZ 图像分类数据

  • MIT Cars 汽车图像...

    MIT Cars 汽车图像数据

  • 凶杀案报告数据

    凶杀案报告数据

  • 猫和狗图像分类数...

    Kaggle 上的竞赛数据,用以区分猫和狗两类对象,...

  • Bosch 流水线降低...

    数据来自产品在Bosch真实生产线上制造过程中的设备...