资源论文Persistent Homology: An Introduction and a New Text Representation for Natural Language Processing

Persistent Homology: An Introduction and a New Text Representation for Natural Language Processing

2019-11-08 | |  53 |   42 |   0
Abstract Persistent homology is a mathematical tool from topological data analysis. It performs multi-scale analysis on a set of points and identi?es clusters, holes, and voids therein. These latter topological structures complement standard feature representations, making persistent homology an attractive feature extractor for arti?cial intelligence. Research on persistent homology for AI is in its infancy, and is currently hindered by two issues: the lack of an accessible introduction to AI researchers, and the paucity of applications. In response, the ?rst part of this paper presents a tutorial on persistent homology speci?cally aimed at a broader audience without sacri?cing mathematical rigor. The second part contains one of the ?rst applications of persistent homology to natural language processing. Speci?cally, our Similarity Filtration with Time Skeleton (SIFTS) algorithm identi?es holes that can be interpreted as semantic “tie-backs” in a text document, providing a new document structure representation. We illustrate our algorithm on documents ranging from nursery rhymes to novels, and on a corpus with child and adolescent writings.

上一篇:On Robust Estimation of High Dimensional Generalized Linear Models Eunho Yang Ambuj Tewari Pradeep Ravikumar

下一篇:Concept Learning for Cross-Domain Text Classi?cation: A General Probabilistic Framework

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...