资源论文Automatic Discovery of the Statistical Types of Variables in a Dataset

Automatic Discovery of the Statistical Types of Variables in a Dataset

2020-03-10 | |  58 |   52 |   0

Abstract

A common practice in statistics and machine learning is to assume that the statistical data ty (e.g., ordinal, categorical or real-valued) of var ables, and usually also the likelihood model, is known. However, as the availability of realworld data increases, this assumption becomes too restrictive. Data are often heterogeneous, complex, and improperly or incompletely documented. Surprisingly, despite their practical importance, there is still a lack of tools to automatically discover the statistical types of, as well as appropriate likelihood (noise) models for, the variables in a dataset. In this paper, we fill this gap by proposing a Bayesian method, which accurately discovers the statistical data types in both synthetic and real data.

上一篇:Convergence Analysis of Proximal Gradient with Momentum for Nonconvex Optimization

下一篇:On The Projection Operator to A Three-view Cardinality Constrained Set

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...