The power of absolute discounting: all-dimensional distribution estimation


2020-02-10


Abstract

Categorical models are a natural fit for many problems. When learning the distribution of categories from samples, high dimensionality may dilute the data. Minimax optimality is too pessimistic to remedy this issue. A serendipitously discovered estimator, absolute discounting, corrects empirical frequencies by subtracting a constant from observed categories, which it then redistributes among the unobserved. It outperforms classical estimators empirically, and has been used extensively in natural language modeling. In this paper, we rigorously explain the prowess of this estimator using less pessimistic notions. We show that (1) absolute discounting recovers classical minimax KL-risk rates, (2) it is adaptive to an effective dimension rather than the true dimension, and (3) it is strongly related to the Good–Turing estimator and inherits its competitive properties. We use power-law distributions as the cornerstone of these results. We validate the theory via synthetic data and an application to the Global Terrorism Database.
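
To make the description of absolute discounting concrete, the following is a minimal sketch in Python, not the authors' implementation. It subtracts a constant `discount` from the count of every observed category and splits the freed probability mass evenly among the unobserved ones. The function name, the default `discount=0.5`, the even split, and the fallback to empirical frequencies when every category has been observed are all assumptions made for illustration.

```python
from collections import Counter


def absolute_discounting(samples, categories, discount=0.5):
    """Estimate a categorical distribution with absolute discounting.

    Hypothetical helper for illustration only. `discount` must lie
    strictly between 0 and 1 so that every observed category keeps
    positive probability.
    """
    if not 0.0 < discount < 1.0:
        raise ValueError("discount must be in (0, 1)")
    if len(set(categories)) != len(categories):
        raise ValueError("categories must be unique")

    counts = Counter(samples)
    unknown = set(counts) - set(categories)
    if unknown:
        raise ValueError(f"samples outside the category set: {unknown}")

    n = len(samples)
    if n == 0:
        raise ValueError("at least one sample is required")

    observed = [c for c in categories if counts[c] > 0]
    unseen = [c for c in categories if counts[c] == 0]

    # Assumption: with nothing unobserved there is nowhere to move mass,
    # so fall back to the plain empirical frequencies.
    if not unseen:
        return {c: counts[c] / n for c in categories}

    # Each observed category gives up discount / n of probability mass.
    freed_mass = discount * len(observed) / n
    unseen_share = freed_mass / len(unseen)

    return {
        c: (counts[c] - discount) / n if counts[c] > 0 else unseen_share
        for c in categories
    }


if __name__ == "__main__":
    estimate = absolute_discounting(["a", "a", "a", "b"], ["a", "b", "c", "d"])
    print(estimate)
```

With four samples over four categories and a discount of 0.5, the two observed categories release a total mass of 0.25, so `a` receives 0.625 and `b`, `c`, and `d` each receive 0.125. The estimate still sums to one, and unseen categories no longer have zero probability, which keeps the KL divergence to the true distribution finite.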
