DEF

资源分类

DEF

2019-09-19 |

78 |

0 |

Deep exponential families

This is an implementation of deep exponential families in MXNet/Gluon. DEFs are described in https://arxiv.org/abs/1411.2581

I found it much easier to implement this in an imperative / dynamic graph library like mxnet than in autodifferentiation libraries that only support static computation graphs.

Currently the code only implements a point-mass distributions for the weights and biases of each layer in the DEF (these parameters are learned using variational expectation-maximization). It should be straightforward to extend this to other distributions.

The gradients are computed with either the score function estimator or the pathwise (reparameterization trick) estimator. For score function gradient estimators, we use the optimal control variate scaling described in black box variational inference.

The code takes lots of inspiration from the official deep exponential families codebase and the gluon examples in mxnet.

Example

Train a Poisson deep exponential family model on a large collection of science articles (in the LDA-C format):

PYTHONPATH=. python experiments/poisson_gaussian_deep_exp_fam_text.py

This periodically prints out the latent factors (dimensions of the latent variable), and the weight associated with each. For example, a dimension captures documents about DNA:

0.246   fig
-0.358  dna
-0.366  protein
-0.372  cells
-0.430  cell
-0.722  gene
-0.970  binding
-1.010  two
-1.026  sequence
-1.100  proteins

To train a Poisson deep exponential family model on the MNIST dataset:

PYTHONPATH=. python experiments/poisson_gaussian_deep_exp_fam_mnist.py

Also see examples in tests/ folder.

Requirements

Install requirements with anaconda:

conda env create -f environment.yml
source activate deep_exp_fam

Testing

Run PYTHONPATH=. pytest for unit tests and mypy $(find . -name '*.py') for static type-checking.

TODO:

figure out a cleaner way to do per-sample gradients -- bug tracker: https://github.com/apache/incubator-mxnet/issues/7987 (right now, parameters are repeated in deep_exp_fam.DeepExponentialFamilyModel class and require annoying processing)
add support for priors on the weights

上一篇：VQA

下一篇：anuvada

用户评价

全部评价

还没有评论，说两句吧！

热门资源

TensorFlow-Course

This repository aims to provide simple and read...
seetafaceJNI

项目介绍基于中科院seetaface2进行封装的JAVA...
mxnet_VanillaCNN

This is a mxnet implementation of the Vanilla C...
DuReader_QANet_BiDAF

Machine Reading Comprehension on DuReader Usin...
Klukshu-Sockeye-...

KLUKSHU SOCKEYE PROJECTS 2016 This repositor...

智能在线

400-630-6780
聆听.建议反馈

E-mail: support@tusaishared.com