资源论文TOWARDS AMORTIZED RANKING -C RITICALT RAINING FOR COLLABORATIVE FILTERING

TOWARDS AMORTIZED RANKING -C RITICALT RAINING FOR COLLABORATIVE FILTERING

2020-01-02 | |  56 |   40 |   0

Abstract

We investigate new methods for training collaborative filtering models based on actor-critic reinforcement learning, to more directly maximize ranking-based objective functions. Specifically, we train a critic network to approximate ranking-based metrics, and then update the actor network to directly optimize against the learned metrics. In contrast to traditional learning-to-rank methods that require re-running the optimization procedure for new lists, our critic-based method amortizes the scoring process with a neural network, and can directly provide the (approximate) ranking scores for new lists. We demonstrate the actor-critic’s ability to significantly improve the performance of a variety of prediction models, and achieve better or comparable performance to a variety of strong baselines on three large-scale datasets.

上一篇:ON SOLVING MINIMAX OPTIMIZATION LOCALLY:A FOLLOW- THE -R IDGE APPROACH

下一篇:OPTIMAL STRATEGIESAGAINST GENERATIVE ATTACKS

用户评价
全部评价

热门资源

  • Learning to Predi...

    Much of model-based reinforcement learning invo...

  • Stratified Strate...

    In this paper we introduce Stratified Strategy ...

  • The Variational S...

    Unlike traditional images which do not offer in...

  • A Mathematical Mo...

    Direct democracy, where each voter casts one vo...

  • Rating-Boosted La...

    The performance of a recommendation system reli...