资源算法 DeepRL_blackjack

DeepRL_blackjack

2020-02-13 | |  34 |   0 |   0

DeepRL_blackjack

Authors: Anja Koller, Omnia el Sadaany, Sebastian Gaeumann University Zurich

basic blackjack implementations with deep RL In this file we explain what you can find in which file and the sources we used for the code. We modified the code from the website for better readability and analysis. We solved Blackjack using Qlearning, SARSA and Qlearning with Neural networks (DeepQlearning). These are all Reinforcement learning techniques. The main code sources where:

  1. Q-learning: QLearning w Writer_Sebi

CodeSource --> https://github.com/Pradhyo/blackjack/blob/master/blackjack.ipynb

  1. SARSA: https://colab.research.google.com/drive/1mslsNMjxVXrrMjr-uMWQVDmdkGCRIErf#scrollTo=-vgAu6f8jM1x SARSA code can bei viewed from the collab link above

CodeSource --> https://github.com/Pradhyo/blackjack/blob/master/blackjack.ipynb (adaptions from that to SARSA)

  1. Deep Q-learning: DQN_blackjack_website_1511.py DQN_anja_1012 (same as above but with saving the dataframe of averagepayouts for the plots) Codesource --> https://github.com/ml874/Blackjack--Reinforcement-Learning/blob/master/Blackjack-%20DQN%20(Only%20Hit%20or%20Stand).ipynb

Additionally for performance comparison of the algorithms we implemented the normal and the random strategy in the following files, and one for plotting to have nice comparisons

  1. Normal Strategy: Basic Strategy_Sebi.py

  2. Random Strategy: Blackjack_randomStrategy_Anja.py

  3. Comparison plots (all Algorithms): Qlearning_plots_0612.py


上一篇:deeprl-rainbow

下一篇: GroupyDHT

用户评价
全部评价

热门资源

  • seetafaceJNI

    项目介绍 基于中科院seetaface2进行封装的JAVA...

  • spark-corenlp

    This package wraps Stanford CoreNLP annotators ...

  • Keras-ResNeXt

    Keras ResNeXt Implementation of ResNeXt models...

  • capsnet-with-caps...

    CapsNet with capsule-wise convolution Project ...

  • inferno-boilerplate

    This is a very basic boilerplate example for pe...