pysc2-RLagents
Notes and scripts for SC2LE released by DeepMind and Blizzard, more details here.
Python library for SC2 API Protocol
Siraj's Youtube tutorial and accompanying code
Steven's Medium articles for a simple scripted agent and one based on Q-tables
pekaalto's work on adapting OpenAI's gym environment to SC2LE and an implementation of the FullyConv algorithm plus results on three minigames
Arthur Juliani's posts and repo for RL agents
Not SC2LE but mentioned here because my agent script was built on Juliani's A3C implementation.
Let me know if anyone else is also working on this and I'll add a link here!
Contains general notes on working with SC2LE.
The entire unfiltered action space for an SC2LE agent.
It contains 524 base actions / functions with 101938719 possible actions given a minimap_resolution of (64, 64) and screen_resolution of (84, 84).
The entire list of action argument types for use in the actions / functions.
It contains 13 argument types with descriptions.
Notes on running an agent in the pysc2.env.sc2_env.SC2Env environment. In particular, showing details and brief descriptions of the TimeStep object (observation) fed to the step function of an agent or returned from calling the step function of an environment.
Contains notes on developing RL agents for SC2LE.
Contains scripts for training and running RL agents in SC2LE.
PySC2_A3C_FullyConv.py
This script implements the A3C algorithm with the FullyConv architecture described in DeepMind's paper, for SC2LE. The code is based on Arthur Juliani's A3C implementation for the VizDoom environment (see above).
To run the script, use the following command:
python PySC2_A3C_FullyConv.py --map_name MoveToBeacon
If --map_name
is not supplied, the script runs DefeatRoaches by default.
PySC2_A3C_AtariNet.py
This script implements the A3C algorithm with the Atari-net architecture described in DeepMind's paper, for SC2LE. The code is based on Arthur Juliani's A3C implementation for the VizDoom environment (see above).
This is a generalized version of PySC2_A3C_old.py that works for all minigames and also contains some bug fixes.
To run the script, use the following command:
python PySC2_A3C_AtariNet.py --map_name MoveToBeacon
If --map_name
is not supplied, the script runs DefeatRoaches by default.
PySC2_A3C_old.py
python PySC2_A3C_old.py
.I initially focused on the DefeatRoaches minigame and so I only took in 7 screen features and 3 nonspatial features for the state space and the action space is limited to 17 base actions and their relevant arguments.
For the action space, I modeled the base actions and arguments independently. In addition, I also model x and y coordinates independently for spatial arguments, to further reduce the effective action space.
The agent currently samples the distributions returned from the policy networks for the actions taken, instead of an epsilon-greedy.
Also, the policy networks for the arguments are updated irregardless of whether the argument was used (eg. even if a no_op action is taken, the argument policies are still updated), which should probably be corrected.
Will be updating this to work with all the minigames.
As of ~10 million steps on DefeatRoaches, the agent achieved max and average scores of 338 and 65, compared to DeepMind's Atari-net agent that achieved max and average scores of 351 and 101 after 600 million steps.
上一篇:sc2aibot
下一篇:DeepDreamAnim
还没有评论,说两句吧!
热门资源
seetafaceJNI
项目介绍 基于中科院seetaface2进行封装的JAVA...
spark-corenlp
This package wraps Stanford CoreNLP annotators ...
Keras-ResNeXt
Keras ResNeXt Implementation of ResNeXt models...
capsnet-with-caps...
CapsNet with capsule-wise convolution Project ...
shih-styletransfer
shih-styletransfer Code from Style Transfer ...
智能在线
400-630-6780
聆听.建议反馈
E-mail: support@tusaishared.com