hishamcse
/

RND-SuperMarioBros-v0

Reinforcement Learning

SuperMarioBros-v0

custom-implementation

Model card Files Files and versions Community

hishamcse commited on Jul 28

Commit

3f83bfd

•

1 Parent(s): 4b657b6

Update README.md

Files changed (1) hide show

README.md +37 -27

README.md CHANGED Viewed

@@ -23,32 +23,42 @@ model-index:
       verified: false
 ---
-  # **RND with CNN** Agent playing **SuperMarioBros-v0**
-  This is a trained model of a **RND-CNN** agent playing **SuperMarioBros-v0** .
-  To learn to use this model and train yours check this notebook on kaggle: https://www.kaggle.com/code/syedjarullahhisham/drl-extra-personal-unit-5-rnd-super-mario-bros
-  # HyperParameters:
-  ```
-  "trainmethod": "RND",
-  "envid": "SuperMarioBros-v0",
-  "maxstepperepisode": 18000,
-  "learningrate": 0.0001,
-  "numenv": 128,
-  "numstep": 128,
-  "gamma": 0.999,
-  "intgamma": 0.99,
-  "lambda": 0.95,
-  "usegae": true,
-  "clipgradnorm": 0.5,
-  "entropy": 0.001,
-  "epoch": 4,
-  "minibatch": 4,
-  "ppoeps": 0.1,
-  "extcoef": 5.0,
-  "intcoef": 1.0,
-  "stickyaction": true,
-  "actionprob": 0.25,
-  "lifedone": false,
-  "obsnormstep": 50
-  ```

       verified: false
 ---
+# **RND with CNN** Agent playing **SuperMarioBros-v0**
+This is a trained model of a **RND-CNN** agent playing **SuperMarioBros-v0** .
+To learn to use this model and train yours check this notebook on kaggle: https://www.kaggle.com/code/syedjarullahhisham/drl-extra-personal-unit-5-rnd-super-mario-bros
+## Codes
+Github repos(Give a star if found useful):
+  * https://github.com/hishamcse/DRL-Renegades-Game-Bots
+  * https://github.com/hishamcse/Advanced-DRL-Renegades-Game-Bots
+Kaggle Notebook:
+  * https://www.kaggle.com/code/syedjarullahhisham/drl-extra-personal-unit-5-rnd-super-mario-bros
+  * https://www.kaggle.com/code/syedjarullahhisham/drl-extra-personal-unit-5-rnd-montezuma-mario-bros
+# HyperParameters:
+```
+"trainmethod": "RND",
+"envid": "SuperMarioBros-v0",
+"maxstepperepisode": 18000,
+"learningrate": 0.0001,
+"numenv": 128,
+"numstep": 128,
+"gamma": 0.999,
+"intgamma": 0.99,
+"lambda": 0.95,
+"usegae": true,
+"clipgradnorm": 0.5,
+"entropy": 0.001,
+"epoch": 4,
+"minibatch": 4,
+"ppoeps": 0.1,
+"extcoef": 5.0,
+"intcoef": 1.0,
+"stickyaction": true,
+"actionprob": 0.25,
+"lifedone": false,
+"obsnormstep": 50
+```