实例介绍
【实例截图】
【核心代码】
卷 软件 的文件夹 PATH 列表
卷序列号为 0D75-0D0C
D:.
└─Reinforcement-learning-with-tensorflow
│ LICENCE
│ README.md
│ RL_cover.jpg
│
├─.idea
│ misc.xml
│ modules.xml
│ Reinforcement-learning-with-tensorflow.iml
│ vcs.xml
│ workspace.xml
│
├─contents
│ ├─10_A3C
│ │ │ A3C_continuous_action.py
│ │ │ A3C_discrete_action.py
│ │ │ A3C_distributed_tf.py
│ │ │ A3C_RNN.py
│ │ │
│ │ └─log
│ │ events.out.tfevents.1551167889.DESKTOP-OSFDGN5
│ │
│ ├─11_Dyna_Q
│ │ maze_env.py
│ │ RL_brain.py
│ │ run_this.py
│ │
│ ├─12_Proximal_Policy_Optimization
│ │ │ discrete_DPPO.py
│ │ │ DPPO.py
│ │ │ PPO.py
│ │ │ simply_PPO.py
│ │ │ simply_PPO1.py
│ │ │
│ │ └─log
│ │ events.out.tfevents.1551178953.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1551255571.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1553224232.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1553238324.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1553837229.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1554976756.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1556507113.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1556507732.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1556518602.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1556522855.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1557300964.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1557373659.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1557891088.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1557991550.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558410109.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558492818.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558493235.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558493411.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558523123.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558523249.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558523427.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558523705.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558523771.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558599329.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558601706.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558602205.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558602434.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558603717.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558604038.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558604207.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558604975.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558605865.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558606275.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558606399.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558607336.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558607454.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558607565.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558607756.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558607833.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558608089.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558608259.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558608468.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558608570.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558609087.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558609193.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558609349.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558609640.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558610084.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558610665.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558659163.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558659275.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558660352.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558661649.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558663823.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558664829.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558665334.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558667665.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558669203.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558672109.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1558673239.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1565602750.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1565837491.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1565837545.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1565838580.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1567677923.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1567738605.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1567738776.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1567738924.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1567738968.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1568098203.DESKTOP-OSFDGN5
│ │
│ ├─13_saras
│ │ saras.py
│ │ __init__.py
│ │
│ ├─1_command_line_reinforcement_learning
│ │ treasure_on_right.py
│ │
│ ├─2_Q_Learning_maze
│ │ maze_env.py
│ │ RL_brain.py
│ │ run_this.py
│ │
│ ├─3_Sarsa_maze
│ │ │ maze_env.py
│ │ │ RL_brain.py
│ │ │ run_this.py
│ │ │
│ │ └─__pycache__
│ │ maze_env.cpython-35.pyc
│ │ RL_brain.cpython-35.pyc
│ │
│ ├─4_Sarsa_lambda_maze
│ │ maze_env.py
│ │ RL_brain.py
│ │ run_this.py
│ │
│ ├─5.1_Double_DQN
│ │ RL_brain.py
│ │ run_Pendulum.py
│ │
│ ├─5.2_Prioritized_Replay_DQN
│ │ RL_brain.py
│ │ run_MountainCar.py
│ │
│ ├─5.3_Dueling_DQN
│ │ RL_brain.py
│ │ run_Pendulum.py
│ │
│ ├─5_Deep_Q_Network
│ │ DQN_modified.py
│ │ maze_env.py
│ │ RL_brain.py
│ │ run_this.py
│ │
│ ├─6_OpenAI_gym
│ │ │ RL_brain.py
│ │ │ run_CartPole.py
│ │ │ run_MountainCar.py
│ │ │
│ │ └─__pycache__
│ │ RL_brain.cpython-35.pyc
│ │
│ ├─7_Policy_gradient_softmax
│ │ RL_brain.py
│ │ run_CartPole.py
│ │ run_MountainCar.py
│ │
│ ├─8_Actor_Critic_Advantage
│ │ AC_CartPole.py
│ │ AC_continue_Pendulum.py
│ │
│ ├─9_Deep_Deterministic_Policy_Gradient_DDPG
│ │ │ DDPG.py
│ │ │ DDPG_update.py
│ │ │ DDPG_update2.py
│ │ │
│ │ └─logs
│ │ events.out.tfevents.1553588612.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1553664943.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1553664974.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1553665084.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1553665248.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1554974631.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1556422021.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1557133756.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1561971410.DESKTOP-OSFDGN5
│ │ events.out.tfevents.1561971451.DESKTOP-OSFDGN5
│ │
│ └─Curiosity_Model
│ Curiosity.png
│ Curiosity.py
│ Random_Network_Distillation.py
│
└─experiments
├─2D_car
│ car_env.py
│ collision.py
│ DDPG.py
│
├─Robot_arm
│ │ A3C.py
│ │ arm_env.py
│ │ DDPG.py
│ │ DPPO.py
│ │
│ └─__pycache__
│ arm_env.cpython-35.pyc
│
├─Solve_BipedalWalker
│ │ A3C.py
│ │ A3C_rnn.py
│ │ DDPG.py
│ │
│ └─log
│ events.out.tfevents.1490801027.Morvan
│
└─Solve_LunarLander
A3C.py
DuelingDQNPrioritizedReplay.py
run_LunarLander.py
标签:
小贴士
感谢您为本站写下的评论,您的评论对其它用户来说具有重要的参考价值,所以请认真填写。
- 类似“顶”、“沙发”之类没有营养的文字,对勤劳贡献的楼主来说是令人沮丧的反馈信息。
- 相信您也不想看到一排文字/表情墙,所以请不要反馈意义不大的重复字符,也请尽量不要纯表情的回复。
- 提问之前请再仔细看一遍楼主的说明,或许是您遗漏了。
- 请勿到处挖坑绊人、招贴广告。既占空间让人厌烦,又没人会搭理,于人于己都无利。
关于好例子网
本站旨在为广大IT学习爱好者提供一个非营利性互相学习交流分享平台。本站所有资源都可以被免费获取学习研究。本站资源来自网友分享,对搜索内容的合法性不具有预见性、识别性、控制性,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,平台无法对用户传输的作品、信息、内容的权属或合法性、安全性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论平台是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二与二十三条之规定,若资源存在侵权或相关问题请联系本站客服人员,点此联系我们。关于更多版权及免责申明参见 版权及免责申明
网友评论
我要评论