此资源内容只作交流和学习使用,请勿侵犯他人的知识产权,本站不储存、复制任何文件,所有文件均来自网络。
感谢您对本站的支持。
032 Coding the Double Q Learning Agent and Analyzing Performance.mp4 58.28 MB mp4
032 Coding the Double Q Learning Agent and Analyzing Performance.en.srt 9.64 KB srt
031 Analyzing the Paper.mp4 182.66 MB mp4
031 Analyzing the Paper.en.srt 23.64 KB srt
代码 https://www.aliyundrive.com/s/CQtQSsPHcii/folder/641e9c416403792734604d3fbb91efbd7aeff560 folder38:策略梯度PG_同一个回合中不同的action回溯不同的TotalReward_代码实战.mp4 30.66 MB mp437:策略梯度PG_对TotalReward进行均值归一化.mp4 29.71 MB mp436:代码实战_策略梯度PG选择行为和参数训练.mp4 32.87 MB mp435:代码实战_策略梯度PG网络构建.mp4 28.63 MB mp434:代码实战_策略梯度PG和CartPole交互.mp4 44.45 MB mp433:策略梯度PG_讲解CartPole环境.mp4 31.95 MB mp432:策略梯度PG_总结整体流程_对比交叉熵损失函数求导.mp4 30.01 MB mp431:策略梯度PG_简化导函数的公式推导.mp4 33.34 MB mp430:策略梯度PG_明确目标函数和导函数.mp4 33.85 MB mp429:策略梯度...
Hands-On-Reinforcement-Learning-with-Python-master.zip - 18.20MBHands-On Reinforcement Learning - Sudharsan Ravichandiran.pdf - 42.81MB