245 10 |a Deep reinforcement learning hands-on = |b 深度强化学习实践 / |c Maxim Lapan著.

505 0_ |a What is reinforcement learning? -- OpenAI Gym -- Deep learning with Py Torch -- The cross-entropy method -- Tabular learning and the bellman equation -- Deep Q-networks -- DQN extensions -- Stocks trading using RL -- Policy gradients : an alternative -- The actor-critic method -- Asynchronous advantage actor-critic -- Chatbots training with RL -- Web navigation -- Continuous action space -- Trust regions : TRPO, PPO, and ACKTR -- Black-box optimization in RL -- Beyond model-free : imagination -- AlphaGo Zero.

534 __ |p Reprint.Originally published: |c Birmingham : Packt Publishing Ltd., c2018. |z 9781788834247.