9781788834247-1788834240-Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more

Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more

ISBN-13: 9781788834247
ISBN-10: 1788834240
Author: Maxim Lapan
Publication date: 2018
Publisher: Packt Publishing
Format: Paperback 546 pages
FREE US shipping
Buy

From $39.99

Book details

ISBN-13: 9781788834247
ISBN-10: 1788834240
Author: Maxim Lapan
Publication date: 2018
Publisher: Packt Publishing
Format: Paperback 546 pages

Summary

Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more (ISBN-13: 9781788834247 and ISBN-10: 1788834240), written by authors Maxim Lapan, was published by Packt Publishing in 2018. With an overall rating of 3.5 stars, it's a notable title among other AI & Machine Learning (Algorithms, Programming, Computer Science) books. You can easily purchase or rent Deep Reinforcement Learning Hands-On: Apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more (Paperback) from BooksRun, along with many other new and used AI & Machine Learning books and textbooks. And, if you're looking to sell your copy, our current buyback offer is $2.53.

Description

Publisher's Note: This edition from 2018 is outdated and not compatible with any of the most recent updates to Python libraries. A new third edition, updated for 2020 with six new chapters that include multi-agent methods, discrete optimization, RL in robotics, and advanced exploration techniques is now available.

This practical guide will teach you how deep learning (DL) can be used to solve complex real-world problems.

Key Features
  • Explore deep reinforcement learning (RL), from the first principles to the latest algorithms
  • Evaluate high-profile RL methods, including value iteration, deep Q-networks, policy gradients, TRPO, PPO, DDPG, D4PG, evolution strategies and genetic algorithms
  • Keep up with the very latest industry developments, including AI-driven chatbots
Book Description

Deep Reinforcement Learning Hands-On is a comprehensive guide to the very latest DL tools and their limitations. You will evaluate methods including Cross-entropy and policy gradients, before applying them to real-world environments. Take on both the Atari set of virtual games and family favorites such as Connect4.

The book provides an introduction to the basics of RL, giving you the know-how to code intelligent learning agents to take on a formidable array of practical tasks. Discover how to implement Q-learning on 'grid world' environments, teach your agent to buy and trade stocks, and find out how natural language models are driving the boom in chatbots.

What you will learn
  • Understand the DL context of RL and implement complex DL models
  • Learn the foundation of RL: Markov decision processes
  • Evaluate RL methods including Cross-entropy, DQN, Actor-Critic, TRPO, PPO, DDPG, D4PG and others
  • Discover how to deal with discrete and continuous action spaces in various environments
  • Defeat Atari arcade games using the value iteration method
  • Create your own OpenAI Gym environment to train a stock trading agent
  • Teach your agent to play Connect4 using AlphaGo Zero
  • Explore the very latest deep RL research on topics including AI-driven chatbots
Who This Book Is For

Some fluency in Python is assumed. Basic deep learning (DL) approaches should be familiar to readers and some practical experience in DL will be helpful. This book is an introduction to deep reinforcement learning (RL) and requires no background in RL.

Table of Contents
  1. What is Reinforcement Learning?
  2. OpenAI Gym
  3. Deep Learning with PyTorch
  4. The Cross-Entropy Method
  5. Tabular Learning and the Bellman Equation
  6. Deep Q-Networks
  7. DQN Extensions
  8. Stocks Trading Using RL
  9. Policy Gradients – An Alternative
  10. The Actor-Critic Method
  11. Asynchronous Advantage Actor-Critic
  12. Chatbots Training with RL
  13. Web Navigation
  14. Continuous Action Space
  15. Trust Regions – TRPO, PPO, and ACKTR
  16. Black-Box Optimization in RL
  17. Beyond Model-Free – Imagination
  18. AlphaGo Zero
Rate this book Rate this book

We would LOVE it if you could help us and other readers by reviewing the book