This repository converts the TensorFlow code from the book Deep Reinforcement Learning with Python (Sudharsan Ravichandiran, 2nd edition) to PyTorch.
- Ubuntu 20.04
- CUDA 12.0
- Python 3.9
- PyTorch 1.12.0
- Gym
- Deep Q-Network (DQN)
- Policy Gradient (PG)
- Asynchronous Advantage Actor-Critic (A3C)
- Deep Deterministic Policy Gradient (DDPG)
- Soft Actor-Critic (SAC)
- Twin Delayed DDPG (TD3)
In progress:
- Trust Region Policy Optimization (TRPO)
- Proximal Policy Optimization (PPO)