Tutorials¶
- Bandit Tutorials
- Classical
- Deep RL Tutorials
- Deep Reinforcement Learning Background
- Vanilla Policy Gradient
- Advantage Actor Critic
- Proximal Policy Optimization
- Deep Q-Networks (DQN)
- Double Deep Q-Network
- Dueling Deep Q-Network
- Deep Q Networks with Noisy Nets
- Prioritized Deep Q-Networks
- Deep Deterministic Policy Gradients
- Twin Delayed DDPG
- Soft Actor-Critic
- Categorical Deep Q-Networks
- Custom Policy Networks
- Using A2C
- Using Shared Parameters in Actor Critic Agents in GenRL
- Vanilla Policy Gradient (VPG)
- Saving and Loading Weights and Hyperparameters with GenRL