stable
User Guide
Installation
About
Tutorials
API
Agents
A2C
DDPG
DQN
PPO1
VPG
TD3
SAC
Q-Learning
SARSA
Contextual Bandit
Multi-Armed Bandit
Environments
Core
Utilities
Trainers
Common
GenRL
Docs
»
Agents
Edit on GitHub
Agents
ΒΆ
Deep
A2C
DDPG
DQN
PPO1
VPG
TD3
SAC
Classical
Q-Learning
SARSA
Bandit
Contextual Bandit
Base
Bootstrap Neural
Fixed
Linear Posterior
Neural Greedy
Neural Linear Posterior
Neural Noise Sampling
Variational
Multi-Armed Bandit
Base
Bayesian Bandit
Bernoulli Bandit
Espilon Greedy
Gaussian
Gradient
Thmopson Sampling
Upper Confidence Bound
Read the Docs
v: stable
Versions
latest
stable
Downloads
pdf
html
epub
On Read the Docs
Project Home
Builds
Free document hosting provided by
Read the Docs
.