Skip to main content
Back to top
Ctrl
+
K
Contents
Getting Started
Examples
Tutorial 1: Step-by-Step AC Implementation
Tutorial 2: Build Your Own Model
Example 1: Adapting SAC to DRND
Example 2: Adapting SAC to REDQ
Example 3: Uncertainty-Aware SAC with Bayesian Neural Networks
API
Agents
Config
Experiments
Loggers
Models
Basic
Actor
Critic and CriticEnsemble
ActorCritic
Ensemble
Custom Loss Functions
Deep Deterministic Policy Gradient (DDPG)
Distributional Random Network Distillation (DRND)
Distributional Soft Actor Critic (DSAC)
Optimistic Actor-Critic (OAC)
Deep Exploration with PAC-Bayes (PBAC)
Proximal Policy Optimization (PPO)
Randomized Ensembled Double Q-Learning (REDQ)
Soft Actor Critic (SAC)
Twin Delayed DDPG (TD3)
Nets
Layer Module
Heads
Bayesian Layers
Actor Networks
Critic Networks
Replay Buffers
Utils
Environment Module
DMC Wrappers
MetaWorld Wrappers
Noisy Wrappers
Reward Wrappers
Custom Activation Functions
Harvest Utilities
Make Environment Utility
Network Utilities
Utility Functions
Search
Error
Please activate JavaScript to enable the search functionality.
Ctrl
+
K