Code for the paper "Phasic Policy Gradient"
☆268Apr 2, 2023Updated 3 years ago
Alternatives and similar repositories for phasic-policy-gradient
Users that are interested in phasic-policy-gradient are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆55Feb 28, 2024Updated 2 years ago
- An implementation of PPO in Pytorch☆123Updated this week
- Code for the paper "Batch size invariance for policy optimization"☆60Apr 2, 2023Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Reinforcement Learning in PyTorch☆2,275Jan 4, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- RAD: Reinforcement Learning with Augmented Data☆419Mar 29, 2021Updated 5 years ago
- Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments☆1,157Mar 27, 2026Updated last month
- Vectorized interface for reinforcement learning environments☆147Mar 26, 2026Updated last month
- Deep Learning Project☆23Jan 18, 2020Updated 6 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆604Oct 28, 2020Updated 5 years ago
- Collection of reinforcement learning algorithms☆2,899Jun 17, 2024Updated last year
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆756Oct 26, 2022Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆256May 3, 2020Updated 6 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆932Dec 20, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Aug 17, 2022Updated 3 years ago
- DMControl Generalization Benchmark☆189Jan 3, 2024Updated 2 years ago
- ☆203Mar 25, 2023Updated 3 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆104Mar 24, 2023Updated 3 years ago
- Code for the paper "Evolved Policy Gradients"☆254Nov 22, 2018Updated 7 years ago
- DrQ: Data regularized Q☆422Jan 13, 2023Updated 3 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,425Nov 29, 2023Updated 2 years ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆182Apr 2, 2023Updated 3 years ago
- A collection of reference environments for offline reinforcement learning☆1,677Nov 18, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆544Nov 22, 2022Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆190May 17, 2022Updated 4 years ago
- DrQ-v2: Improved Data-Augmented Reinforcement Learning☆435May 31, 2022Updated 3 years ago
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,073Jul 14, 2023Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆475Jul 6, 2023Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 6 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆152Mar 19, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆399Jul 18, 2019Updated 6 years ago
- Repository for Iterated Relearning: The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning☆11Jun 8, 2020Updated 5 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆229May 19, 2024Updated 2 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆559Jun 26, 2023Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆408Dec 18, 2021Updated 4 years ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.☆872Aug 12, 2024Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆69Jul 29, 2021Updated 4 years ago