NoSavedDATA / PyTorch-BBF-Bigger-Better-Faster-Atari-100k
☆12 Updated this week
Related projects
Alternatives and complementary repositories for PyTorch-BBF-Bigger-Better-Faster-Atari-100k
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch) ☆25 Updated last month
- Official implementation of the BRO algorithm ☆10 Updated 3 weeks ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆95 Updated 5 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆18 Updated 10 months ago
- Transformer-based World Models ☆71 Updated last year
- PyTorch version of Dreamer, which follows the original TF v2 code. ☆114 Updated 2 years ago
- Conservative Q Learning on top of SAC ☆120 Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations ☆63 Updated last year
- Deep Hierarchical Planning from Pixels ☆90 Updated last year
- A PyTorch implementation of Implicit Q-Learning ☆66 Updated 3 years ago
- OGBench: Benchmarking Offline Goal-Conditioned RL ☆79 Updated 3 weeks ago
- Conservative Q learning in Jax ☆51 Updated last year
- Synthetic Experience Replay ☆74 Updated 5 months ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆116 Updated last year
- Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning" ☆78 Updated 3 months ago
- Representation Learning for RL ☆119 Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆99 Updated 2 years ago
- Learning diverse options through the Laplacian representation. ☆22 Updated 10 months ago
- Simple maze environments using mujoco-py ☆52 Updated 10 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆21 Updated last year
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning. ☆12 Updated 5 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆57 Updated 5 months ago
- Prioritized Experience Replay implementation with proportional prioritization ☆69 Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search ☆107 Updated last year
- Benchmarking RL generalization in an interpretable way. ☆132 Updated 9 months ago