samlobel / CFN
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
☆19Updated last year
Alternatives and similar repositories for CFN:
Users that are interested in CFN are comparing it to the libraries listed below
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated 11 months ago
- ☆41Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆68Updated 9 months ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- Conservative Q learning in Jax☆53Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆78Updated 4 months ago
- Simple maze environments using mujoco-py☆54Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆100Updated 10 months ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆17Updated last month
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- ☆18Updated 2 months ago
- ☆48Updated last year
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆40Updated last week
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 9 months ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆16Updated 10 months ago
- Skeleton for scalable and flexible Jax RL implementations☆74Updated last year
- ☆55Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 7 months ago
- ☆47Updated 2 years ago
- ☆17Updated last year
- ☆53Updated last year
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆30Updated 4 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆111Updated 3 years ago
- ☆15Updated last year
- ☆18Updated 2 years ago
- ☆23Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆101Updated 2 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 3 years ago
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)☆25Updated 5 months ago