NoSavedDATA / PyTorch-BBF-Bigger-Better-Faster-Atari-100k
☆16 Updated 5 months ago
Alternatives and similar repositories for PyTorch-BBF-Bigger-Better-Faster-Atari-100k:
Users who are interested in PyTorch-BBF-Bigger-Better-Faster-Atari-100k are comparing it to the libraries listed below.
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch) ☆25 Updated 6 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆100 Updated 11 months ago
- ☆11 Updated 2 years ago
- ☆43 Updated 5 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆19 Updated last year
- Transformer-based World Models ☆80 Updated 2 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆92 Updated 8 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆80 Updated last year
- ☆81 Updated 2 months ago
- Official implementation of the BRO algorithm ☆42 Updated 2 months ago
- ☆15 Updated last year
- Conservative Q learning in Jax ☆53 Updated 2 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code ☆17 Updated 4 years ago
- A benchmark for offline goal-conditioned RL and offline RL ☆157 Updated 3 weeks ago
- Goal-Conditioned Reinforcement Learning with JAX ☆149 Updated 3 weeks ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆80 Updated 4 months ago
- Simple maze environments using mujoco-py ☆54 Updated last year
- ☆25 Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆23 Updated last year
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344 ☆35 Updated 10 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆68 Updated last year
- [ICLR 2025] Bootstrapped Model Predictive Control ☆12 Updated last week
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements … ☆73 Updated 11 months ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆70 Updated 10 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg… ☆44 Updated last year
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and … ☆37 Updated last year
- Foundation Policies with Hilbert Representations (ICML 2024) ☆83 Updated last year
- A lightweight reimplementation of Adversarially Trained Actor Critic ☆18 Updated last year
- Object Centric Atari games ☆74 Updated this week
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆23 Updated last year