SonyResearch / simba
☆78 Updated 3 weeks ago
Alternatives and similar repositories for simba:
Users who are interested in simba are comparing it to the libraries listed below.
- A benchmark for offline goal-conditioned RL and offline RL ☆139 Updated 3 weeks ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆77 Updated 3 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆64 Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆67 Updated 9 months ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆80 Updated 11 months ago
- ☆18 Updated last month
- Skeleton for scalable and flexible Jax RL implementations ☆74 Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆100 Updated 9 months ago
- Transformer-based World Models ☆78 Updated last year
- Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆90 Updated 7 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and … ☆37 Updated last year
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024 ☆63 Updated 9 months ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆149 Updated this week
- Goal-Conditioned Reinforcement Learning with JAX ☆131 Updated this week
- Source files to replicate experiments in my ICLR 2022 paper. ☆69 Updated 8 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆19 Updated last year
- Official implementation of the BRO algorithm ☆39 Updated last month
- Learning diverse options through the Laplacian representation. ☆23 Updated last year
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch) ☆25 Updated 5 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆53 Updated 11 months ago
- ☆25 Updated last year
- ☆79 Updated 9 months ago
- ☆42 Updated 4 months ago
- A simple and scalable agent for training adaptive policies with sequence-based RL ☆114 Updated last month
- Synchronized Curriculum Learning for RL Agents ☆41 Updated this week
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3 ☆31 Updated last year
- Synthetic Experience Replay ☆89 Updated 9 months ago
- Code for TransDreamer: Reinforcement Learning with Transformer World Models ☆25 Updated last year