SonyResearch / simba
☆75Updated 2 weeks ago
Alternatives and similar repositories for simba:
Users who are interested in simba compare it with the libraries listed below
- A benchmark for offline goal-conditioned RL and offline RL☆132Updated last week
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆67Updated 9 months ago
- Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning"☆88Updated 7 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆64Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆73Updated last year
- Goal-Conditioned Reinforcement Learning with JAX☆127Updated this week
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆77Updated 3 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆79Updated 10 months ago
- Transformer-based World Models☆77Updated last year
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆37Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆99Updated 9 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- Synthetic Experience Replay☆87Updated 9 months ago
- Code for "TransDreamer: Reinforcement Learning with Transformer World Models"☆25Updated last year
- Clean single-file implementation of offline RL algorithms in JAX☆134Updated 2 months ago
- Official implementation of the BRO algorithm☆38Updated last month
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3☆31Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆110Updated 3 years ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆64Updated 9 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆53Updated 11 months ago
- ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (PyTorch)☆25Updated 4 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆87Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting☆141Updated 3 months ago
- Meta-RL Model-Based Algorithm☆31Updated 9 months ago
- A simple and scalable agent for training adaptive policies with sequence-based RL☆113Updated 3 weeks ago