SonyResearch / simba
☆92 · Updated 3 months ago
Alternatives and similar repositories for simba
Users that are interested in simba are comparing it to the libraries listed below
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning" ☆46 · Updated last month
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆69 · Updated last year
- A benchmark for offline goal-conditioned RL and offline RL ☆174 · Updated 2 months ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆88 · Updated last year
- Goal-Conditioned Reinforcement Learning with JAX ☆162 · Updated 3 weeks ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆84 · Updated 6 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆101 · Updated last year
- Unified Implementations of Offline Reinforcement Learning Algorithms ☆78 · Updated last month
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning ☆95 · Updated 10 months ago
- Skeleton for scalable and flexible Jax RL implementations ☆81 · Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆74 · Updated last year
- Transformer-based World Models ☆82 · Updated 2 years ago
- Official implementation of the BRO algorithm ☆44 · Updated 4 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm. ☆100 · Updated last month
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su… ☆28 · Updated last year
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆124 · Updated last week
- Source files to replicate experiments in my ICLR 2022 paper. ☆71 · Updated 11 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023 ☆19 · Updated last year
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow ☆34 · Updated 7 months ago
- ☆47 · Updated 6 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 · Updated last year
- Code for TRANSDREAMER: REINFORCEMENT LEARNING WITH TRANSFORMER WORLD MODELS ☆24 · Updated last year
- JAX implementation of WSRL and RL baselines | ICLR 2025 ☆43 · Updated last month
- PWM: Policy Learning with Large World Models ☆49 · Updated 3 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and … ☆39 · Updated last year
- Deep Hierarchical Planning from Pixels ☆102 · Updated 2 years ago
- Jax/Flax Implementation of TD-MPC2 ☆61 · Updated last week
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements … ☆74 · Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆115 · Updated 3 years ago
- Synthetic Experience Replay ☆92 · Updated last year