nirgreshler / bayesian-online-planningLinks
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13Updated last year
Alternatives and similar repositories for bayesian-online-planning
Users that are interested in bayesian-online-planning are comparing it to the libraries listed below
Sorting:
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆28Updated last year
- speed-running solving robot manipulation tasks☆24Updated last year
- Generalised UDRL☆37Updated 3 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second☆86Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆38Updated 10 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆32Updated 6 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆35Updated 4 months ago
- ☆14Updated last year
- ☆15Updated 3 years ago
- Code for the paper Task Agnostic Morphology Evolution.☆20Updated 4 years ago
- PyTorch Package For Quasimetric Learning☆44Updated last year
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆20Updated last year
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆60Updated last year
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆19Updated 3 years ago
- ☆13Updated 8 months ago
- ☆64Updated last year
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆39Updated last year
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆19Updated last year
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25Updated last year
- Cross-Domain Imitation Learning via Optimal Transport☆25Updated 3 years ago
- ☆14Updated 8 months ago
- Intepretability method to find what navigation agents learn☆20Updated 3 years ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Updated last year
- ☆27Updated last year
- Code for the Ask4Help project☆22Updated 2 years ago
- Code for Continual Learning of Control Primitives☆18Updated 5 years ago
- ☆23Updated 4 years ago
- ☆40Updated 2 months ago