Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
β30Jun 30, 2020Updated 5 years ago
Alternatives and similar repositories for PRAE
Users that are interested in PRAE are comparing it to the libraries listed below
Sorting:
- π Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)β18Jul 6, 2023Updated 2 years ago
- β31Feb 20, 2021Updated 5 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyroβ12Jun 14, 2018Updated 7 years ago
- MuJoCo models for Unitree Robotsβ12Nov 24, 2021Updated 4 years ago
- Variational Reinforcement Learningβ17Jul 25, 2024Updated last year
- A minimal implementation of Go-Explore without domain knowledgeβ15Apr 26, 2021Updated 4 years ago
- Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Leβ¦β18Apr 13, 2021Updated 4 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policiesβ19Mar 10, 2021Updated 4 years ago
- Non-linear policy graph improvement - planning for Dec-POMDPsβ16Mar 3, 2021Updated 5 years ago
- β32Feb 21, 2021Updated 5 years ago
- β19Jul 18, 2021Updated 4 years ago
- Clockwork VAEs in JAX/Flaxβ32Jul 16, 2021Updated 4 years ago
- Implementation of REBAR in PyTorchβ17Jul 18, 2018Updated 7 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)β43Dec 8, 2022Updated 3 years ago
- Implicit Differentiable Optimal Control (IDOC) with JAXβ12May 11, 2022Updated 3 years ago
- General framework for Bayesian inversion of continuous hierarchical modelsβ10Sep 20, 2021Updated 4 years ago
- Sequential Monte Carlo sampler for PyMC2 models.β13Apr 4, 2018Updated 7 years ago
- Quasi-Newton Algorithm for Stochastic Optimizationβ11May 20, 2022Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"β46Sep 20, 2023Updated 2 years ago
- MSRSegNet: Multi-Scale Residual Network for Semantic Segmentationβ10Aug 9, 2018Updated 7 years ago
- Neural Fixed-Point Acceleration for Convex Optimizationβ29Oct 6, 2022Updated 3 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"β30Jul 20, 2022Updated 3 years ago
- A set of environments utilizing pybullet for simulation of robotic manipulation tasks.β29Mar 8, 2021Updated 4 years ago
- Code for "Deep predictive coding network for object recognition"β26Apr 2, 2020Updated 5 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"β46Nov 22, 2022Updated 3 years ago
- A PyTorch implementation of BCOβ12Jun 19, 2023Updated 2 years ago
- This repo contains most of outstanding papers on visual saliency (2013-2017).β10Dec 6, 2017Updated 8 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.β25Apr 15, 2023Updated 2 years ago
- Example implementation of "Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles" by Buu Phan, β¦β18Jan 22, 2026Updated last month
- Sample pytorch implementation of Covariant Compositional Networksβ13Feb 17, 2018Updated 8 years ago
- β14Jun 26, 2019Updated 6 years ago
- Multi-agent active perception with prediction rewardsβ11Nov 13, 2020Updated 5 years ago
- DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANAβ13Jul 22, 2019Updated 6 years ago
- Code repository for the CoRL 2021 paper "RoCUS: Robot Controller Understanding via Sampling"β12Mar 24, 2022Updated 3 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019β81Jul 23, 2019Updated 6 years ago
- Implementing Visual Saliency Modelsβ13Jan 10, 2018Updated 8 years ago
- A squad movement planning library for StarCraft AI using Monte Carlo Tree Search and Negamaxβ14Jan 1, 2019Updated 7 years ago
- [CoRL 2021] A robotics benchmark for cross-embodiment imitation.β60Oct 4, 2023Updated 2 years ago
- β14Oct 7, 2022Updated 3 years ago