Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆42Aug 27, 2022Updated 3 years ago
Alternatives and similar repositories for LOOP
Users that are interested in LOOP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenAi's gym environment wrapper to vectorize them with Ray☆23May 25, 2023Updated 2 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 3 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- ☆99Mar 24, 2023Updated 3 years ago
- Library for Model Based RL☆1,061Jul 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…☆10May 5, 2022Updated 4 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆15Apr 25, 2024Updated 2 years ago
- E2C implementation in PyTorch☆43Jul 5, 2017Updated 8 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆15Oct 3, 2021Updated 4 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.☆13Jul 11, 2022Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Code for the NeurIPS 2021 paper "Safe Reinforcement Learning by Imagining the Near Future"☆52Apr 8, 2022Updated 4 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"☆13Dec 4, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 💥💥 This is a easy installable extension for OpenAi Gym Environment. This simulates SpaceX Falcon landing.☆59Jul 24, 2018Updated 7 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- ☆60Feb 3, 2023Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Nov 15, 2018Updated 7 years ago
- ☆72Jun 20, 2022Updated 3 years ago
- Code for "Temporal Difference Learning for Model Predictive Control"☆513Nov 25, 2023Updated 2 years ago
- ☆23Aug 19, 2022Updated 3 years ago
- Model-based reinforcement learning in TensorFlow☆56Jul 27, 2021Updated 4 years ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization☆38Jan 16, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple maze environments using mujoco-py☆60Dec 27, 2023Updated 2 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆69Jul 17, 2021Updated 4 years ago
- Safe Reinforcement Learning algorithms☆75Aug 27, 2022Updated 3 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Jun 14, 2021Updated 4 years ago
- DMControl Generalization Benchmark☆189Jan 3, 2024Updated 2 years ago
- DrQ-v2: Improved Data-Augmented Reinforcement Learning☆433May 31, 2022Updated 3 years ago
- CACC using a reiforcement learning approach☆11May 6, 2022Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆85Jul 27, 2022Updated 3 years ago
- Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning☆26Feb 8, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆35Jan 4, 2023Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- The implementation of TNNLS paper "Multi-agent Continual Coordination via Progressive Task Contextualization".☆17Dec 24, 2024Updated last year
- ☆11Oct 19, 2023Updated 2 years ago
- Pytorch Implementation of MuZero☆353Jul 23, 2023Updated 2 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- ☆13Mar 16, 2023Updated 3 years ago