Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)
☆54Jul 7, 2021Updated 4 years ago
Alternatives and similar repositories for BREMEN
Users that are interested in BREMEN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10May 24, 2021Updated 4 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆190May 17, 2022Updated 3 years ago
- ☆14May 31, 2022Updated 3 years ago
- ☆26Mar 16, 2023Updated 3 years ago
- ☆203Mar 25, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆18May 14, 2019Updated 6 years ago
- Pixyz Tutorial in RL Architecture Study Group☆11Apr 25, 2019Updated 6 years ago
- ☆88Jul 30, 2024Updated last year
- Official code for "Task-Embedded Control Networks for Few-Shot Imitation Learning".☆46Nov 29, 2019Updated 6 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆29Jan 12, 2023Updated 3 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- ☆54Dec 19, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Mar 3, 2023Updated 3 years ago
- PyTorch Implementation of TecNets (Task-Embedded Control Networks)☆10Dec 8, 2022Updated 3 years ago
- Code for FOCAL Paper Published at ICLR 2021☆55Dec 4, 2023Updated 2 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14May 23, 2021Updated 4 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 4 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆400Dec 18, 2021Updated 4 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Jun 13, 2019Updated 6 years ago
- The code for the paper *The Sensitivity of Counterfactual Fairness to Unmeasured Confounding* @ UAI 2019☆14Apr 4, 2020Updated 5 years ago
- ☆17Nov 16, 2020Updated 5 years ago
- ☆80Dec 9, 2022Updated 3 years ago
- Code for conservative Q-learning☆476Dec 7, 2021Updated 4 years ago
- Symbol Emergence in Robotics tool KIT☆21Nov 15, 2023Updated 2 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆660Apr 6, 2021Updated 4 years ago
- Environments to support https://github.com/sholtodouglas/learning_from_play and reinforcement learning for robotic manipulation.☆21Mar 28, 2021Updated 4 years ago
- ☆112Aug 6, 2024Updated last year
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- Exploring the Dyna-Q reinforcement learning algorithm☆17Feb 27, 2018Updated 8 years ago
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)☆11Jul 4, 2022Updated 3 years ago
- ☆22Oct 4, 2021Updated 4 years ago