polixir / morecLinks
☆10Updated last year
Alternatives and similar repositories for morec
Users that are interested in morec are comparing it to the libraries listed below
Sorting:
- ☆47Updated 6 months ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆18Updated last year
- ☆15Updated last year
- ☆33Updated 2 years ago
- ☆23Updated last year
- ☆28Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆25Updated 9 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆101Updated last year
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Updated 6 months ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- ☆15Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆84Updated 6 months ago
- ☆7Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- ☆17Updated last year
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)☆16Updated 2 years ago
- ☆48Updated last year
- ☆18Updated last year
- official implementation of ODICE☆18Updated last year
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆18Updated 4 years ago
- ☆61Updated 6 months ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆36Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆25Updated last year
- ☆18Updated 2 years ago
- Conservative Q learning in Jax☆54Updated 2 years ago
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆39Updated 11 months ago
- ☆24Updated last year
- ☆56Updated 2 years ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆74Updated last year
- DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.☆19Updated last year