Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆23Apr 17, 2024Updated last year
Alternatives and similar repositories for mobile
Users that are interested in mobile are comparing it to the libraries listed below
Sorting:
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou☆11Jul 20, 2021Updated 4 years ago
- Faster RCNN using TensorFlow☆10Jul 31, 2022Updated 3 years ago
- ☆10Mar 11, 2024Updated last year
- ☆10Sep 19, 2023Updated 2 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated 11 months ago
- re-implementation of the offline model-based RL algorithm MOPO in pytorch☆25Feb 28, 2022Updated 4 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆383Jul 11, 2025Updated 7 months ago
- GPT implementation in Flax☆18Jan 8, 2022Updated 4 years ago
- ☆20Mar 19, 2024Updated last year
- Author's PyTorch implementation of TD7 for online and offline RL☆161Sep 12, 2023Updated 2 years ago
- Standalone library of frequently-used wrappers for dm_env environments.☆18Jul 9, 2024Updated last year
- ☆27Apr 22, 2024Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆113May 27, 2024Updated last year
- ☆12May 29, 2025Updated 9 months ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆27Jan 27, 2026Updated last month
- Conservative Q learning in Jax☆57Feb 7, 2023Updated 3 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆32Jun 2, 2023Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆36Jan 24, 2026Updated last month
- Code for MOPO: Model-based Offline Policy Optimization☆191May 17, 2022Updated 3 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆105May 17, 2022Updated 3 years ago
- Flow RL is a high-performance RL library with flow and diffusion models.