yihaosun1124 / mobileView external linksLinks
Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆22Apr 17, 2024Updated last year
Alternatives and similar repositories for mobile
Users that are interested in mobile are comparing it to the libraries listed below
Sorting:
- Faster RCNN using TensorFlow☆10Jul 31, 2022Updated 3 years ago
- Some notes and solutions to "Machine Learning" authored by Zhi-Hua Zhou☆11Jul 20, 2021Updated 4 years ago
- ☆10Sep 19, 2023Updated 2 years ago
- ☆10Mar 11, 2024Updated last year
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- re-implementation of the offline model-based RL algorithm MOPO in pytorch☆25Feb 28, 2022Updated 3 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆383Jul 11, 2025Updated 7 months ago
- GPT implementation in Flax☆18Jan 8, 2022Updated 4 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆161Sep 12, 2023Updated 2 years ago
- Standalone library of frequently-used wrappers for dm_env environments.☆18Jul 9, 2024Updated last year
- ☆26Apr 22, 2024Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆113May 27, 2024Updated last year
- ☆12May 29, 2025Updated 8 months ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- MuJoCo models for Unitree Robots☆12Nov 24, 2021Updated 4 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Jan 27, 2026Updated 2 weeks ago
- Conservative Q learning in Jax☆57Feb 7, 2023Updated 3 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆32Jun 2, 2023Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆36Jan 24, 2026Updated 2 weeks ago
- Code for MOPO: Model-based Offline Policy Optimization☆191May 17, 2022Updated 3 years ago
- Flow RL is a high-performance RL library with flow and diffusion models.☆26Feb 6, 2026Updated last week
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆104May 17, 2022Updated 3 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- ☆15Sep 14, 2020Updated 5 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 4 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆17Feb 14, 2024Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆40Feb 18, 2025Updated 11 months ago
- ☆18May 30, 2023Updated 2 years ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Oct 6, 2021Updated 4 years ago
- Paper List for In-context Learning 🌷☆20Jan 3, 2023Updated 3 years ago
- Implementation of Denoising Diffusion Probabilistic Models (DDPM) in JAX and Flax.☆22Oct 12, 2023Updated 2 years ago
- ☆80Dec 9, 2022Updated 3 years ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- Learning Robust Dynamics Through Variational Sparse Gating☆20Oct 19, 2022Updated 3 years ago
- ☆52Jan 20, 2023Updated 3 years ago