(NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value
☆35Mar 29, 2024Updated 2 years ago
Alternatives and similar repositories for RQL-release
Users that are interested in RQL-release are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- [ICML 2025] Official Github Repo for WOMD-Reasoning Dataset☆43Nov 27, 2025Updated 4 months ago
- Train, visualize, and evaluate RL policies for the Terra environment.☆18Feb 10, 2026Updated 2 months ago
- ☆32Oct 31, 2025Updated 5 months ago
- The official implementation of Residual-MPPI☆15Mar 22, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago
- ☆29Oct 3, 2023Updated 2 years ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆48Feb 10, 2024Updated 2 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated 2 months ago
- ☆18Apr 15, 2021Updated 4 years ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆40Dec 2, 2025Updated 4 months ago
- ☆19Apr 22, 2024Updated last year
- ☆50Aug 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆21Mar 19, 2024Updated 2 years ago
- Official implementation of PreTraM☆26Aug 13, 2022Updated 3 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆112Oct 27, 2024Updated last year
- Implementation of Latent Diffusion Planning (Amber Xie, Oleh Rybkin, Dorsa Sadigh, Chelsea Finn)☆64Jun 29, 2025Updated 9 months ago
- ☆11Sep 29, 2021Updated 4 years ago
- [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"☆14Apr 30, 2023Updated 2 years ago
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆103Jan 23, 2025Updated last year
- Official code repository for the ICLR 2022 paper "You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction".☆14Jul 25, 2024Updated last year
- ☆12Mar 18, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆46Jul 27, 2023Updated 2 years ago
- A unified Python simulation and hardware communication environment for Franka FR3 robots.☆21Aug 15, 2024Updated last year
- A custom open ai gym environment for solo experimentation.☆12Apr 14, 2021Updated 4 years ago
- Code for Stable Control Representations☆26Apr 5, 2025Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Feb 14, 2023Updated 3 years ago
- ☆25Aug 19, 2024Updated last year
- jw converter: 将某校教务平台的课表转换为 ICS 文件☆15Mar 17, 2026Updated 3 weeks ago
- official implementation of ODICE☆19Jan 31, 2024Updated 2 years ago
- [AAMAS'26] xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing☆25Jan 8, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆17Dec 30, 2024Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 2 years ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆138Jul 31, 2024Updated last year
- ☆14Mar 5, 2024Updated 2 years ago
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021)☆24Nov 16, 2021Updated 4 years ago
- Handeye calibration for FR3 & Realsense with Ros2. Using Ros2 Humble, easy_handeye2, ros2_aruco.☆20Jun 4, 2025Updated 10 months ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆20Nov 25, 2024Updated last year