(NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value
☆35Mar 29, 2024Updated 2 years ago
Alternatives and similar repositories for RQL-release
Users that are interested in RQL-release are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- [ICML 2025] Official Github Repo for WOMD-Reasoning Dataset☆44Nov 27, 2025Updated 5 months ago
- Format your bibtex (.bib) file to help standardize citations for conference and journal submissions☆14Nov 23, 2025Updated 5 months ago
- Train, visualize, and evaluate RL policies for the Terra environment.☆19Apr 23, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆32Oct 31, 2025Updated 6 months ago
- The official implementation of Residual-MPPI☆16Mar 22, 2025Updated last year
- Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>☆14Feb 15, 2023Updated 3 years ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).☆48Feb 10, 2024Updated 2 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Generative Modeling via Drifting in MLX☆42Feb 6, 2026Updated 2 months ago
- ☆18Apr 15, 2021Updated 5 years ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆40Dec 2, 2025Updated 5 months ago
- Mathematics, Algorithmic, Data-Science, Teaching Materials☆13Jan 18, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19Apr 22, 2024Updated 2 years ago
- ☆50Aug 27, 2024Updated last year
- ☆21Mar 19, 2024Updated 2 years ago
- Official implementation of PreTraM☆26Aug 13, 2022Updated 3 years ago
- PyTorch implementation of DreamerV3, Mastering Diverse Domains through World Models.☆11Feb 16, 2024Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆113Oct 27, 2024Updated last year
- Implementation of Latent Diffusion Planning (Amber Xie, Oleh Rybkin, Dorsa Sadigh, Chelsea Finn)☆65Jun 29, 2025Updated 10 months ago
- ☆11Sep 29, 2021Updated 4 years ago
- [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"☆14Apr 30, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"☆105Jan 23, 2025Updated last year
- Official code repository for the ICLR 2022 paper "You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction".☆14Jul 25, 2024Updated last year
- ☆12Mar 18, 2024Updated 2 years ago
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆46Jul 27, 2023Updated 2 years ago
- A unified Python simulation and hardware communication environment for Franka FR3 robots.☆22Aug 15, 2024Updated last year
- A custom open ai gym environment for solo experimentation.☆12Apr 14, 2021Updated 5 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Feb 14, 2023Updated 3 years ago
- jw converter: 将某校教务平台的课表转换为 ICS 文件☆15Mar 17, 2026Updated last month
- [AAMAS'26] xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing☆26Jan 8, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Dec 30, 2024Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 2 years ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆140Jul 31, 2024Updated last year
- ☆14Mar 5, 2024Updated 2 years ago
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021)☆24Nov 16, 2021Updated 4 years ago
- Handeye calibration for FR3 & Realsense with Ros2. Using Ros2 Humble, easy_handeye2, ros2_aruco.☆21Jun 4, 2025Updated 10 months ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆22Nov 25, 2024Updated last year