ruoqizzz / entropy-offlineRLView external linksLinks
code for paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"
☆20Feb 24, 2024Updated last year
Alternatives and similar repositories for entropy-offlineRL
Users that are interested in entropy-offlineRL are comparing it to the libraries listed below
Sorting:
- [ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…☆34May 31, 2024Updated last year
- ☆121May 30, 2023Updated 2 years ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆26Aug 28, 2024Updated last year
- Official implementation of "Flow Based Policy for Online Reinforcement Learning"☆69Oct 29, 2025Updated 3 months ago
- Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model"☆109Oct 24, 2025Updated 3 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆333Jan 14, 2026Updated last month
- [CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution"☆88Sep 29, 2024Updated last year
- ☆19Apr 15, 2024Updated last year
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Simple MoE - Day 17 of 365 Days of Repos☆16Jan 17, 2025Updated last year
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆11Dec 30, 2024Updated last year
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆13May 28, 2025Updated 8 months ago
- Wuji Hand SDK: C++ core with Python bindings, for controlling and communicating with Wuji Hand.☆17Feb 2, 2026Updated 2 weeks ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆119Jul 31, 2024Updated last year
- This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".☆47Nov 11, 2025Updated 3 months ago
- This repo is for reproducing our results in “Lipschitz Generative Adversarial Nets”.☆11Sep 26, 2020Updated 5 years ago
- ☆13May 29, 2024Updated last year
- Code for "Goal-Conditioned Predictive Coding for Offline Reinforcement Learning" (NeurIPS 2023)☆13Dec 8, 2023Updated 2 years ago
- Chirp instantaneous frequency estimation using stochastic differential equation Gaussian processes☆13Oct 30, 2024Updated last year
- Comp 781 Project☆10Jan 2, 2026Updated last month
- ☆13Oct 12, 2023Updated 2 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation☆15Jun 2, 2024Updated last year
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆651Nov 29, 2024Updated last year
- 基于以太坊的数字版权管理系统☆11Mar 1, 2021Updated 4 years ago
- This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference cod…☆16Jan 6, 2025Updated last year
- 基于fisco bcos区块链实现的nft数字藏品网站,用IPFS进行存储,每次交易均进行上链,实现交易不可篡改, 可追溯溯源等功能☆20Jan 25, 2024Updated 2 years ago
- ☆10Jun 22, 2020Updated 5 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- ☆16Jan 30, 2025Updated last year
- A list of Offline to Online RL papers (continually updated)☆69Nov 27, 2025Updated 2 months ago
- Least Squares Policy Iteration (LSPI) in Python☆11May 25, 2015Updated 10 years ago
- ☆52Jul 21, 2022Updated 3 years ago
- This project is developing a hybrid DRL-MPC model for motion planning of AVs at unsignalized intersection. The work is based on the Highw…☆18Sep 2, 2024Updated last year
- RLHF for Video Diffusion Models☆23Jul 30, 2025Updated 6 months ago
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆698Apr 20, 2025Updated 9 months ago
- Study the Scale Invariance or Equivariance Convolutional Neural Network☆13Feb 2, 2020Updated 6 years ago
- Actuator Degeneration Adaptation Transformer☆14Sep 19, 2023Updated 2 years ago
- PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym☆11May 20, 2018Updated 7 years ago