code for paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"
☆20Feb 24, 2024Updated 2 years ago
Alternatives and similar repositories for entropy-offlineRL
Users that are interested in entropy-offlineRL are comparing it to the libraries listed below
Sorting:
- Lecture "A computational introduction to stochastic differential equations".☆34Jun 30, 2025Updated 8 months ago
- [ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…☆34May 31, 2024Updated last year
- ☆122May 30, 2023Updated 2 years ago
- Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model"☆109Oct 24, 2025Updated 4 months ago
- Official implementation of "Flow Based Policy for Online Reinforcement Learning"☆75Oct 29, 2025Updated 4 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆339Jan 14, 2026Updated last month
- [CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution"☆88Sep 29, 2024Updated last year
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆11Dec 30, 2024Updated last year
- Wuji Hand SDK: C++ core with Python bindings, for controlling and communicating with Wuji Hand.☆17Updated this week
- Official implementation of "Learning Proposals for Practical Energy-Based Regression", AISTATS 2022.☆13Feb 4, 2023Updated 3 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆120Jul 31, 2024Updated last year
- This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".☆47Nov 11, 2025Updated 3 months ago
- Chirp instantaneous frequency estimation using stochastic differential equation Gaussian processes☆13Oct 30, 2024Updated last year
- This repo is for reproducing our results in “Lipschitz Generative Adversarial Nets”.☆11Sep 26, 2020Updated 5 years ago
- Simple MoE - Day 17 of 365 Days of Repos☆17Jan 17, 2025Updated last year
- ☆13May 29, 2024Updated last year
- Next generation MCMC samplers with automatic differentiaion and adaptive Poisson thinning☆13Mar 2, 2026Updated last week
- Comp 781 Project☆10Jan 2, 2026Updated 2 months ago
- ☆13Oct 12, 2023Updated 2 years ago
- Almost Surely Stable Deep Dynamics [NeurIPS 2020]☆13Dec 8, 2022Updated 3 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation☆15Jun 2, 2024Updated last year
- ☆21Apr 15, 2024Updated last year
- This is the repository for the animaquina addon, for now this supports only Xarm. Install and use at your own risk.☆12Jan 17, 2025Updated last year
- official implementation of QVPO☆61Jan 23, 2026Updated last month
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 7 months ago
- This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference cod…☆17Jan 6, 2025Updated last year
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆18May 28, 2025Updated 9 months ago
- This project is developing a hybrid DRL-MPC model for motion planning of AVs at unsignalized intersection. The work is based on the Highw…☆20Updated this week
- 基于fisco bcos区块链实现的nft数字藏品网站,用IPFS进行存储,每次交易均进行上链,实现交易不可篡改,可追溯溯源等功能☆20Jan 25, 2024Updated 2 years ago
- Code for "Goal-Conditioned Predictive Coding for Offline Reinforcement Learning" (NeurIPS 2023)☆14Dec 8, 2023Updated 2 years ago
- ☆16Jan 30, 2025Updated last year
- Gradient-informed particle MCMC methods☆12Jan 29, 2024Updated 2 years ago
- ☆10Jun 22, 2020Updated 5 years ago
- Least Squares Policy Iteration (LSPI) in Python☆11May 25, 2015Updated 10 years ago
- ☆52Jul 21, 2022Updated 3 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- 基于以太坊的数字版权管理系统☆11Mar 1, 2021Updated 5 years ago
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆703Apr 20, 2025Updated 10 months ago