Code for the paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"
☆21Feb 24, 2024Updated 2 years ago
Alternatives and similar repositories for entropy-offlineRL
Users who are interested in entropy-offlineRL are comparing it to the libraries listed below.
- Codebase for Extracting Reward Functions from Diffusion Models☆16Dec 7, 2023Updated 2 years ago
- ☆13May 29, 2024Updated 2 years ago
- [ICML 2024] The official implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…☆34May 31, 2024Updated 2 years ago
- Official implementation of "Flow Based Policy for Online Reinforcement Learning"☆91Oct 29, 2025Updated 8 months ago
- NeurIPS 2024 DACER☆179Feb 28, 2026Updated 4 months ago
- ☆127May 30, 2023Updated 3 years ago
- ☆24May 20, 2025Updated last year
- ☆20Jan 30, 2025Updated last year
- ☆22May 27, 2024Updated 2 years ago
- A benchmark for offline goal-conditioned RL and offline RL☆427Jan 14, 2026Updated 5 months ago
- Code for "Goal-Conditioned Predictive Coding for Offline Reinforcement Learning" (NeurIPS 2023)☆14Dec 8, 2023Updated 2 years ago
- A list of Offline to Online RL papers (continually updated)☆99Apr 25, 2026Updated 2 months ago
- Next-generation MCMC samplers with automatic differentiation and adaptive Poisson thinning☆13Updated this week
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆11Dec 30, 2024Updated last year
- An Ethereum-based digital rights management system☆11Mar 1, 2021Updated 5 years ago
- [CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution"☆92Sep 29, 2024Updated last year
- Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model"☆117Oct 24, 2025Updated 8 months ago
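- An NFT digital collectibles website built on the FISCO BCOS blockchain, using IPFS for storage; every transaction is recorded on-chain, making transactions tamper-proof and traceable☆20Jan 25, 2024Updated 2 years ago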
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- Gradient-informed particle MCMC methods☆12Jan 29, 2024Updated 2 years ago
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 10 months ago
- ☆15Apr 17, 2020Updated 6 years ago
- This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…☆667Nov 29, 2024Updated last year
- The official code release for "Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning", ICLR 2025☆22May 28, 2025Updated last year
- Simple MoE - Day 17 of 365 Days of Repos☆20Jun 2, 2026Updated 3 weeks ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- This project is developing a hybrid DRL-MPC model for motion planning of AVs at unsignalized intersections. The work is based on the Highw…☆20Mar 8, 2026Updated 3 months ago
- Chirp instantaneous frequency estimation using stochastic differential equation Gaussian processes☆13Oct 30, 2024Updated last year
- Official implementation of QVPO☆65Jan 23, 2026Updated 5 months ago
- ☆64Nov 15, 2024Updated last year
- Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding☆72Apr 29, 2026Updated 2 months ago
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆719Apr 20, 2025Updated last year
- Taylor moment expansion in Python (JaX and SymPy) and Matlab☆11Nov 26, 2024Updated last year
- Official repo for Offline RL for Online RL☆18Oct 14, 2023Updated 2 years ago
- PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym☆11May 20, 2018Updated 8 years ago
- Using a diffusion model to achieve controllable end-to-end driving in the CARLA simulation environment.☆29Mar 18, 2025Updated last year
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆32Nov 12, 2024Updated last year
- Comp 781 Project☆10Jan 2, 2026Updated 5 months ago