ruoqizzz/entropy-offlineRL

Code for the paper "Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning"

☆21

Alternatives and similar repositories for entropy-offlineRL

Users that are interested in entropy-offlineRL are comparing it to the libraries listed below.

FelipeNuti / diffusion-relative-rewards
Codebase for Extracting Reward Functions from Diffusion Models
☆16 · Dec 7, 2023 · Updated 2 years ago
spdes / computational-sde-intro-lecture
Lecture "A computational introduction to stochastic differential equations".
☆35 · Mar 13, 2026 · Updated 4 months ago
ltlhuuu / A2PR
[ICML 2024] The official implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…
☆34 · May 31, 2024 · Updated 2 years ago
Roythuly / OMPO
☆13 · May 29, 2024 · Updated 2 years ago
BellmanTimeHut / DIPO
☆129 · May 30, 2023 · Updated 3 years ago
quantumiracle / Consistency_Model_For_Reinforcement_Learning
Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24
☆27 · Aug 28, 2024 · Updated last year
brown-palm / GCPC
Code for "Goal-Conditioned Predictive Coding for Offline Reinforcement Learning" (NeurIPS 2023)
☆14 · Dec 8, 2023 · Updated 2 years ago
happy-yan / DACER-Diffusion-with-Online-RL
NeurIPS 2024 DACER
☆182 · Feb 28, 2026 · Updated 4 months ago
Roythuly / OBAC
☆22 · May 27, 2024 · Updated 2 years ago
twitter / diffusion-rl
☆80 · Dec 9, 2022 · Updated 3 years ago
162348 / PDMPFlux.jl
Next generation MCMC samplers with automatic differentiation and adaptive Poisson thinning
☆13 · Jul 14, 2026 · Updated last week
linhlpv / awesome-offline-to-online-RL-papers
A list of Offline to Online RL papers (continually updated)
☆102 · Apr 25, 2026 · Updated 2 months ago
allenzren / SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…
☆11 · Dec 30, 2024 · Updated last year
nakamotoo / Cal-QL
Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆123 · Jul 31, 2024 · Updated last year
AdrienCorenflos / particle_mala
Gradient-informed particle MCMC methods
☆12 · Jan 29, 2024 · Updated 2 years ago
BrunoFANG1 / openpi_subtask_generation
☆26 · Oct 11, 2025 · Updated 9 months ago
kevin031060 / Genetic_Local_Search_TSP
☆15 · Apr 17, 2020 · Updated 6 years ago
NPLawrence / stochastic_dynamics
Almost Surely Stable Deep Dynamics [NeurIPS 2020]
☆12 · Dec 8, 2022 · Updated 3 years ago
spdes / chirpgp
Chirp instantaneous frequency estimation using stochastic differential equation Gaussian processes
☆13 · Oct 30, 2024 · Updated last year
apexrl / Diff4RLSurvey
This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Re…
☆669 · Nov 29, 2024 · Updated last year
dmksjfl / SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12 · Jan 19, 2024 · Updated 2 years ago
Wei-Nijuan / DecisionSpikeFormer
[CVPR 2025] Decision SpikeFormer: Spike-Driven Transformer for Decision Making
☆19 · Aug 8, 2025 · Updated 11 months ago
qiuliyun / go-copyright-p1
Ethereum-based digital copyright management system
☆11 · Mar 1, 2021 · Updated 5 years ago
bytedance / FlowRL
Official implementation of "Flow Based Policy for Online Reinforcement Learning"
☆92 · Oct 29, 2025 · Updated 8 months ago
sujoyp / subgoal-discovery
Learning from Trajectories via Subgoal Discovery
☆12 · Dec 10, 2020 · Updated 5 years ago
MaxSobolMark / OOO
Official repo for Offline RL for Online RL
☆18 · Oct 14, 2023 · Updated 2 years ago
cccedric / cpql
This is the official PyTorch implementation of the paper "Boosting Continuous Control with Consistency Policy".
☆48 · Nov 11, 2025 · Updated 8 months ago
CleanDiffuserTeam / CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
☆721 · Apr 20, 2025 · Updated last year
tinnerhrhe / MTDiff
☆64 · Nov 15, 2024 · Updated last year
molumitu / BOOM_MBRL
[NeurIPS 2025] BOOM, A Planning-driven Model-Based RL algorithm
☆62 · Apr 23, 2026 · Updated 2 months ago
tejaskhot / pytorch-LunarLander
PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym
☆11 · May 20, 2018 · Updated 8 years ago
ArmaanSethi / Hindsight-Experience-Replay-and-Hierarchical-Reinforcement-Learning
Comp 781 Project
☆10 · Jan 2, 2026 · Updated 6 months ago
katefvision / katefvision.github.io
☆13 · Jan 9, 2018 · Updated 8 years ago
LiZhYun / RL-Plotter-with-Wandb
A plotter for reinforcement learning (RL) using Weights & Biases
☆14 · Dec 20, 2023 · Updated 2 years ago
banma12956 / HIPI-RL
☆10 · Jun 22, 2020 · Updated 6 years ago
he-nantian / ReDiffuser
ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation
☆15 · Jun 2, 2024 · Updated 2 years ago
Zhihaibi / Optimal_control_CMU16-745
☆16 · May 9, 2024 · Updated 2 years ago
imweil / Job-Shop-Scheduling
A two-stage hierarchical RL framework for a variant of the job shop scheduling problem
☆24 · Dec 24, 2025 · Updated 6 months ago
apexrl / GCRL-Collection
This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar…
☆145 · May 10, 2023 · Updated 3 years ago