DAC: Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning.
☆31Jun 3, 2024Updated 2 years ago
Alternatives and similar repositories for DAC
Users that are interested in DAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆35Nov 3, 2023Updated 2 years ago
- code for the paper Offline Prioritized Experience Replay☆12Jun 13, 2023Updated 3 years ago
- ☆11Oct 3, 2022Updated 3 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Feb 14, 2024Updated 2 years ago
- EDIS: Energy-guided DIffusion Sampling☆19Aug 10, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- ☆16Apr 14, 2026Updated 2 months ago
- Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"☆34Apr 15, 2025Updated last year
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"☆12Jul 15, 2024Updated last year
- Repo for Implicit Diffusion Q-Learning☆125Dec 5, 2023Updated 2 years ago
- [NeurIPS 2023] Refining Diffusion Planner for Reliable Behavior Synthesis by Automatic Detection of Infeasible Plans☆22Jan 31, 2024Updated 2 years ago
- [NeuIPS2024 DTQL] Diffusion Trusted Q-Learning for Offline RL — Official PyTorch Implementation☆27May 31, 2024Updated 2 years ago
- 该项目为Overleaf提供可直接导入的《自动化学报》中文稿件LateX☆31Nov 9, 2023Updated 2 years ago
- Official implementation of NeurIPS'23 paper, Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets☆25Jan 29, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Q-learning with Adjoint Matching☆96May 11, 2026Updated last month
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- ☆35Jun 21, 2024Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆25Apr 19, 2024Updated 2 years ago
- Ros2 vendor for the Acados NMPC solver.☆21Jan 26, 2026Updated 5 months ago
- Official PyTorch implementation of AlberDICE☆23Dec 8, 2023Updated 2 years ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆38Dec 30, 2024Updated last year
- GBC: Generalized Behavior-Cloning Framework for Whole-Body Humanoid Imitation☆45May 12, 2026Updated last month
- Joint trajectory planning for constrained manipulation using the Closed-Chain Affordance framework by Janak Panthi☆14May 23, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆430Apr 29, 2024Updated 2 years ago
- Code for Scalable Offline Model-Based RL with Action chunking☆29Feb 20, 2026Updated 4 months ago
- Synthetic Experience Replay☆114Apr 16, 2026Updated 2 months ago
- ☆31May 30, 2025Updated last year
- [ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…☆34May 31, 2024Updated 2 years ago
- Official implementation of ICML'24 paper "Offline Multi-Objective Optimization".☆25May 24, 2026Updated last month
- [NeurIPS 2025 Spotlight] Generative Trajectory Stitching through Diffusion Composition☆75Sep 6, 2025Updated 9 months ago
- Information-based Active SLAM via Topological Feature Graphs☆10Aug 7, 2022Updated 3 years ago
- ☆31Oct 3, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fork of mjlab which supports unitree H1 & robotiq_2f85, personal implementation of the locomotion policy training in "HOMIE: Humanoid Loc…☆29Mar 25, 2026Updated 3 months ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆15Nov 4, 2025Updated 7 months ago
- PyTorch implementation for our paper "Improving GFlowNets for Text-to-Image Diffusion Alignment."☆32Sep 6, 2024Updated last year
- [CoRL 2024] Official PyTorch implementation of "Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance…☆50Jun 13, 2026Updated 3 weeks ago
- Official implementation of HEAD CoRL 2025☆26Aug 22, 2025Updated 10 months ago
- ☆11Nov 27, 2025Updated 7 months ago
- The implementation of elevation mapping on humanoid robots using a single MID-360 LiDAR.☆96Feb 22, 2025Updated last year