ChenDRAG / SfBC

Code accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023): https://arxiv.org/abs/2209.14548

☆40

Alternatives and similar repositories for SfBC

Users who are interested in SfBC are comparing it to the libraries listed below.

HzcIrving / DLRL-PlayGround
This repository contains reproductions of various SOTA deep learning algorithms
☆36 Updated 3 years ago
zcczhang / UVD
Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
☆54 Updated 6 months ago
yyzpiero / EVO-PopulationBasedTraining
Population-Based Training (PBT) for Reinforcement Learning using Message Passing Interface (MPI)
☆37 Updated 3 years ago
jiechuanjiang / eoi_pymarl
The Emergence of Individuality
☆13 Updated 3 years ago
TianciGao / DiffPPO
Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning
☆123 Updated last month
Rafa-zy / QLASS
☆38 Updated last week
ChenDRAG / mujoco-benchmark
Provides a full reinforcement learning benchmark on MuJoCo environments, including ddpg, sac, td3, pg, a2c, ppo, library
☆85 Updated 4 years ago
DCDmllm / WorldGPT
WorldGPT: Empowering LLM as Multimodal World Model
☆117 Updated 11 months ago
aialt / AAGPT
AAGPT is another experimental open-source application showcasing the capabilities of large language models, such as GPT-3.5 and GPT-4.
☆137 Updated 2 years ago
Allenpandas / Reinforcement-Learning-Papers
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc.
☆326 Updated last year
BoyangL1 / Advanced_DeepIRL
Enhancing Pedestrian Route Choice Models through Maximum-Entropy Deep Inverse Reinforcement Learning with Individual Covariates (MEDIRL-I…
☆35 Updated 9 months ago
HxLyn3 / ADMPO
Any-step Dynamics Model for Policy Optimization
☆58 Updated 5 months ago
pigBond / olympics-mujoco
A Mujoco-based simulation platform for humanoid robots with a 3-tier architecture, supporting imitation and reinforcement learning, and f…
☆59 Updated last year
sunrainyg / Awesome-PRH-papers
A curated list of awesome papers on the platonic representation hypothesis.
☆44 Updated 3 months ago
Liang-ZX / SkillDiffuser
[CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution"
☆80 Updated 9 months ago
SMARTlab-Purdue / SAN-NaviSTAR
This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Tran…
☆56 Updated 4 months ago
OPTML-Group / Fast-BAT
[ICML22] "Revisiting and Advancing Fast Adversarial Training through the Lens of Bi-level Optimization" by Yihua Zhang*, Guanhua Zhang*, …
☆65 Updated 2 years ago
shihongl1998 / LLM-as-a-blackbox-optimizer
☆67 Updated 4 months ago
luo-junyu / RobustFT
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
☆42 Updated 7 months ago
yding25 / URDF_models
A collection of URDF models used in PyBullet
☆36 Updated 9 months ago
zcczhang / rmrl
When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning
☆12 Updated last year
SysCV / soccer-player
☆33 Updated 5 months ago
OPTML-Group / BiP
[NeurIPS22] "Advancing Model Pruning via Bi-level Optimization" by Yihua Zhang*, Yuguang Yao*, Parikshit Ram, Pu Zhao, Tianlong Chen, Min…
☆117 Updated 2 years ago
Jinjiarui / CoRide
Code for CIKM'19 "CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms"
☆62 Updated 2 years ago
ByZ0e / AI2Thor_keyboard_player
AI2-THOR Data Collection Tool Based On Keyboard Interaction
☆51 Updated last year
Yueeeeeeee / HRPO
Hybrid Latent Reasoning via Reinforcement Learning
☆137 Updated last month
ChenDRAG / CEP-energy-guided-diffusion
Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)
☆48 Updated last year
thu-ml / SRPO
Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
☆45 Updated last year
zxzm-zak / FastUMI_Data
A Scalable and Hardware-Independent Universal Manipulation Interface
☆79 Updated 2 months ago
yongchao98 / CodeSteer-v1.0
Code and dataset of CodeSteer
☆59 Updated 3 months ago