ChenDRAG / SfBC
Code accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023): https://arxiv.org/abs/2209.14548
☆40 Updated last year
Alternatives and similar repositories for SfBC
Users who are interested in SfBC are comparing it to the repositories listed below.
- The code repo contains multiple code reproduction processes of various SOTA deep learning algorithms ☆37 Updated 3 years ago
- Population-Based Training (PBT) for Reinforcement Learning using Message Passing Interface (MPI) ☆36 Updated 3 years ago
- The Emergence of Individuality ☆12 Updated 3 years ago
- Universal Visual Decomposer: Long-Horizon Manipulation Made Easy ☆60 Updated 7 months ago
- Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning ☆124 Updated last week
- AAGPT is another experimental open-source application showcasing the capabilities of large language models, such as GPT-3.5 and GPT-4. ☆136 Updated 2 years ago
- 📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including: NeurIPS, ICML, AAAI, IJCAI, AAMAS, ICLR, ICRA, etc. ☆334 Updated last year
- WorldGPT: Empowering LLM as Multimodal World Model ☆119 Updated last year
- Provides a full reinforcement learning benchmark on MuJoCo environments, including DDPG, SAC, TD3, PG, A2C, PPO, library ☆85 Updated 4 years ago
- Enhancing Pedestrian Route Choice Models through Maximum-Entropy Deep Inverse Reinforcement Learning with Individual Covariates (MEDIRL-I… ☆35 Updated 11 months ago
- Any-step Dynamics Model for Policy Optimization ☆60 Updated 6 months ago
- ☆53 Updated 2 weeks ago
- A Mujoco-based simulation platform for humanoid robots with a 3-tier architecture, supporting imitation and reinforcement learning, and f… ☆60 Updated last year
- A curated list of awesome papers on the platonic representation hypothesis. ☆44 Updated 4 months ago
- [CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution" ☆83 Updated 11 months ago
- Code for CIKM'19 "CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms" ☆62 Updated last month
- A collection of URDF models used in PyBullet ☆35 Updated 10 months ago
- When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning ☆12 Updated last year
- This repository contains the core implementation of our ICML 2025 paper: "Token Signature: Predicting Chain-of-Thought Gains with Token D… ☆41 Updated last month
- PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Gr… ☆237 Updated last year
- Hybrid Latent Reasoning via Reinforcement Learning ☆150 Updated 3 months ago
- This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Tran… ☆57 Updated 5 months ago
- [ICML22] "Revisiting and Advancing Fast Adversarial Training through the Lens of Bi-level Optimization" by Yihua Zhang*, Guanhua Zhang*, … ☆65 Updated 2 years ago
- ☆33 Updated 6 months ago
- AI2-THOR Data Collection Tool Based On Keyboard Interaction ☆53 Updated last year
- Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions ☆60 Updated last month
- Reinforcement learning algorithms with pytorch ☆31 Updated 2 years ago
- SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation ☆54 Updated 4 months ago
- [ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning" ☆47 Updated 2 months ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption" ☆18 Updated 9 months ago