ChenDRAG / SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
☆39 · Updated last year
Alternatives and similar repositories for SfBC
Users who are interested in SfBC are comparing it to the repositories listed below.
- Universal Visual Decomposer: Long-Horizon Manipulation Made Easy ☆50 · Updated 4 months ago
- The code repo contains multiple code reproduction processes of various SOTA deep learning algorithms ☆37 · Updated 3 years ago
- Population-Based Training (PBT) for Reinforcement Learning using Message Passing Interface (MPI) ☆36 · Updated 2 years ago
- The Emergence of Individuality ☆13 · Updated 3 years ago
- Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning ☆113 · Updated 3 months ago
- WorldGPT: Empowering LLM as Multimodal World Model ☆116 · Updated 10 months ago
- Any-step Dynamics Model for Policy Optimization ☆56 · Updated 3 months ago
- Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library ☆85 · Updated 4 years ago
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response ☆41 · Updated 5 months ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023) ☆48 · Updated last year
- Code and dataset of CodeSteer ☆56 · Updated 2 months ago
- When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning ☆11 · Updated 11 months ago
- Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024). ☆44 · Updated last year
- ☆23 · Updated last year
- Official PyTorch implementation of "ACE:Off-Policy Actor-Critic with Causality-Aware Entropy Regularization" ☆30 · Updated last year
- SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation ☆54 · Updated last month
- A curated list of awesome papers on the platonic representation hypothesis. ☆41 · Updated last month
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022) ☆35 · Updated last year
- code for the paper Offline Prioritized Experience Replay ☆13 · Updated last year
- The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023) ☆44 · Updated 2 years ago
- Reinforcement learning algorithms with pytorch ☆31 · Updated 2 years ago
- ☆17 · Updated last year
- AI2-THOR Data Collection Tool Based On Keyboard Interaction ☆50 · Updated 11 months ago
- ☆14 · Updated 3 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization ☆26 · Updated last year
- A collection of URDF model used in Pybullet ☆36 · Updated 7 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆47 · Updated last year
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation" ☆17 · Updated 8 months ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆26 · Updated 2 years ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human … ☆38 · Updated last year