ChenDRAG / SfBC
Codes accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
☆38Updated last year
Alternatives and similar repositories for SfBC:
Users that are interested in SfBC are comparing it to the libraries listed below
- Universal Visual Decomposer: Long-Horizon Manipulation Made Easy☆45Updated 2 months ago
- The Emergence of Individuality☆13Updated 3 years ago
- The code repo contains multiple code reproduction processes of various SOTA deep learning algorithms☆37Updated 3 years ago
- Population-Based Training (PBT) for Reinforcement Learning using Message Passing Interface (MPI)☆36Updated 2 years ago
- WorldGPT: Empowering LLM as Multimodal World Model☆115Updated 7 months ago
- When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning☆11Updated 8 months ago
- Combining Diffusion Models with PPO to Improve Sample Efficiency and Exploration in Reinforcement Learning☆105Updated 3 weeks ago
- Enhancing Pedestrian Route Choice Models through Maximum-Entropy Deep Inverse Reinforcement Learning with Individual Covariates (MEDIRL-I…☆32Updated 6 months ago
- Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library☆85Updated 3 years ago
- Reinforcement learning algorithms with pytorch☆31Updated 2 years ago
- ☆30Updated 2 years ago
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆25Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆33Updated 4 months ago
- An improved version of EOI on Starcraft II task so_many_baneling. (The Emergence of Individuality)☆16Updated 3 years ago
- ☆11Updated 11 months ago
- Official code repository for Prompt-DT.☆107Updated 2 years ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆46Updated last year
- This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tas…☆55Updated this week
- ☆63Updated 3 weeks ago
- ☆20Updated 11 months ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆52Updated last year
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response☆40Updated 3 months ago
- ☆61Updated 4 months ago
- AI2-THOR Data Collection Tool Based On Keyboard Interaction☆49Updated 9 months ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆13Updated 5 months ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Updated 2 years ago
- Implementation of the paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆15Updated 5 months ago
- ☆30Updated last year
- [NeurIPS 2023] Efficient Diffusion Policy☆96Updated last year
- A collection of URDF model used in Pybullet☆36Updated 5 months ago