inspirai / TimeChamberLinks
A Massively Parallel Large Scale Self-Play Framework
☆354Updated 2 years ago
Alternatives and similar repositories for TimeChamber
Users that are interested in TimeChamber are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆192Updated last year
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆262Updated 6 months ago
- Unified Reinforcement Learning Framework☆773Updated last year
- TextStarCraft2,a pure language env which support llms play starcraft2☆288Updated 5 months ago
- A large-scale multi-modal pre-trained model☆132Updated 2 years ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆377Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆41Updated 10 months ago
- ☆350Updated 2 years ago
- A collection of LLM with RL papers☆277Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆273Updated last year
- ☆328Updated 2 years ago
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆26Updated 9 months ago
- [ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"☆177Updated 9 months ago
- Online Decision Transformer☆267Updated last year
- Code for "MetaMorph: Learning Universal Controllers with Transformers", Gupta et al, ICLR 2022☆125Updated 3 years ago
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆635Updated 5 months ago
- ☆49Updated 4 months ago
- ☆84Updated 2 years ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆138Updated 5 months ago
- ☆73Updated last year
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization☆845Updated last year
- Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models☆216Updated last year
- A minimal and stable PPO.☆143Updated last year
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆392Updated 2 years ago
- ☆413Updated last year
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"☆188Updated last week
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆623Updated 4 months ago
- Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"☆224Updated this week
- Open Platform for Embodied Agents☆329Updated 8 months ago