inspirai / TimeChamber
A Massively Parallel Large Scale Self-Play Framework
☆309Updated last year
Related projects
Alternatives and complementary repositories for TimeChamber
- Reinforcement learning and planning for Minecraft.☆156Updated 8 months ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆161Updated last month
- A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆328Updated 6 months ago
- Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning☆199Updated last month
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆378Updated 3 weeks ago
- This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym☆655Updated 4 months ago
- TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II☆208Updated 3 weeks ago
- Unified Reinforcement Learning Framework☆641Updated 2 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆190Updated 5 months ago
- A collection of LLM with RL papers☆229Updated 6 months ago
- Online Decision Transformer☆237Updated 9 months ago
- RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation☆412Updated last week
- ☆299Updated last year
- ☆149Updated this week
- A generative and self-guided robotic agent that endlessly proposes and masters new skills.☆590Updated 5 months ago
- SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning☆341Updated this week
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆35Updated 3 months ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆390Updated this week
- Code for RoboFlamingo☆309Updated 6 months ago
- [ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"☆127Updated 2 weeks ago
- Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"☆147Updated 4 months ago
- GRUtopia: Dream General Robots in a City at Scale☆503Updated 2 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆195Updated this week
- Official implementation of Diffusion Policy Policy Optimization, arxiv 2024☆213Updated this week
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark☆393Updated 5 months ago
- Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"☆463Updated 2 years ago
- ☆200Updated 9 months ago
- The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization☆642Updated 7 months ago
- A large-scale multi-modal pre-trained model☆128Updated last year
- Implementation of Dreamer v3 in PyTorch.☆419Updated last month