inspirai / TimeChamberLinks
A Massively Parallel Large Scale Self-Play Framework
☆361Updated 3 years ago
Alternatives and similar repositories for TimeChamber
Users that are interested in TimeChamber are comparing it to the libraries listed below
Sorting:
- A collection of LLM with RL papers☆278Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆198Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆275Updated 3 months ago
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆273Updated 10 months ago
- Unified Reinforcement Learning Framework☆813Updated last year
- A large-scale multi-modal pre-trained model☆133Updated 2 years ago
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆385Updated last year
- ☆89Updated 2 years ago
- ☆376Updated 2 years ago
- ☆263Updated 2 months ago
- TextStarCraft2,a pure language env which support llms play starcraft2☆301Updated 9 months ago
- [ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning☆197Updated last year
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆51Updated 10 months ago
- DrQ-v2: Improved Data-Augmented Reinforcement Learning☆429Updated 3 years ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆539Updated 2 months ago
- CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making☆692Updated 9 months ago
- Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"☆234Updated last week
- ☆461Updated last year
- Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models☆237Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆67Updated 2 years ago
- ☆49Updated 8 months ago
- Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"☆747Updated 8 months ago
- A minimal and stable PPO.☆146Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆294Updated last year
- Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"☆324Updated 2 years ago
- off-policy RL on long sequences☆159Updated this week
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆167Updated 2 years ago
- ☆25Updated 3 years ago
- [ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"☆291Updated 10 months ago