WeihaoTan / TWOSOME

Implementation of TWOSOME

☆77

Alternatives and similar repositories for TWOSOME

Users who are interested in TWOSOME are comparing it to the repositories listed below.

yuqingd / ellm
☆81 Updated last year
flowersteam / Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
☆268 Updated 11 months ago
HosnLS / Hierarchical-Language-Agent
☆33 Updated last year
123penny123 / Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
☆372 Updated last year
xlang-ai / text2reward
[ICLR 2024 Spotlight] Code for the paper "Text2Reward: Reward Shaping with Language Models for Reinforcement Learning"
☆172 Updated 7 months ago
OpenDFM / Rememberer
[NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents
☆34 Updated last year
Shanghai-Digital-Brain-Laboratory / BDM-DB1
A large-scale multi-modal pre-trained model
☆132 Updated 2 years ago
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆163 Updated last year
PKU-RL / CLIP4MC
An RL-Friendly Vision-Language Model for Minecraft
☆33 Updated 9 months ago
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆236 Updated 9 months ago
mxu34 / prompt-dt
Official code repository for Prompt-DT.
☆114 Updated 3 years ago
UMass-Embodied-AGI / CoELA
[ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"
☆265 Updated 4 months ago
devindeng94 / LLM-SMAC
A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models
☆46 Updated 4 months ago
liyang619 / COLE-Platform
Overcooked human-AI experiment platform
☆38 Updated last year
NJU-RL / Meta-DT
[NeurIPS 2024] Official Implementation of Meta-DT
☆45 Updated 9 months ago
srzer / LaMo-2023
Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".
☆53 Updated last year
YaoMarkMu / Awesome-Pretrained-RL
☆89 Updated 2 years ago
bigai-ai / civrealm
CivRealm is an interactive environment for the open-source strategy game Freeciv-web based on Freeciv, a Civilization-inspired game.
☆118 Updated 10 months ago
microsoft / SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …
☆140 Updated last year
eric-ai-lab / llm_coordination
Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…
☆38 Updated 9 months ago
PKU-RL / Plan4MC
[NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks
☆189 Updated last year
floodsung / LLM-with-RL-papers
A collection of LLM with RL papers
☆276 Updated last year
GuanSuns / LLMs-World-Models-for-Planning
The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…
☆99 Updated 11 months ago
PKU-Alignment / ProAgent
AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models
☆89 Updated 5 months ago
1989Ryan / llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…
☆281 Updated 8 months ago
Stanford-ILIAD / Diverse-Conventions
Exploring techniques to generate diverse conventions in multi-agent settings
☆15 Updated last year
ZJLAB-AMMI / LLM4RL
An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
☆78 Updated 11 months ago
WeihaoTan / gym-macro-overcooked
☆13 Updated 2 years ago
pickxiguapi / Uni-RLHF-Platform
Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…
☆37 Updated 8 months ago
CraftJarvis / MC-Controller
Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction"
☆46 Updated last year