Implementation of TWOSOME
☆82Jan 11, 2025Updated last year
Alternatives and similar repositories for TWOSOME
Users that are interested in TWOSOME are comparing it to the libraries listed below
Sorting:
- ☆15May 11, 2023Updated 2 years ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Oct 27, 2025Updated 4 months ago
- ☆89Aug 21, 2023Updated 2 years ago
- ☆15Mar 26, 2024Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆244Dec 11, 2025Updated 2 months ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆52Apr 19, 2024Updated last year
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆44Oct 13, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Hao Jin, Yang Peng, Wenhao Yang, Shusen Wang and Zhihua Zhang. Federated Reinforcement Learning with Environment Heterogeneity. AISTATS, …☆63Feb 14, 2022Updated 4 years ago
- The source code of "ResQ: A Residual Q Function-based Approach for Multi-Agent Reinforcement Learning Value Factorization. NeurIPS 2022"☆18Oct 17, 2022Updated 3 years ago
- ☆42Jan 9, 2024Updated 2 years ago
- Text world based on Minecraft rules.☆17May 13, 2024Updated last year
- ☆12Aug 15, 2020Updated 5 years ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆40May 2, 2024Updated last year
- EMNLP2022: Learning Robust Representations for Continual Relation Extraction via Adversarial Class Augmentation☆15Oct 19, 2022Updated 3 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆18Aug 9, 2024Updated last year
- [NeurIPS 2024] Official Implementation of Meta-DT☆53Oct 16, 2024Updated last year
- Official python implementation of ASGRL in ICML 2022 paper: Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill D…☆20Oct 5, 2022Updated 3 years ago
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆19Aug 20, 2023Updated 2 years ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆543Nov 17, 2025Updated 3 months ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning☆648Feb 8, 2026Updated 3 weeks ago
- ☆53Feb 19, 2025Updated last year
- ☆30Jan 27, 2025Updated last year
- ☆31Jul 3, 2025Updated 8 months ago
- A benchmark for evaluating learning agents based on just language feedback☆94Jun 10, 2025Updated 8 months ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 3 years ago
- ☆47May 21, 2024Updated last year
- Code for paper "SPG Sandwiched Policy Gradient for Masked Diffusion Language Models"☆49Oct 29, 2025Updated 4 months ago
- Docker for the minerl gym environment with Jupyter, PyTorch and CUDA drivers installed☆20Aug 15, 2019Updated 6 years ago
- Evaluate Multimodal LLMs as Embodied Agents☆57Feb 14, 2025Updated last year
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆34Jun 21, 2025Updated 8 months ago
- Official Implementation of CL-ALFRED (ICLR'24)☆30Oct 24, 2024Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆297Nov 16, 2024Updated last year
- Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models☆238Oct 4, 2023Updated 2 years ago
- Web application where humans can play Overcooked with AI agents.☆60Dec 6, 2022Updated 3 years ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆261May 5, 2025Updated 9 months ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆106Mar 14, 2024Updated last year
- ☆118Apr 8, 2025Updated 10 months ago
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆30Apr 27, 2024Updated last year