Implementation of TWOSOME
☆82Jan 11, 2025Updated last year
Alternatives and similar repositories for TWOSOME
Users that are interested in TWOSOME are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15May 11, 2023Updated 2 years ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆278Oct 27, 2025Updated 6 months ago
- ☆15Mar 26, 2024Updated 2 years ago
- ☆90Aug 21, 2023Updated 2 years ago
- Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model☆53Apr 19, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆248Dec 11, 2025Updated 4 months ago
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language…☆45Oct 13, 2024Updated last year
- Code for demonstration example-task in RUDDER blog☆24May 19, 2020Updated 5 years ago
- The source code of "ResQ: A Residual Q Function-based Approach for Multi-Agent Reinforcement Learning Value Factorization. NeurIPS 2022"☆18Oct 17, 2022Updated 3 years ago
- ☆13Aug 15, 2020Updated 5 years ago
- ☆44Jan 9, 2024Updated 2 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆19Aug 9, 2024Updated last year
- CVPR 2026 - MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation☆47Mar 23, 2026Updated last month
- Hao Jin, Yang Peng, Wenhao Yang, Shusen Wang and Zhihua Zhang. Federated Reinforcement Learning with Environment Heterogeneity. AISTATS, …☆63Feb 14, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆551Nov 17, 2025Updated 5 months ago
- API to run VirtualHome, a Multi-Agent Household Simulator☆609Mar 26, 2026Updated last month
- [NeurIPS 2024] Official Implementation of Meta-DT☆53Oct 16, 2024Updated last year
- A personal, modular Python library built on top of NVIDIA Isaac Sim, designed to simplify robot simulation experiments and prototyping.☆10Aug 16, 2025Updated 8 months ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆40May 2, 2024Updated 2 years ago
- Text world based on Minecraft rules.☆17May 13, 2024Updated last year
- A Survey on Large Language Model-Based Game Agents☆872Feb 13, 2026Updated 2 months ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning☆729Feb 8, 2026Updated 2 months ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A library for constrained RLHF.☆13Feb 19, 2024Updated 2 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Mar 9, 2021Updated 5 years ago
- Setup scripts for the WebArena benchmark☆22Jun 19, 2025Updated 10 months ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆112Apr 17, 2023Updated 3 years ago
- IPyHOP is a Re-entrant Iterative GTPyHOP written in Python 3. PyHOP is an acronym for Python Hierarchical Ordered Planner.☆11Aug 12, 2022Updated 3 years ago
- Default Course Project of CS4278/CS5478 Intelligent Robots: Algorithms and Systems☆14Dec 9, 2024Updated last year
- Repo for Happiness is a Warm Gun☆10Oct 19, 2025Updated 6 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆300Nov 16, 2024Updated last year
- A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆384Apr 24, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models☆250Oct 4, 2023Updated 2 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 6 months ago
- EMNLP2022: Learning Robust Representations for Continual Relation Extraction via Adversarial Class Augmentation☆15Oct 19, 2022Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆26Jun 9, 2021Updated 4 years ago
- ☆30Jan 27, 2025Updated last year
- TextStarCraft2,a pure language env which support llms play starcraft2☆313Apr 25, 2025Updated last year
- ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation☆23Apr 16, 2026Updated 2 weeks ago