cwj22 / BeT-AIL
☆12 Updated last year
Alternatives and similar repositories for BeT-AIL
Users that are interested in BeT-AIL are comparing it to the libraries listed below
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language… ☆39 Updated last year
- Repo to reproduce the First-Explore paper results ☆38 Updated 10 months ago
- ☆23 Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆65 Updated 8 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆111 Updated last year
- An OpenAI gym environment to evaluate the ability of LLMs (e.g. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult… ☆71 Updated 2 years ago
- ☆219 Updated 2 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated 2 years ago
- A benchmark for evaluating learning agents based on just language feedback ☆90 Updated 4 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 Updated 2 years ago
- PyTorch implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆33 Updated 4 months ago
- Code for Contrastive Preference Learning (CPL) ☆176 Updated 11 months ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆31 Updated 4 years ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023) ☆42 Updated last year
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction". ☆34 Updated last year
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind … ☆36 Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆35 Updated 11 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows! ☆78 Updated 4 months ago
- ☆25 Updated 7 months ago
- PyTorch implementation of the Gato paper from DeepMind ☆11 Updated 2 years ago
- Benchmarks for Multi-Objective Multi-Agent Decision Making ☆109 Updated 3 weeks ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024) ☆25 Updated 10 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (accepted at ICLR 2023) ☆164 Updated 2 years ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆94 Updated 2 years ago
- Verlog: A Multi-turn RL framework for LLM agents ☆63 Updated this week
- INTeractive learning via REPresentatIon Discovery ☆34 Updated last year
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces ☆10 Updated 7 months ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus) ☆20 Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated last year