facebookresearch / motif
Intrinsic Motivation from Artificial Intelligence Feedback
☆118 Updated last year
Related projects
Alternatives and complementary repositories for motif
- ☆73 Updated 4 months ago
- Efficient baselines for autocurricula in JAX. ☆172 Updated 2 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆84 Updated last month
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆86 Updated last year
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft ☆170 Updated 5 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs). ☆195 Updated this week
- General multi-task deep RL Agent ☆164 Updated 5 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. … ☆120 Updated 6 months ago
- ☆135 Updated 6 months ago
- ☆121 Updated 9 months ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking ☆252 Updated 4 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum. ☆213 Updated 3 weeks ago
- An OpenAI gym environment to evaluate the ability of LLMs (e.g., GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult… ☆62 Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text ☆219 Updated 2 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆45 Updated 5 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL" ☆102 Updated 7 months ago
- ☆202 Updated last year
- Awesome Open-ended AI ☆179 Updated last month
- Code for "Learning to Model the World with Language." ICML 2024 Oral. ☆362 Updated last year
- A simple and scalable agent for training adaptive policies with sequence-based RL ☆89 Updated this week
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆93 Updated 3 months ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning" ☆197 Updated last year
- Interpreting how transformers simulate agents performing RL tasks ☆69 Updated last year
- ☆61 Updated 2 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆36 Updated 10 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 Updated 2 months ago
- Implementation of "Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction" ☆42 Updated last year
- A benchmark for evaluating learning agents based on just language feedback ☆56 Updated last month
- Bootstrapping ARC ☆38 Updated this week
- GROOT: Learning to Follow Instructions by Watching Gameplay Videos ☆56 Updated 10 months ago