facebookresearch / motifView external linksLinks
Intrinsic Motivation from Artificial Intelligence Feedback
☆134Nov 7, 2023Updated 2 years ago
Alternatives and similar repositories for motif
Users that are interested in motif are comparing it to the libraries listed below
Sorting:
- ☆10Oct 11, 2022Updated 3 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆44Jun 14, 2021Updated 4 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- ☆35Jun 9, 2025Updated 8 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces☆16Dec 18, 2024Updated last year
- Learning diverse options through the Laplacian representation.☆23Jan 5, 2024Updated 2 years ago
- Codebase for Inference-Time Policy Adapters☆25Nov 3, 2023Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- Spatial Aptitude Training for Multimodal Langauge Models☆24Updated this week
- Lottery Tickets in Evolutionary Optimization (Lange & Sprekeler, ICML 2023)☆17Jun 2, 2023Updated 2 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Aug 9, 2024Updated last year
- ☆15Jan 21, 2026Updated 3 weeks ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆117Sep 22, 2024Updated last year
- ☆22Nov 8, 2021Updated 4 years ago
- Scalable Opponent Shaping Experiments in JAX☆25Apr 13, 2024Updated last year
- ☆67Mar 6, 2025Updated 11 months ago
- Official Repo of LangSuitE☆84Aug 15, 2024Updated last year
- ☆22Mar 28, 2025Updated 10 months ago
- ☆27Jan 22, 2025Updated last year
- Benchmarks for Model-Based Optimization☆97Apr 21, 2024Updated last year
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".☆27Jun 10, 2025Updated 8 months ago
- Self-Alignment with Principle-Following Reward Models☆169Sep 18, 2025Updated 4 months ago
- A proof of concept / prototype alternative String implementation for Pharo using a variable length UTF8 encoded internal representation☆12May 7, 2022Updated 3 years ago
- ☆12Aug 13, 2025Updated 6 months ago
- Crafting Adversarial Examples for Neural Machine Translation☆10Apr 7, 2023Updated 2 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 3 months ago
- ☆74Apr 27, 2024Updated last year
- Benchmarking Agentic LLM and VLM Reasoning On Games☆228Updated this week
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆506Feb 12, 2025Updated last year
- Repository for code used in the xVal paper☆146Apr 4, 2024Updated last year
- Benchmarking the Spectrum of Agent Capabilities☆515Jan 23, 2024Updated 2 years ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆15Oct 2, 2025Updated 4 months ago
- Applies ROME and MEMIT on Mamba-S4 models☆14Apr 5, 2024Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated last year
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- SAILOR is an inverse RL algorithm that learns world and reward models to search at test-time and recover from mistakes.☆52Nov 2, 2025Updated 3 months ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 3 years ago