Intrinsic Motivation from Artificial Intelligence Feedback
☆134Nov 7, 2023Updated 2 years ago
Alternatives and similar repositories for motif
Users that are interested in motif are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learn online intrinsic rewards from LLM feedback☆45Dec 17, 2024Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- Qt-like event loops, signals and slots for communication across threads and processes in Python☆14Mar 26, 2024Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆23Jan 5, 2024Updated 2 years ago
- ☆22Mar 28, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Jun 14, 2021Updated 4 years ago
- Production build of the new website☆13May 19, 2024Updated last year
- Code for "Baba Is AI: Break the Rules to Beat the Benchmark"☆45Sep 3, 2025Updated 8 months ago
- Codebase for Inference-Time Policy Adapters☆25Nov 3, 2023Updated 2 years ago
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces☆17Dec 18, 2024Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆29Apr 8, 2026Updated last month
- This repository contains the code for Diversity Control (DiCo), a novel method to constrain behavioral diversity in multi-agent reinforce…☆31Dec 21, 2024Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆121Sep 22, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Spatial Aptitude Training for Multimodal Langauge Models☆31Feb 8, 2026Updated 3 months ago
- Self-Alignment with Principle-Following Reward Models☆170Sep 18, 2025Updated 7 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆252Apr 9, 2026Updated last month
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆517Feb 12, 2025Updated last year
- Curiosity-driven Exploration by Self-supervised Prediction☆25Jun 13, 2019Updated 6 years ago
- ☆34Jun 9, 2025Updated 11 months ago
- Automated Theorem Prover inspired by Aletheia. Claude Code for mathematicians.☆74Apr 20, 2026Updated 3 weeks ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Benchmarking the Spectrum of Agent Capabilities☆541Jan 23, 2024Updated 2 years ago
- ☆89Dec 15, 2023Updated 2 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- ☆69Mar 6, 2025Updated last year
- Lottery Tickets in Evolutionary Optimization (Lange & Sprekeler, ICML 2023)☆17Jun 2, 2023Updated 2 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- Official Repo of LangSuitE☆84Aug 15, 2024Updated last year
- Paper List of Minecraft Agents☆65Mar 6, 2026Updated 2 months ago
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Skill Design From AI Feedback☆34Feb 27, 2025Updated last year
- Standard interface for entity based reinforcement learning environments.☆38Feb 28, 2024Updated 2 years ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆87Mar 22, 2024Updated 2 years ago
- ☆75Apr 27, 2024Updated 2 years ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆268Jun 28, 2024Updated last year
- Code for demonstration example-task in RUDDER blog☆24May 19, 2020Updated 5 years ago
- Scalable Opponent Shaping Experiments in JAX☆26Apr 13, 2024Updated 2 years ago