Intrinsic Motivation from Artificial Intelligence Feedback
☆135Nov 7, 2023Updated 2 years ago
Alternatives and similar repositories for motif
Users that are interested in motif are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Learn online intrinsic rewards from LLM feedback☆45Dec 17, 2024Updated last year
- Learning diverse options through the Laplacian representation.☆23Jan 5, 2024Updated 2 years ago
- ☆22Mar 28, 2025Updated last year
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Jun 14, 2021Updated 5 years ago
- Production build of the new website☆13May 19, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for "Baba Is AI: Break the Rules to Beat the Benchmark"☆47Sep 3, 2025Updated 9 months ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- Codebase for Inference-Time Policy Adapters☆25Nov 3, 2023Updated 2 years ago
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces☆18Dec 18, 2024Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆30Apr 8, 2026Updated 2 months ago
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆128Sep 22, 2024Updated last year
- [NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)☆13Oct 30, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆255Apr 9, 2026Updated 2 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Jul 12, 2023Updated 2 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆36Dec 8, 2022Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆24Jun 13, 2019Updated 7 years ago
- ☆34Jun 9, 2025Updated last year
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- Benchmarking the Spectrum of Agent Capabilities☆561Jan 23, 2024Updated 2 years ago
- ☆89Dec 15, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Lottery Tickets in Evolutionary Optimization (Lange & Sprekeler, ICML 2023)☆17Jun 2, 2023Updated 3 years ago
- ☆70Mar 6, 2025Updated last year
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- Official Repo of LangSuitE☆84Aug 15, 2024Updated last year
- Simple, extensible implementations of some meta-learning algorithms in Jax☆11Oct 6, 2020Updated 5 years ago
- Skill Design From AI Feedback☆33Feb 27, 2025Updated last year
- Standard interface for entity based reinforcement learning environments.☆39Feb 28, 2024Updated 2 years ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆87Mar 22, 2024Updated 2 years ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆268Jun 28, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆30Jan 22, 2025Updated last year
- Code for demonstration example-task in RUDDER blog☆24May 19, 2020Updated 6 years ago
- Scalable Opponent Shaping Experiments in JAX☆27Apr 13, 2024Updated 2 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Oct 27, 2025Updated 7 months ago
- CLARA: Code Language Assistant & Repository Analyzer☆95Jul 4, 2023Updated 2 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 7 months ago