Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Oct 3, 2023Updated 2 years ago
Alternatives and similar repositories for AlphaNPI
Users that are interested in AlphaNPI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Template for building 2D grid worlds with OpenAI Gym and Pycolab☆14Jun 12, 2019Updated 7 years ago
- ☆25Nov 23, 2021Updated 4 years ago
- Code for "Learning Compositional Rules via Neural Program Synthesis"☆60Dec 7, 2020Updated 5 years ago
- Basic experiment framework for tensorflow.☆91Jun 24, 2021Updated 4 years ago
- ☆18Jul 15, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for NeurIPS 2019 paper: "Symmetry-Based Disentangled Representation Learning requires Interaction with Environments" by H. Caselles-…☆34Dec 9, 2019Updated 6 years ago
- Variational Walkback, NIPS'17☆28Oct 18, 2017Updated 8 years ago
- ☆19Nov 7, 2020Updated 5 years ago
- Karel dataset for program synthesis and program induction☆79Dec 24, 2017Updated 8 years ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆61Oct 23, 2023Updated 2 years ago
- A TensorFlow Label Propagation library☆13Apr 7, 2018Updated 8 years ago
- Code for the paper Physics-as-Inverse-Graphics: Joint Unsupervised Learning of Objects and Physics from Video☆41May 22, 2023Updated 3 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Apr 13, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- STRIPS Planning in Infinite Domains☆19Oct 19, 2020Updated 5 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆119Dec 13, 2019Updated 6 years ago
- Learning and Reasoning with Graph-Structured Data (ICML 2019 Workshop)☆26Jul 18, 2019Updated 6 years ago
- Scrap Your Boilerplate for MetaOCaml with modular implicits☆18Dec 21, 2015Updated 10 years ago
- CompILE: Compositional Imitation Learning and Execution (ICML 2019)☆112May 12, 2019Updated 7 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Nov 6, 2020Updated 5 years ago
- Silly twitter torch implementations.☆48Oct 14, 2022Updated 3 years ago
- This project was moved to: https://github.com/coax-dev/coax☆161Nov 28, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆79Aug 13, 2020Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆77Mar 16, 2023Updated 3 years ago
- BabyAI platform. A testbed for training agents to understand and execute language commands.☆765Oct 1, 2023Updated 2 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆46Nov 22, 2022Updated 3 years ago
- pytorch implementation of "S3NET: GRAPH REPRESENTATIONAL NETWORK FOR SKETCH RECOGNITION"☆10Oct 6, 2020Updated 5 years ago
- Code for Harmonic Exponential Families on Manifolds☆10Jun 2, 2016Updated 10 years ago
- Code for Semantically Robust Unpaired Image Translation for Data with Unmatched Semantics Statistics (SRUNIT), ICCV 2021☆11Feb 10, 2022Updated 4 years ago
- ☆17May 16, 2018Updated 8 years ago
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch☆207May 20, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Qt-like event loops, signals and slots for communication across threads and processes in Python☆14Mar 26, 2024Updated 2 years ago
- Background materials for the article "Productivity Assessment of Neural Code Completion"☆16Jul 11, 2023Updated 2 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Aug 11, 2020Updated 5 years ago
- A highly-customisable gridworld game engine with some batteries included. Make your own gridworld games to test reinforcement learning ag…☆665Sep 6, 2019Updated 6 years ago
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs☆21Oct 19, 2025Updated 7 months ago
- No control flow, only exceptions☆10Dec 13, 2018Updated 7 years ago