Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
☆52Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for mmn
Users that are interested in mmn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for the differential saliency method used in "Re-understanding Finite-State Representations of Recurrent Policy Networks"☆11Oct 4, 2023Updated 2 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- Interacting with Latent Space of AutoEncoder☆21Nov 22, 2022Updated 3 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated 2 years ago
- ☆26Apr 16, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- MuJoCo model for Blue☆10Mar 13, 2020Updated 6 years ago
- A boilerplate for gin + gorm + postgres + docker☆26May 8, 2023Updated 3 years ago
- ☆20Apr 29, 2019Updated 7 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated 2 years ago
- ☆29Oct 26, 2020Updated 5 years ago
- ☆12Sep 21, 2024Updated last year
- ☆19Mar 1, 2023Updated 3 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- a django proftolio app☆18Aug 16, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Sparse n-dimensional arrays in Python☆12Feb 10, 2010Updated 16 years ago
- Implementation of Spectral State Space Models☆16Feb 23, 2024Updated 2 years ago
- Code for our paper "Visualizing and Understanding Atari Agents" (https://goo.gl/AMAoSc)☆126Oct 21, 2021Updated 4 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆180Jun 23, 2023Updated 2 years ago
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 4 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch☆61Jun 13, 2020Updated 5 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- ☆332Dec 19, 2024Updated last year
- Gymnasium environment for reinforcement learning with multicopters☆32Jun 4, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- From pixels to symbolic rule learning☆12Nov 12, 2021Updated 4 years ago
- High-performance JAX-powered simulator for robotic navigation in 2D mazes, optimized for Quality-Diversity algorithm research and benchma…☆20Jun 19, 2025Updated 11 months ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Jul 26, 2019Updated 6 years ago
- Implementation codes of deep generative models with Pixyz☆107Jan 10, 2023Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- an enumerative reactive synthesis tool for the GR(1) fragment of LTL☆13Jan 5, 2026Updated 5 months ago
- Synthesis Format Conversion Tool☆28Nov 18, 2025Updated 6 months ago
- Python package for Dec-POMDP files in the .dpomdp format☆11Oct 28, 2022Updated 3 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆36Jul 8, 2025Updated 11 months ago
- stringsvc demo of Go kit reimplemented using the API gateway pattern.☆10Jan 31, 2020Updated 6 years ago
- NS3 module for simulating DOCSIS 3.1 links☆16Apr 5, 2024Updated 2 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆28May 22, 2023Updated 3 years ago
- ☆18May 24, 2021Updated 5 years ago
- This is an example of the design-by-contract method☆14Dec 27, 2022Updated 3 years ago