Code implementation of: "Graying the black box: Understanding DQNs"
☆20Feb 23, 2017Updated 9 years ago
Alternatives and similar repositories for GrayingTheBox
Users who are interested in GrayingTheBox are comparing it to the libraries listed below.
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Counterfactual explanations for Reinforcement Learning agents on Atari☆12Apr 3, 2023Updated 3 years ago
- yet another reinforcement learning package☆12May 24, 2022Updated 4 years ago
- Meta Reinforcement Learning Experiments☆35Aug 22, 2017Updated 8 years ago
- Code for our paper "Visualizing and Understanding Atari Agents" (https://goo.gl/AMAoSc)☆125Oct 21, 2021Updated 4 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 4 years ago
- Reinforcement Learning☆12Jun 22, 2017Updated 9 years ago
- The Variational Homoencoder: Learning to learn high capacity generative models from few examples☆34Jul 13, 2023Updated 2 years ago
- Causal Deconvolution of Networks by Algorithmic Generative Models☆30May 9, 2019Updated 7 years ago
- A PyTorch implementation of SSINet.☆16Nov 10, 2020Updated 5 years ago
- Hierarchical Deep RL Network☆31Feb 20, 2017Updated 9 years ago
- Deep Reinforcement Active learning - Master Thesis☆19Dec 7, 2022Updated 3 years ago
- ☆14Dec 4, 2018Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆29Dec 11, 2020Updated 5 years ago
- Fully Cooperative Multi-Agent Deep Reinforcement Learning☆27Nov 20, 2019Updated 6 years ago
- The Easiest Pytorch Implementation of Branching-DQN☆12Feb 10, 2021Updated 5 years ago
- Random memory adaptation model inspired by the paper: "Memory-based parameter adaptation (MbPA)"☆24Mar 13, 2018Updated 8 years ago
- Ground control station and optimization code from Tango on Quadrotors project - NTR 50759☆12Aug 26, 2018Updated 7 years ago
- 2D multiplayer dogfighting game, written in Rust☆10Apr 5, 2024Updated 2 years ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"☆10Mar 27, 2018Updated 8 years ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated last year
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- This repository is for a project about Distributed TDMA for Mobile UWB Network Localization☆15Jun 1, 2021Updated 5 years ago
- A simple systemd service to better control Framework Laptop's fan *Ryzen 7040*☆15Dec 15, 2023Updated 2 years ago
- Simple GStreamer test programs for learning purposes.☆13Jul 27, 2013Updated 12 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"☆12Jul 12, 2021Updated 4 years ago
- ☆10Feb 22, 2018Updated 8 years ago
- Trust Region Policy Optimization with Generalized Advantage Estimator☆16Nov 15, 2018Updated 7 years ago
- Gstreamer, Qt, RTSP server☆15Sep 7, 2018Updated 7 years ago
- Deep reinforcement learning package for torch7☆16Sep 17, 2016Updated 9 years ago
- This is a sample implementation of "TIMERS: Error-Bounded SVD Restart on Dynamic Networks"(AAAI 2018).☆12Jul 4, 2018Updated 7 years ago
- Attentional mechanism incorporated in DeepMind's Asynchronous Advantage Actor-Critic (A3C/A2C)☆10Jan 9, 2018Updated 8 years ago
- ☆12Dec 15, 2024Updated last year
- Companion code for Closed-Loop Koopman Operator Approximation☆17Mar 24, 2024Updated 2 years ago
- reimplementation of the ddpg algorithm using tensorflow☆37Oct 17, 2016Updated 9 years ago
- The code for experiments conducted to verify the correctness of mirror learning.☆11Jun 3, 2022Updated 4 years ago
- ☆10Feb 20, 2024Updated 2 years ago
- ☆13Jul 13, 2022Updated 3 years ago