☆20May 31, 2019Updated 6 years ago
Alternatives and similar repositories for b-pro
Users that are interested in b-pro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- Code for 'The Grand Atari Challenge dataset' paper☆55Oct 25, 2017Updated 8 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- M.Sc. thesis: Cellular Automata + NeuroEvolution of Augmenting Topologies☆15Jan 12, 2018Updated 8 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Skip Context Tree Switching - Reference Implementation☆51Sep 13, 2017Updated 8 years ago
- Updated framework from the Ms Pac-Man vs Ghosts competition: https://www.facebook.com/pacman.vs.ghosts☆19Oct 16, 2017Updated 8 years ago
- A tutorial on doing RL research in Julia using both Jupyter notebooks and normal project structures.☆10Jun 23, 2021Updated 4 years ago
- Real-time visualisation☆29May 14, 2026Updated last week
- ☆11Feb 23, 2017Updated 9 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Low-rank Highway Networks☆13Mar 11, 2016Updated 10 years ago
- A graduate-level introduction to reinforcement learning as a framework for modeling, optimization, and control, connecting dynamic models…☆19Dec 9, 2025Updated 5 months ago
- ☆99Aug 25, 2016Updated 9 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Tensorflow DQN and DRQN agent playing doom☆35May 5, 2017Updated 9 years ago
- Implementation of condnets☆16Apr 21, 2016Updated 10 years ago
- Generic implementation of the dynamic programming algorithm for optimal system control☆11Mar 4, 2018Updated 8 years ago
- ☆47Sep 24, 2024Updated last year
- Optimized dqn for caffe☆11Dec 18, 2015Updated 10 years ago
- Pytorch implementation of paper "Distillation Techniques for Pseudo-rehearsal Based Incremental Learning"☆14May 5, 2026Updated 2 weeks ago
- An attempt to formalize my thoughts. A pythonic approach to mental housekeeping☆15Apr 21, 2016Updated 10 years ago
- ☆43Feb 9, 2017Updated 9 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- probability of mendelian error in trios.☆11Jan 27, 2016Updated 10 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Tensor Switching Networks☆12Nov 2, 2017Updated 8 years ago
- Accompanying code for the paper "Learning Causal Models Online"☆23Jul 14, 2020Updated 5 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Apr 10, 2016Updated 10 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 10 years ago
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- Dynamic models for building energy management☆28Mar 10, 2024Updated 2 years ago
- A minimal implementation of Go-Explore without domain knowledge☆15Apr 26, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Reinforcement learning environment for controlling greenhouse crop production systems. The greenhouse dynamics are based on GreenLight..☆26Mar 12, 2026Updated 2 months ago
- hierarchical deep reinforcement learning algorithms☆43Dec 12, 2017Updated 8 years ago
- Implementation and evaluation of Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic), an algorithm for multi-agent reinforcement l…☆10May 5, 2022Updated 4 years ago
- ☆11Sep 22, 2019Updated 6 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- Collection of reinforcement learners implemented in python. Mainly including DQN and its variants☆54Apr 23, 2017Updated 9 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"☆12Apr 23, 2021Updated 5 years ago