VowpalWabbit / reinforcement_learning
Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update
☆75Updated 6 months ago
Alternatives and similar repositories for reinforcement_learning:
Users that are interested in reinforcement_learning are comparing it to the libraries listed below
- This project was moved to: https://github.com/coax-dev/coax☆160Updated 2 years ago
- Python implementation of GLN in different frameworks☆98Updated 4 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆59Updated 5 years ago
- Statistical adaptive stochastic optimization methods☆32Updated 5 years ago
- Functional machine learning for fun☆85Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 5 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Generic reinforcement learning codebase in TensorFlow☆95Updated 3 years ago
- Open-source library for a reinforcement learning research.☆54Updated 2 years ago
- Evaluation Framework for Probabilistic Programming Languages☆100Updated last year
- Repository of models in Pyro☆29Updated 9 months ago
- Deep RL Bootcamp solutions☆35Updated 7 years ago
- Contextual bandit benchmarking☆49Updated 8 months ago
- Using / reproducing DAC from the paper "Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees"☆27Updated 4 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆189Updated 2 years ago
- BOAH: Bayesian Optimization & Analysis of Hyperparameters☆67Updated 4 years ago
- Public repository for the work on bandit problems☆23Updated last year
- Reinforcement Learning Assembly☆92Updated 3 years ago
- 2019 talk at GECCO☆68Updated 5 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆39Updated 8 months ago
- PyTorch port and extension of the Deep Bayesian Bandits Library☆42Updated 5 years ago
- ☆48Updated 5 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆60Updated 3 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks☆50Updated 2 years ago
- Library for Multi-Armed Bandit Algorithms☆57Updated 8 years ago
- Experiment orchestration☆103Updated 4 years ago
- Code for NeurIPS 2019 paper: "Symmetry-Based Disentangled Representation Learning requires Interaction with Environments" by H. Caselles-…☆35Updated 5 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- The Differentiable Cross-Entropy Method☆126Updated 4 years ago
- Modular Probabilistic Programming on MXNet☆104Updated 2 years ago