VowpalWabbit / reinforcement_learning
Interaction-side integration library for Reinforcement Learning loops: Predict, Log, [Learn,] Update
☆75Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for reinforcement_learning
- Open-source library for a reinforcement learning research.☆54Updated last year
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆58Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 4 years ago
- This project was moved to: https://github.com/coax-dev/coax☆160Updated last year
- Functional machine learning for fun☆84Updated 3 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Generic reinforcement learning codebase in TensorFlow☆95Updated 3 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- Contextual bandit benchmarking☆49Updated 2 months ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 2 years ago
- Starter kit for the black box optimization challenge at Neurips 2020☆113Updated 4 years ago
- Performant, differentiable reinforcement learning☆121Updated 5 months ago
- Python implementation of GLN in different frameworks☆95Updated 4 years ago
- Reinforcement learning algorithms in RLlib☆56Updated 6 months ago
- ☆32Updated 4 years ago
- A collection of notebooks aiding the understanding of machine-learning papers.☆10Updated 3 years ago
- Source code for ICLR 2020 paper: "Learning to Guide Random Search"☆39Updated 2 months ago
- Augmented environments with RL☆102Updated 5 years ago
- Some hard problems for reinforcement learning.☆32Updated 6 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆59Updated 3 years ago
- Neuronal Circuit Policies☆40Updated 2 years ago
- Reinforcement learning library in JAX.☆102Updated last year
- PyTorch port and extension of the Deep Bayesian Bandits Library☆42Updated 5 years ago
- Statistical adaptive stochastic optimization methods☆32Updated 4 years ago
- Public repository for the work on bandit problems☆23Updated 7 months ago
- ☆182Updated 4 months ago
- Cellular automaton-based calculus for the masses☆24Updated 6 years ago
- BOAH: Bayesian Optimization & Analysis of Hyperparameters☆67Updated 4 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆188Updated 2 years ago