titaneric / AutoDiff-from-scratch
Auto Differentiate from scratch based on Autograd
☆9Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for AutoDiff-from-scratch
- a simple implementation of autograd engine☆24Updated 6 years ago
- A PyTorch implementation of DeepMind's MuZero agent☆27Updated 11 months ago
- Learning some numerical linear algebra.☆71Updated 3 years ago
- Paper: Challenges in High-dimensional Reinforcement Learning with Evolution Strategies☆26Updated 2 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆44Updated last year
- A Python 3 Bandit Visualization Package☆10Updated 7 years ago
- A C++ pytorch implementation of MuZero☆32Updated 6 months ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch☆25Updated 2 years ago
- ☆48Updated last year
- Rudimentary automatic differentiation framework☆73Updated 5 years ago
- Reinforcement learning library in JAX.☆103Updated last year
- Tutorial on Multi-Agent Reinforcement for Train Scheduling☆11Updated 4 years ago
- ☆32Updated 4 years ago
- Fully differentiable RL environments, written in Ivy.☆62Updated last year
- Library for learning and inference with Sum-product Networks utilizing TensorFlow 2.x and Keras☆47Updated 3 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆58Updated 4 years ago
- Some small scale experiments for my blog posts 📝☆78Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆39Updated 2 years ago
- This is a self-contained repository to explain two basic Reinforcement (RL) algorithms.☆75Updated last month
- fork of rl-baseline-zoo☆21Updated 4 years ago
- ☆67Updated last year
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- Building your own autograd mechanism based on PyTorch tensor only (not Variable, can be seen as numpy array)☆18Updated 10 months ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆19Updated last year
- Distributed Bayesian Optimization☆23Updated 4 years ago
- This project was moved to: https://github.com/coax-dev/coax☆160Updated last year
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …☆16Updated 3 years ago
- ☆14Updated 8 years ago
- Reverse-mode automatic differentiation in Rust (experiment)☆61Updated 3 years ago