mishgon / alphastrassen
Reproduction of the AlphaTensor paper for 2x2 matrices
☆18 Updated last year
Alternatives and similar repositories for alphastrassen:
Users who are interested in alphastrassen are comparing it to the repositories listed below:
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆116 Updated 3 years ago
- Deep Reinforcement Learning by using Proximal Policy Optimization and Random Network Distillation in Tensorflow 2 and Pytorch with some e… ☆51 Updated 4 years ago
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms. ☆68 Updated 2 years ago
- ☆24 Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy ☆85 Updated 2 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration” ☆24 Updated 2 years ago
- A fully modular framework for modeling and optimizing analog neural networks ☆20 Updated 5 months ago
- Multi-Agent Determinantal Q-Learning ☆42 Updated 2 years ago
- An unofficial implementation for online decision transformer ☆40 Updated 2 years ago
- The Easiest Pytorch Implementation of Branching-DQN ☆9 Updated 4 years ago
- This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning". ☆53 Updated last year
- Online Decision Transformer ☆252 Updated last year
- ☆32 Updated 7 months ago
- Paper collection of reinforcement learning based combinatorial optimization ☆50 Updated 4 years ago
- Official repository of the paper "FightLadder: A Benchmark for Competitive Multi-Agent Reinforcement Learning" ☆21 Updated 8 months ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond. ☆45 Updated 2 years ago
- The AI Arena: A framework for distributed multi-agent reinforcement learning ☆14 Updated 2 years ago
- Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information. ☆45 Updated 4 years ago
- Efficient Exploration through Bayesian Deep-Q Networks. ☆17 Updated 3 years ago
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆49 Updated 7 months ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆151 Updated last year
- Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning ☆13 Updated 3 years ago
- A Really Scalable RL Framework to 10k+ CPUs ☆27 Updated last year
- 1st Solution For NeurIPS 2021 Competition on ML4CO Dual Task ☆28 Updated last year
- A distributed GPU-centric experience replay system for large AI models. ☆17 Updated last year
- ☆127 Updated 8 months ago
- ☆30 Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆48 Updated 2 years ago
- Official implementation of NeurIPS'23 paper "Macro Placement by Wire-Mask-Guided Black-Box Optimization" ☆20 Updated last year
- ☆108 Updated last year