mishgon / alphastrassen
Reproduction of AlphaTensor paper for 2x2 matrices
☆17Updated last year
Alternatives and similar repositories for alphastrassen
Users that are interested in alphastrassen are comparing it to the libraries listed below
Sorting:
- QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.☆69Updated 2 years ago
- Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information.☆46Updated 4 years ago
- ☆15Updated 5 months ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆118Updated 4 years ago
- A distributed GPU-centric experience replay system for large AI models.☆18Updated last year
- ☆18Updated 2 years ago
- [NeurIPS 2022] "NSNet: A General Neural Probabilistic Framework for Satisfiability Problems"☆18Updated 2 years ago
- Comprehensive Implementation of Proximal Policy Optimization☆10Updated 3 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆48Updated 10 months ago
- Efficient Exploration through Bayesian Deep-Q Networks.☆17Updated 3 years ago
- ☆24Updated 2 years ago
- ☆30Updated 2 years ago
- Must-read papers on Reinforcement Learning (RL)☆48Updated 4 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆18Updated last year
- Official implementation of NeurIPS'23 paper "Macro Placement by Wire-Mask-Guided Black-Box Optimization"☆22Updated last month
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆24Updated 2 years ago
- An RL-Friendly Vision-Language Model for Minecraft☆31Updated 7 months ago
- Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning☆13Updated 3 years ago
- Multi-Agent Determinantal Q-Learning☆42Updated 2 years ago
- Gated Transformer Model for Computer Vision☆23Updated 3 years ago
- PyTorch implementation for the Deep Symbolic Simplification Without Human Knowledge☆14Updated 4 years ago
- A Deep-Reinforcement-Learning-Based Scheduler for FPGA HLS☆14Updated 4 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Updated 2 years ago
- MARS is shortened for Multi-Agent Research Studio, a library for mulit-agent reinforcement learning research.☆48Updated last year
- Official implementation of "Learning from Visual Observation via Offline Pretrained State-to-Go Transformer"☆19Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆86Updated 3 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆32Updated 2 years ago
- Launch programs on multiple hosts. (多机启动程序)☆14Updated last year
- PyTorch implementation for all models and environments in the paper "Learning to Ground Multi-Agent Communication with Autoencoders"☆46Updated 3 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆58Updated 2 years ago