mishgon / alphastrassenLinks

Reproduction of AlphaTensor paper for 2x2 matrices

☆17

Alternatives and similar repositories for alphastrassen

Users that are interested in alphastrassen are comparing it to the libraries listed below

Sorting:

bigrl-team / gear
A distributed GPU-centric experience replay system for large AI models.
☆18Updated last year
harvard-edge / QuaRL
QuaRL is an open-source framework for systematically studying the effect of applying quantization to reinforcement learning algorithms.
☆68Updated 2 years ago
liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆119Updated 4 years ago
aoiang / LaMOO
☆29Updated 2 years ago
openpsi-project / srl
A Really Scalable RL Framework to 10k+ CPUs
☆33Updated last year
jys5609 / MC-LAVE-RL
ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
☆33Updated last year
avillaflor / SPLT-transformer
☆18Updated 2 years ago
Bick95 / PPO
Comprehensive Implementation of Proximal Policy Optimization
☆10Updated 3 years ago
CyCTW / Parallel-MCTS
Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information.
☆46Updated 4 years ago
yycdavid / program-synthesis-guided-RL
☆24Updated 2 years ago
CatherineMeng / FGYM-user-demo
Demonstrating the usage of FGYM: A Toolkit for benchmarking FPGA-accelerated Reinforcement Learning
☆13Updated 3 years ago
dhesin / minerl
MineRL DDPG Agent to Obtain Diamond in Minecraft
☆14Updated 5 years ago
rraileanu / policy-dynamics-value-functions
☆32Updated 9 months ago
tianjunz / NovelD
☆41Updated 3 years ago
jqueeney / geppo
Generalized Proximal Policy Optimization with Sample Reuse (GePPO)
☆24Updated last year
instadeepai / outer-value-function-meta-rl
Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
☆13Updated 2 years ago
guptav96 / BDQN-PyTorch
Efficient Exploration through Bayesian Deep-Q Networks.
☆18Updated 3 years ago
eric-mitchell / macaw
Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]
☆47Updated 2 years ago
lamda-bbo / madac
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆25Updated 2 years ago
alantess / gtrxl-torch
Gated Transformer Model for Computer Vision
☆23Updated 3 years ago
sysu-eda / DeepRL-Scheduling
A Deep-Reinforcement-Learning-Based Scheduler for FPGA HLS
☆14Updated 4 years ago
MIRALab-USTC / RLPapers
Must-read papers on Reinforcement Learning (RL)
☆50Updated 4 years ago
tyq1024 / RLx2
☆30Updated 2 years ago
micahcarroll / uniMASK
Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"
☆55Updated 11 months ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41Updated 2 years ago
DT6A / ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆14Updated last year
daniellawson9999 / online-decision-transformer
An unofficial implementation for online decision transformer
☆40Updated 2 years ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56Updated 2 years ago
staghuntrpg / RPG
This is the source code of RPG (Reward-Randomized Policy Gradient)
☆42Updated 2 years ago
koulanurag / minimal-marl
Minimal implementation of multi-agent reinforcement learning algorithms
☆55Updated 3 years ago