YiwenAI / OpenTensor

☆14

Alternatives and similar repositories for OpenTensor

Users who are interested in OpenTensor are comparing it to the libraries listed below.

lowrollr / turbozero
fast + parallel AlphaZero in JAX
☆97 Updated 6 months ago
rlglab / minizero
MiniZero: An AlphaZero and MuZero Training Framework
☆94 Updated 4 months ago
kenjyoung / mctx_learning_demo
☆52 Updated 2 years ago
NTT123 / a0-jax
AlphaZero in JAX
☆78 Updated last year
bwfbowen / muax
A project that provides help for using DeepMind's mctx on gym-style environments.
☆60 Updated 8 months ago
andyljones / boardlaw
Scaling scaling laws with board games.
☆49 Updated 2 years ago
michaelnny / muzero
A PyTorch implementation of DeepMind's MuZero agent
☆35 Updated last year
epignatelli / navix
Accelerated minigrid environments with JAX
☆141 Updated last month
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56 Updated 2 years ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114 Updated 10 months ago
instadeepai / outer-value-function-meta-rl
Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
☆13 Updated 2 years ago
DHDev0 / Stochastic-muzero
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…
☆67 Updated last year
observer4599 / explainable-reinforcement-learning
Explainable Reinforcement Learning (XRL) Resources
☆41 Updated 9 months ago
google-deepmind / diplomacy
☆56 Updated last year
Reytuag / transformerXL_PPO_JAX
☆81 Updated 8 months ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73 Updated 10 months ago
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆21 Updated last year
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41 Updated 2 years ago
yycdavid / program-synthesis-guided-RL
☆24 Updated 2 years ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆107 Updated last year
gpoesia / minimo
Learning Formal Mathematics from Intrinsic Motivation
☆35 Updated last week
MatX-inc / seqax
seqax = sequence modeling + JAX
☆165 Updated last month
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆204 Updated last year
danijar / ninjax
General Modules for JAX
☆66 Updated 3 months ago
liuanji / WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
☆120 Updated 4 years ago
lowrollr / mctx-az
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆20 Updated 2 months ago
AlexGoldie / rl-learned-optimization
Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"
☆25 Updated 2 months ago
google-deepmind / tell_me_why_explanations_rl
☆36 Updated 2 years ago
DramaCow / jaxued
☆82 Updated 3 months ago
Zeta36 / muzero
A simple implementation of MuZero algorithm for connect4 game
☆96 Updated 4 years ago