ciamic / MCTSLinks
Implementation of Monte Carlo Tree Search
☆15Updated 3 years ago
Alternatives and similar repositories for MCTS
Users that are interested in MCTS are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen…☆26Updated 11 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆92Updated 7 months ago
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆134Updated last year
- An implementation of PPO in Pytorch☆106Updated last month
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆121Updated last year
- Explainable Reinforcement Learning (XRL) Resources☆47Updated last year
- ☆238Updated 2 months ago
- Kolmogorov-Arnold Network for Reinforcement Leaning, initial experiments☆296Updated 9 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆129Updated 2 months ago
- Gradient Boosting Reinforcement Learning (GBRL)☆136Updated this week
- Pytorch implementation of the xLSTM model by Beck et al. (2024)☆181Updated last year
- Experiments in Joint Embedding Predictive Architectures (JEPAs).☆46Updated 2 years ago
- Evaluating the Mamba architecture on the Othello game☆49Updated last year
- fast + parallel AlphaZero in JAX☆109Updated last year
- General multi-task deep RL Agent☆185Updated last year
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆89Updated last year
- Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines.☆402Updated last year
- Implementation of xLSTM in Pytorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆118Updated 3 weeks ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆64Updated last month
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆105Updated last year
- Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy d…☆58Updated 4 years ago
- A high throughput, end-to-end RL library for infinite-horizon tasks.☆22Updated 3 months ago
- ☆220Updated 2 years ago
- Reading list for adversarial perspective and robustness in deep reinforcement learning.☆129Updated 6 months ago
- Kolmogorov–Arnold Networks with modified activation (using MLP to represent the activation)☆108Updated 4 months ago
- This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.☆106Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Updated last year
- Contrastive Reinforcement Learning☆59Updated 2 weeks ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆57Updated last year
- ☆128Updated 2 years ago