schinger/AlphaZero

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/schinger/AlphaZero)

schinger / AlphaZero

Simplest AlphaZero Implementation

☆26

Alternatives and similar repositories for AlphaZero

Users that are interested in AlphaZero are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

schinger / FullLLM
View on GitHub
Full stack LLM (Pre-training/finetuning, PPO(RLHF), Inference, Quant, etc.)
☆31Feb 21, 2025Updated last year
schinger / DiffusionModel
View on GitHub
Implement Diffusion Model only by Pytorch and MLP
☆30Aug 18, 2024Updated last year
HauffQian / DGAP
View on GitHub
☆14May 13, 2025Updated last year
liuxhym / EDIS
View on GitHub
EDIS: Energy-guided DIffusion Sampling
☆19Aug 10, 2024Updated last year
LxzGordon / PECAN
View on GitHub
☆12Jan 4, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aaronserianni / attention-iou
View on GitHub
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13Mar 26, 2025Updated last year
menggehe / DRAW
View on GitHub
This repo provides the implemetation of the paper How to train your agent to read and write?
☆10Dec 29, 2020Updated 5 years ago
Zzl35 / flow-to-better
View on GitHub
☆27Apr 22, 2024Updated 2 years ago
MJ-Jang / BECEL
View on GitHub
☆10Jan 28, 2024Updated 2 years ago
YichenZW / awesome-llm-diversity
View on GitHub
A curated collection of research papers exploring diversity in Large Language Model text generation. This repository tracks cutting-edge …
☆15Jun 19, 2026Updated last month
shengyuzhang / Poet
View on GitHub
Poet: Product-oriented Video Captioner for E-commerce
☆12Sep 21, 2020Updated 5 years ago
tesslerc / TD3-JAX
View on GitHub
A JAX Implementation of the Twin Delayed DDPG Algorithm
☆35Mar 12, 2020Updated 6 years ago
moen-hyb / ATMOL
View on GitHub
A model named ATMOL for predicting molecular property
☆10May 2, 2022Updated 4 years ago
Michael-Beukman / DecisionAdapter
View on GitHub
Code for the paper Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies (NeurIPS 2023). https://arxiv.or…
☆15Nov 21, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
exalearn / molecular-graph-descriptors
View on GitHub
Codebase to accompany the paper A Look Inside the Black Box: Using Graph-Theoretical Descriptors to Interpret a Continuous-Filter Convolu…
☆12May 26, 2021Updated 5 years ago
ernie-research / CD-RLHF
View on GitHub
[ACL'25] Official code of curiosity-driven RLHF
☆16Jun 22, 2025Updated last year
Wangmerlyn / MCTS-GSM8k-Demo
View on GitHub
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
☆95Nov 13, 2025Updated 8 months ago
GeorgeVern / lmcor
View on GitHub
Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"
☆12Apr 20, 2024Updated 2 years ago
RoyalSkye / ATCL
View on GitHub
[NeurIPS 2022] "Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks"
☆13Nov 11, 2022Updated 3 years ago
xiaoboxia / CoDis
View on GitHub
ICCV'2023: Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples
☆12Oct 16, 2023Updated 2 years ago
shaoyijia / CMG
View on GitHub
Code for ECML-PKDD 2022 Paper --- CMG: A Class-Mixed Generation Approach to Out-of-Distribution Detection
☆13Oct 12, 2022Updated 3 years ago
microsoft / SuperRL
View on GitHub
☆15Sep 8, 2025Updated 10 months ago
stathius / sd-vae
View on GitHub
Code for the paper "Disentangled Generative Models for Robust Prediction of System Dynamics"
☆15May 2, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
zhengwang100 / dgpn
View on GitHub
Source code and dataset for KDD 2021 paper: Zero-shot Node Classification with Decomposed Graph Prototype Network.
☆12Jun 18, 2021Updated 5 years ago
AiDuanshiying / RoboPARA
View on GitHub
☆24Jan 29, 2026Updated 5 months ago
hklhai / neo4j-server
View on GitHub
智能写作系统服务端
☆18Jun 18, 2021Updated 5 years ago
xiaoboxia / RTM_LNL
View on GitHub
Regularly Truncated M-estimators for Learning with Noisy Labels
☆11Apr 24, 2024Updated 2 years ago
verlab / Structural_Reasoning_SRR
View on GitHub
End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…
☆13Apr 3, 2024Updated 2 years ago
MathGenie / MathGenie
View on GitHub
☆14Mar 11, 2024Updated 2 years ago
romanlee6 / multi_LLM_comm
View on GitHub
This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…
☆20Jun 11, 2024Updated 2 years ago
foryoung365 / fluent-reader2
View on GitHub
Modern desktop RSS reader built with Electron, React, and Fluent UI
☆15Jan 29, 2026Updated 5 months ago
kkjzio / pptist-aibackend
View on GitHub
自作pptist的ai后端
☆20Sep 16, 2025Updated 10 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
AlexeySorokin / EditScorer
View on GitHub
The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"
☆21Dec 14, 2022Updated 3 years ago
machinelearning4health / MedPath
View on GitHub
☆15Dec 18, 2021Updated 4 years ago
yinyueqin / relative-preference-optimization
View on GitHub
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
☆26Feb 23, 2024Updated 2 years ago
thu-coai / NAST
View on GitHub
Codes for "NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer" (ACL 2021 findings)
☆15Nov 3, 2021Updated 4 years ago
WangXFng / TrieLLM
View on GitHub
A VERY SIMPLE example to control LLMs for text generations via a Custom Trie (prefix tree).
☆14Oct 21, 2024Updated last year
TCXM / DEXTER-LLM
View on GitHub
DEXTER-LLM: Dynamic and Explainable Coordination of Multi-Robot Systems in Unknown Environments via Large Language Models
☆16Jun 18, 2026Updated last month
SII-MARFT / MARFT
View on GitHub
☆20May 14, 2026Updated 2 months ago