s-casci / tinyzeroLinks

Easily train AlphaZero-like agents on any environment you want!

☆430

Alternatives and similar repositories for tinyzero

Users that are interested in tinyzero are comparing it to the libraries listed below

Sorting:

bsilverthorn / maccarone
AI-managed code blocks in Python ⏪⏩
☆469Updated last year
bananaml / fructose
☆745Updated last year
Tanuki / tanuki.py
Prompt engineering for developers
☆687Updated last year
alantech / marsha
Marsha is a functional, higher-level, English-based programming language that gets compiled into tested Python software by an LLM
☆470Updated last year
aymenfurter / microagents
Agents Capable of Self-Editing Their Prompts / Python Code
☆771Updated last year
labmlai / inspectus
LLM Analytics
☆675Updated 9 months ago
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆620Updated 4 months ago
rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…
☆286Updated last month
joennlae / tensorli
Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).
☆253Updated last year
okuvshynov / slowllama
Finetune llama2-70b and codellama on MacBook Air without quantization
☆448Updated last year
modal-labs / devlooper
A program synthesis agent that autonomously fixes its output by running tests!
☆463Updated 10 months ago
jostmey / NakedAttention
Revealing example of self-attention, the building block of transformer AI models
☆131Updated 2 years ago
facebookresearch / searchformer
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
☆372Updated last year
KhoomeiK / LlamaGym
Fine-tune LLM agents with online reinforcement learning
☆1,208Updated last year
google-deepmind / searchless_chess
Grandmaster-Level Chess Without Search
☆585Updated 6 months ago
amaiya / onprem
A toolkit for applying on-premises large language models to non-public data
☆729Updated this week
operand / agency
A fast and minimal framework for building agentic systems
☆435Updated last week
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆367Updated 6 months ago
lechmazur / elimination_game
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…
☆283Updated 2 weeks ago
arena-ai / structured-logprobs
OpenAI's Structured Outputs with Logprobs
☆181Updated 2 months ago
andyk / recursive_llm
Implement recursion using English as the programming language and an LLM as the runtime.
☆239Updated 2 years ago
simulatrex / simulatrex-engine
Enable decision-making based on simulations
☆227Updated last year
adamkarvonen / chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …
☆208Updated 8 months ago
Ads97 / WhatsApp-Llama
Finetune a LLM to speak like you based on your WhatsApp Conversations
☆367Updated last year
leapingio / leaping
☆263Updated last year
capjamesg / visionscript
A high-level programming language for using computer vision.
☆344Updated last year
aasimsani / meow-learning
This is a pick-your-problem style guide I created to educate everyone from my leadership team to my ML engineers on the process of how to…
☆118Updated last year
felafax / felafax
Felafax is building AI infra for non-NVIDIA GPUs
☆567Updated 6 months ago
samvher / bert-for-laptops
A BERT that you can train on a (gaming) laptop.
☆209Updated last year
wgryc / phasellm
Large language model evaluation and workflow framework from Phase AI.
☆456Updated 6 months ago