antirez / ttt-rl
Reinforcement Learning example in C, playing tic tac toe
☆513Updated last month
Alternatives and similar repositories for ttt-rl:
Users that are interested in ttt-rl are comparing it to the libraries listed below
- ☆1,252Updated 7 months ago
- creating a tiny tensor library in raw C☆680Updated 2 months ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆349Updated 2 weeks ago
- Textbook on reinforcement learning from human feedback☆855Updated this week
- The Tensor (or Array)☆432Updated 8 months ago
- Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O☆331Updated 3 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆1,126Updated this week
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆571Updated 2 months ago
- A modern, visual guide to programming, from assembly to high-level languages. Covers core concepts, performance, internals, and real-worl…☆28Updated 3 weeks ago
- Alex Krizhevsky's original code from Google Code☆191Updated 9 years ago
- This repository is a journey through Operating System concepts, with practical implementations in C. Each day focuses on a specific topic…☆257Updated 2 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,277Updated 3 weeks ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆333Updated 2 months ago
- A LC3 virtual machine implementation in a few lines of C code☆374Updated 2 years ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆550Updated 2 weeks ago
- a single pass c compiler in 500 lines of python☆178Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆667Updated last month
- Fast text search tool with advanced algorithms, SIMD acceleration, multi-threading, and regex support. Designed for rapid, large-scale pa…☆395Updated last week
- NAND is a logic simulator suite made entirely from NAND gates☆561Updated 2 weeks ago
- Learnings and programs related to CUDA☆380Updated 2 months ago
- Learning about CUDA by writing PTX code.☆128Updated last year
- A small stack based, written to bring Advent of Code 2022 Day 13 puzzle to the extreme consequences☆378Updated 2 years ago
- ☆296Updated 3 weeks ago
- A faithful clone of Karpathy's llama2.c (one file inference, zero dependency) but fully functional with LLaMA 3 8B base and instruct mode…☆126Updated 9 months ago
- gpt written in plain c☆120Updated 8 months ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆150Updated 11 months ago
- simple java virtual machine☆258Updated 3 years ago
- A Simple Virtual Computer☆345Updated this week
- The Multilayer Perceptron Language Model☆547Updated 9 months ago
- A guide that explains how programs transform from source code to executables. Deep dive into ELF format, linking processes, and binary op…☆328Updated 5 months ago