antirez / ttt-rlLinks
Reinforcement Learning example in C, playing tic tac toe
☆540Updated 4 months ago
Alternatives and similar repositories for ttt-rl
Users that are interested in ttt-rl are comparing it to the libraries listed below
Sorting:
- ☆1,267Updated 9 months ago
- creating a tiny tensor library in raw C☆728Updated 4 months ago
- Very early versions of Redis with some backstory in the README☆125Updated last week
- gpt written in plain c☆124Updated 2 weeks ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆350Updated 2 months ago
- ☆222Updated 6 months ago
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆134Updated 8 months ago
- Alex Krizhevsky's original code from Google Code☆194Updated 9 years ago
- GPT-2 in C☆72Updated 6 months ago
- Minimalist MNIST implementation with two hidden layers written in C☆104Updated 3 months ago
- Will write CUDA for 100 days☆32Updated last month
- CPU inference for the DeepSeek family of large language models in C++☆308Updated last month
- Simple MPI implementation for prototyping or learning☆263Updated 3 weeks ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆555Updated last week
- A minimal TCP/IP stack☆527Updated last year
- The Tensor (or Array)☆437Updated 11 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆597Updated 4 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 9 months ago
- prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters☆975Updated this week
- Linux containers from scratch in C.☆1,527Updated 2 weeks ago
- Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O☆388Updated last month
- GGUF implementation in C as a library and a tools CLI program☆276Updated 6 months ago
- NES toolchain & emulator in C☆130Updated 7 months ago
- A LC3 virtual machine implementation in a few lines of C code☆383Updated 2 years ago
- Code for the video on feed-forward language model☆70Updated last year
- a tiny multidimensional array implementation in C similar to numpy, but only one file.☆228Updated 11 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,479Updated 3 months ago
- Tiny x86_64 OS in C☆75Updated last month
- Writing a compiler in a week-end☆399Updated 4 months ago
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆637Updated last month