spikedoanz / from-bits-to-intelligenceLinks
could we make an ml stack in 100,000 lines of code?
☆46Updated last year
Alternatives and similar repositories for from-bits-to-intelligence
Users that are interested in from-bits-to-intelligence are comparing it to the libraries listed below
Sorting:
- parallelized hyperdimensional tictactoe☆124Updated last year
- Solve puzzles to improve your tinygrad skills!☆142Updated 5 months ago
- Tutorials on tinygrad☆402Updated 3 weeks ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆278Updated 9 months ago
- Simple Transformer in Jax☆139Updated last year
- ☆89Updated last week
- Gradient descent is cool and all, but what if we could delete it?☆104Updated last week
- A light tensor library in zig.☆78Updated 6 months ago
- Tensor library with autograd using only Rust's standard library☆69Updated last year
- High Quality Resources on GPU Programming/Architecture☆588Updated last year
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆66Updated 3 months ago
- a tiny multidimensional array implementation in C similar to numpy, but only one file.☆229Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames☆24Updated last year
- A really tiny autograd engine☆95Updated 3 months ago
- An implement of deep learning framework and models in C☆48Updated 4 months ago
- moondream in zig.☆73Updated 2 months ago
- ☆93Updated 7 months ago
- SIMD quantization kernels☆83Updated this week
- Learnings and programs related to CUDA☆415Updated last month
- ☆96Updated last year
- pytorch from scratch in pure C/CUDA and python☆40Updated 10 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆45Updated 5 months ago
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆62Updated last year
- Learning about CUDA by writing PTX code.☆134Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 5 months ago
- Intro to leetcodes. Basic techniques, quicksort and hash structures implementation, space and time complexities.☆97Updated last year
- a tiny vectorstore implementation built with numpy.☆63Updated last year
- speedrun implementation of dl papers throughout history☆33Updated last year
- The Tensor (or Array)☆441Updated last year