10-zin / cpp-microgradLinks
A c/c++ implementation of micrograd: a tiny autograd engine with neural net on top.
☆73Updated 2 years ago
Alternatives and similar repositories for cpp-micrograd
Users that are interested in cpp-micrograd are comparing it to the libraries listed below
Sorting:
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆367Updated 6 months ago
- The Tensor (or Array)☆452Updated last year
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆160Updated last year
- UNet diffusion model in pure CUDA☆651Updated last year
- Tutorials on tinygrad☆431Updated 3 weeks ago
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆139Updated 11 months ago
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆163Updated 9 months ago
- Alex Krizhevsky's original code from Google Code☆199Updated 9 years ago
- CUDA Learning guide☆467Updated last year
- Apply GPU in ML and DL☆54Updated last month
- Simple MPI implementation for prototyping or learning☆286Updated 2 months ago
- Documented and Unit Tested educational Deep Learning framework with Autograd from scratch.☆122Updated last year
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆403Updated 8 months ago
- ☆81Updated last week
- Solve puzzles to improve your tinygrad skills!☆146Updated 2 weeks ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆276Updated 11 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆192Updated 2 years ago
- Visualization of cache-optimized matrix multiplication☆155Updated 7 months ago
- An implement of deep learning framework and models in C☆48Updated 7 months ago
- pytorch from scratch in pure C/CUDA and python☆41Updated last year
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆267Updated last year
- 100 days of building GPU kernels!☆521Updated 6 months ago
- High Quality Resources on GPU Programming/Architecture☆590Updated last year
- gpt written in plain c☆129Updated 4 months ago
- Learning about CUDA by writing PTX code.