jla524 / road-to-llm
A learning roadmap from the tensor to large language models (LLMs).
☆10Updated 7 months ago
Alternatives and similar repositories for road-to-llm:
Users that are interested in road-to-llm are comparing it to the libraries listed below
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course.☆1,057Updated last week
- High Quality Resources on GPU Programming/Architecture☆586Updated 9 months ago
- a tiny multidimensional array implementation in C similar to numpy, but only one file.☆228Updated 8 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆174Updated 8 months ago
- ☆241Updated 3 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆252Updated 5 months ago
- ☆126Updated last year
- Tutorials on tinygrad☆370Updated last month
- (WIP) A small but powerful, homemade PyTorch from scratch.☆543Updated last week
- learningggggggg 🐳☆513Updated 3 weeks ago
- Solve puzzles to improve your tinygrad skills!☆122Updated last month
- rewritingggggggg 🐳☆9Updated 4 months ago
- ☆1,065Updated last week
- GPU Kernels☆160Updated 2 weeks ago
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆114Updated 3 months ago
- ☆243Updated 2 weeks ago
- could we make an ml stack in 100,000 lines of code?☆42Updated 9 months ago
- GPU programming related news and material links☆1,461Updated 3 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆215Updated 3 months ago
- A c/c++ implementation of micrograd: a tiny autograd engine with neural net on top.☆67Updated last year
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆324Updated 2 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆337Updated last month
- UNet diffusion model in pure CUDA☆602Updated 9 months ago
- machine learning from absolute scratch in c. gradients, linear algebra ops & everything else without using any third party library!☆22Updated 8 months ago
- The Tensor (or Array)☆429Updated 8 months ago
- Solve Puzzles. Learn Metal 🤘☆549Updated 7 months ago
- 100 days of building GPU kernels!☆345Updated this week
- Uses Twitter archive to visualize your Twitter network based on your replies, quote tweets and direct messaging history. Get DM stats wit…☆130Updated 10 months ago
- work @ comma.ai☆175Updated 5 months ago
- parallelized hyperdimensional tictactoe☆117Updated 8 months ago