tensara / cli
CLI tool for submitting GPU kernels
☆12 Updated 8 months ago
Alternatives and similar repositories for cli
Users who are interested in cli are comparing it to the libraries listed below.
- Keeping track of problems I've solved ☆12 Updated 4 years ago
- Solve puzzles to improve your tinygrad skills! ☆178 Updated 3 months ago
- A faster, more user-friendly course catalog. ☆34 Updated 2 months ago
- speedrun implementation of dl papers throughout history ☆33 Updated last year
- could we make an ml stack in 100,000 lines of code? ☆46 Updated last year
- High Quality Resources on GPU Programming/Architecture ☆591 Updated last year
- Competitive GPU kernel optimization platform. ☆149 Updated 2 weeks ago
- Tutorials on tinygrad ☆455 Updated 3 months ago
- ☆98 Updated this week
- FastAsk is a Python package that installs an easy to use command to your terminal to get a quick answer to a question, using either OpenA… ☆53 Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆276 Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames ☆23 Updated last year
- WATonomous ASD Admissions Assignment ☆22 Updated 4 months ago
- An archive of learning resources assembled by current Exun members and alumni. ☆15 Updated 3 years ago
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish ☆172 Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel ☆202 Updated 2 years ago
- Quantized LLM training in pure CUDA/C++. ☆235 Updated 2 weeks ago
- Eric's personal notes from NYSRG https://notes.ekzhang.com/events/nysrg ☆49 Updated 3 weeks ago
- parallelized hyperdimensional tictactoe ☆126 Updated last year
- Manage your ever-growing list of research papers ☆13 Updated 2 years ago
- 💜 A webring for Software Engineering students at the University of Waterloo. ☆31 Updated this week
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings. ☆63 Updated last year
- Noob Lessons from Stream about how GPUs work ☆136 Updated 9 months ago
- Alex Krizhevsky's original code from Google Code ☆199 Updated 9 years ago
- Puzzles for exploring transformers ☆384 Updated 2 years ago
- Learning about CUDA by writing PTX code. ☆152 Updated last year
- Complete solutions to the Programming Massively Parallel Processors Edition 4 ☆655 Updated 7 months ago
- Learnings and programs related to CUDA ☆432 Updated 7 months ago
- A really tiny autograd engine ☆99 Updated 8 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆155 Updated 2 years ago