tensara / cli
CLI tool for submitting GPU kernels
☆12 Updated 8 months ago
Alternatives and similar repositories for cli
Users that are interested in cli are comparing it to the libraries listed below
- Keeping track of problems ive solved ☆12 Updated 4 years ago
- Competitive GPU kernel optimization platform. ☆149 Updated 2 weeks ago
- could we make an ml stack in 100,000 lines of code? ☆46 Updated last year
- speedrun implementation of dl papers throughout history ☆33 Updated last year
- Solve puzzles to improve your tinygrad skills! ☆178 Updated 3 months ago
- Tutorials on tinygrad ☆455 Updated 3 months ago
- ☆98 Updated last week
- High Quality Resources on GPU Programming/Architecture ☆591 Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch ☆276 Updated last year
- ☆96 Updated last year
- Noob Lessons from Stream about how GPUs work ☆136 Updated 9 months ago
- parallelized hyperdimensional tictactoe ☆126 Updated last year
- ☆10 Updated 2 years ago
- An archive of learning resources assembled by current Exun members and alumni. ☆15 Updated 3 years ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations ☆81 Updated 8 months ago
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish ☆172 Updated last year
- Basically Heterogenous Inference ☆115 Updated 3 months ago
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge) ☆47 Updated last year
- Alex Krizhevsky's original code from Google Code ☆199 Updated 9 years ago
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1. ☆194 Updated last year
- FastAsk is a Python package that installs an easy to use command to your terminal to get a quick answer to a question, using either OpenA… ☆53 Updated last year
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA ☆181 Updated last year
- Learning about CUDA by writing PTX code. ☆151 Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames ☆23 Updated last year
- Semantic search over every Emergent Ventures winner. ☆29 Updated last week
- Simple Transformer in Jax ☆142 Updated last year
- work @ comma.ai ☆174 Updated last year
- Solve Puzzles. Learn Metal 🤘 ☆597 Updated last year
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI. ☆155 Updated 2 years ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4 ☆655 Updated 7 months ago