tensara / cliLinks
CLI tool for submitting GPU kernels
☆12Updated 5 months ago
Alternatives and similar repositories for cli
Users that are interested in cli are comparing it to the libraries listed below
Sorting:
- speedrun implementation of dl papers throughout history☆33Updated last year
- Keeping track of problems ive solved☆12Updated 3 years ago
- Competitive GPU kernel optimization platform.☆135Updated this week
- High Quality Resources on GPU Programming/Architecture☆590Updated last year
- could we make an ml stack in 100,000 lines of code?☆46Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames☆24Updated last year
- Tutorials on tinygrad☆439Updated last month
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆277Updated last year
- Semantic search over every Emergent Ventures winner.☆27Updated last week
- ☆93Updated last week
- An archive of learning resources assembled by current Exun members and alumni.☆15Updated 3 years ago
- Following Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- parallelized hyperdimensional tictactoe☆125Updated last year
- FastAsk is a Python package that installs an easy to use command to your terminal to get a quick answer to a question, using either OpenA…☆53Updated 10 months ago
- ☆96Updated last year
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆46Updated last year
- WATonomous ASD Admissions Assignment☆21Updated 2 months ago
- Software for matching event participants based on interest using embeddings☆71Updated last year
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆70Updated 6 months ago
- Simple Transformer in Jax☆139Updated last year
- My path from leetcode hell to bigtech heaven☆16Updated last year
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆150Updated 2 years ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆195Updated 2 years ago
- ☆10Updated last year
- My name is Ozymandias, King of Kings; Look on my Works, ye Mighty, and despair!☆39Updated 2 years ago
- Learnings and programs related to CUDA☆426Updated 4 months ago
- Sudocrypt v10.0: A map-based text adventure.☆41Updated 4 years ago
- 💜 A webring for Software Engineering students at the University of Waterloo.☆29Updated last week
- Alex Krizhevsky's original code from Google Code☆198Updated 9 years ago
- Intro to leetcodes. Basic techniques, quicksort and hash structures implementation, space and time complexities.☆96Updated last year