Kernel-Machines / kermacLinks
Pytorch routines for (Ker)nel (Mac)hines
☆10Updated 3 months ago
Alternatives and similar repositories for kermac
Users that are interested in kermac are comparing it to the libraries listed below
Sorting:
- Personal solutions to the Triton Puzzles☆20Updated last year
- ☆28Updated last year
- A bunch of kernels that might make stuff slower 😉☆75Updated this week
- Experiment of using Tangent to autodiff triton☆82Updated 2 years ago
- extensible collectives library in triton☆95Updated 10 months ago
- Automatic differentiation for Triton Kernels☆29Updated 5 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆68Updated last week
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Updated last year
- ☆33Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Updated 2 months ago
- Sparsity support for PyTorch☆38Updated 10 months ago
- Einsum-like high-level array sharding API for JAX☆34Updated last year
- ☆18Updated 2 months ago
- Triton-based Symmetric Memory operators and examples☆81Updated 3 weeks ago
- ☆15Updated 3 months ago
- train with kittens!☆63Updated last year
- Custom triton kernels for training Karpathy's nanoGPT.☆19Updated last year
- ☆40Updated 2 years ago
- Ship correct and fast LLM kernels to PyTorch☆140Updated 3 weeks ago
- Parallel framework for training and fine-tuning deep neural networks☆70Updated 2 months ago
- FlexAttention w/ FlashAttention3 Support☆27Updated last year
- JAX implementation of the Mistral 7b v0.2 model☆35Updated last year
- JAX bindings for Flash Attention v2☆103Updated last week
- ☆147Updated this week
- Make triton easier☆50Updated last year
- An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.☆52Updated this week
- Collection of kernels written in Triton language☆178Updated last week
- ☆39Updated last month
- ☆22Updated 9 months ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆98Updated last year