goodevening13 / aquakvLinks
☆16Updated this week
Alternatives and similar repositories for aquakv
Users that are interested in aquakv are comparing it to the libraries listed below
Sorting:
- supporting pytorch FSDP for optimizers☆83Updated 10 months ago
- Load compute kernels from the Hub☆308Updated this week
- ☆121Updated last year
- ☆102Updated last week
- Work in progress.☆74Updated 4 months ago
- ☆152Updated 4 months ago
- ☆91Updated last year
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆147Updated last year
- This repository contains the experimental PyTorch native float8 training UX☆223Updated last year
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆92Updated 3 months ago
- QuIP quantization☆59Updated last year
- ☆15Updated 2 years ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆130Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆168Updated 4 months ago
- Code for studying the super weight in LLM☆119Updated 10 months ago
- nanoGPT-like codebase for LLM training☆109Updated 5 months ago
- The evaluation framework for training-free sparse attention in LLMs☆102Updated 2 weeks ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆116Updated last month
- A library for unit scaling in PyTorch☆132Updated 3 months ago
- Official implementation for Training LLMs with MXFP4☆100Updated 6 months ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆84Updated last year
- Prune transformer layers☆69Updated last year
- ☆68Updated 11 months ago
- Experiment of using Tangent to autodiff triton☆80Updated last year
- Efficient optimizers☆275Updated 2 weeks ago
- ring-attention experiments☆155Updated last year
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆69Updated last year
- ☆224Updated last week
- A bunch of kernels that might make stuff slower 😉☆63Updated this week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆194Updated 4 months ago