apd10 / RzLinear
A compressed alternative to matrix multiplication using state-of-the art compression ROBE-Z
☆9Updated 10 months ago
Related projects: ⓘ
- ☆14Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆58Updated 6 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆34Updated 2 months ago
- ☆50Updated 3 months ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆25Updated 3 weeks ago
- ☆30Updated 8 months ago
- ☆23Updated 9 months ago
- ☆83Updated 3 weeks ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆22Updated 3 months ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆43Updated last year
- The codes for training sparsity predictor on LLaMA.☆14Updated 4 months ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆30Updated last year
- Efficient 2:4 sparse training algorithms and implementations☆18Updated 3 months ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆23Updated last month
- Here we will test various linear attention designs.☆55Updated 4 months ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆47Updated 2 weeks ago
- Confident Adaptive Transformers☆12Updated 3 years ago
- Simple and fast low-bit matmul kernels in CUDA☆48Updated this week
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆12Updated 2 years ago
- ☆16Updated last year
- ☆38Updated 9 months ago
- Utilities for Training Very Large Models☆56Updated last week
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆14Updated last month
- ☆16Updated this week
- Fast Hadamard transform in CUDA, with a PyTorch interface☆87Updated 3 months ago
- RWKV model implementation☆38Updated last year
- Implementation of Hyena Hierarchy in JAX☆10Updated last year
- ☆66Updated 3 months ago
- ☆23Updated 6 months ago
- ☆14Updated 7 months ago