Tencent-Hunyuan / flex-block-attnView external linksLinks
flex-block-attn: an efficient block sparse attention computation library
☆108Dec 26, 2025Updated last month
Alternatives and similar repositories for flex-block-attn
Users that are interested in flex-block-attn are comparing it to the libraries listed below
Sorting:
- Vortex: A Flexible and Efficient Sparse Attention Framework☆46Jan 21, 2026Updated 3 weeks ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆41Oct 11, 2024Updated last year
- A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training☆631Feb 6, 2026Updated last week
- ☆48Dec 13, 2025Updated 2 months ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆26Jan 22, 2026Updated 3 weeks ago
- ☆12Jan 25, 2024Updated 2 years ago
- Depth-Bounded PCFG Induction☆13Apr 19, 2019Updated 6 years ago
- Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"☆17Dec 17, 2025Updated last month
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- ☆12Mar 4, 2022Updated 3 years ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"☆137Oct 17, 2025Updated 3 months ago
- Triton Implementation of Flash Attention with Bias.☆20Apr 16, 2025Updated 9 months ago
- ☆16Dec 12, 2023Updated 2 years ago
- Official code for the paper "Attention as a Hypernetwork"☆47Jun 22, 2024Updated last year
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- data related codebase for polyglot project☆18Mar 30, 2023Updated 2 years ago
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆38Oct 17, 2025Updated 3 months ago
- ☆25Jun 19, 2025Updated 7 months ago
- ☆28Oct 2, 2025Updated 4 months ago
- Fast and memory-efficient exact kmeans☆138Updated this week
- Code repository for ICLR 2025 paper "LeanQuant: Accurate and Scalable Large Language Model Quantization with Loss-error-aware Grid"☆24Mar 2, 2025Updated 11 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆79Apr 2, 2024Updated last year
- ☆38Aug 7, 2025Updated 6 months ago
- Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"☆15Aug 2, 2019Updated 6 years ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning☆137Dec 19, 2025Updated last month
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Updated this week
- Helpful tools and examples for working with flex-attention☆1,127Updated this week
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 5 months ago
- Framework to reduce autotune overhead to zero for well known deployments.☆96Sep 19, 2025Updated 4 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Feb 28, 2025Updated 11 months ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆55Jul 1, 2025Updated 7 months ago
- Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"☆25Dec 3, 2023Updated 2 years ago
- ☆21Mar 3, 2025Updated 11 months ago
- ☆63Jun 12, 2025Updated 8 months ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆50Oct 21, 2023Updated 2 years ago
- Tile-Based Runtime for Ultra-Low-Latency LLM Inference☆567Jan 26, 2026Updated 2 weeks ago
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.☆326Updated this week
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 8 months ago