☆43Mar 15, 2025Updated 11 months ago
Alternatives and similar repositories for Block-Attention
Users that are interested in Block-Attention are comparing it to the libraries listed below
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆16Sep 15, 2024Updated last year
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning☆34Jan 11, 2026Updated last month
- ☆21Jan 16, 2025Updated last year
- [ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models☆27Jul 7, 2025Updated 7 months ago
- ☆23Mar 31, 2023Updated 2 years ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Jan 23, 2024Updated 2 years ago
- EMNLP 2023 - InfoSeek: A New VQA Benchmark Focused on Visual Info-Seeking Questions☆25May 30, 2024Updated last year
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw…☆31May 7, 2024Updated last year
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou…☆28May 3, 2025Updated 10 months ago
- Code and Data for "Language Modeling with Editable External Knowledge"☆36Jun 19, 2024Updated last year
- ☆88Sep 10, 2025Updated 5 months ago
- ☆12Jul 4, 2024Updated last year
- Classic Chess game using x86 Assembly Language☆11Apr 23, 2019Updated 6 years ago
- Code for an agentic RAG method with a dynamic workflow.☆12Jan 22, 2026Updated last month
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- ☆13Sep 8, 2024Updated last year
- Using OpenVINO to speed up inference of PaddleOCR-VL model☆25Updated this week
- Open source simulator for porous media flow☆14Oct 15, 2022Updated 3 years ago
- Code and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- ☆13Jul 8, 2020Updated 5 years ago
- ☆10Mar 31, 2022Updated 3 years ago
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Experiments related to LLMs for Vietnamese (inspired by the Physics of LLMs series)☆11Oct 21, 2024Updated last year
- ☆13Jan 7, 2025Updated last year
- ☆12Updated this week
- ☆11Aug 12, 2024Updated last year
- parquet dedupe estimator☆25Feb 20, 2026Updated last week
- ☆14Oct 17, 2024Updated last year
- ☆17Apr 15, 2025Updated 10 months ago
- Fully working chess game implemented in the x86 Intel Assembly language☆12Oct 3, 2022Updated 3 years ago
- Work on UpliftRec☆10Dec 10, 2024Updated last year
- Upload a document image or PDF, or provide a URL, to convert it into a structured format using SmolDocling.☆16Mar 31, 2025Updated 11 months ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 9 months ago
- Develop a machine learning (ML) model for lung cancer detection using U-Net and DenseNet architectures. Achieve an accuracy of at least 9…☆10Dec 9, 2023Updated 2 years ago
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- ☆18Jun 23, 2025Updated 8 months ago
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- Plagiarism Detection Approach for PAN 2015 Text Alignment task☆11May 11, 2018Updated 7 years ago