Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
☆31Sep 12, 2025Updated 7 months ago
Alternatives and similar repositories for LessIsMore
Users that are interested in LessIsMore are comparing it to the libraries listed below.
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆70Apr 25, 2026Updated last week
- ☆30Oct 3, 2022Updated 3 years ago
- Example implementation of "Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles" by Buu Phan, …☆18Jan 22, 2026Updated 3 months ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆25Mar 10, 2026Updated last month
- Generate a sequence of animations using a Fourier series expansion, based on the provided parametric equations, text characters (or symbo…☆13Apr 9, 2026Updated 3 weeks ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆133Jun 24, 2025Updated 10 months ago
- Vocabulary Parallelism☆26Mar 10, 2025Updated last year
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆16Dec 8, 2025Updated 4 months ago
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- An in-context learning research testbed☆19Mar 16, 2025Updated last year
- ONCache: A Cache-Based Low-Overhead Container Overlay Network☆21Jun 7, 2025Updated 10 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- ☆13May 21, 2024Updated last year
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆32Jun 14, 2024Updated last year
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆30Jun 23, 2025Updated 10 months ago
- ☆11Jan 17, 2024Updated 2 years ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- ☆16Jul 12, 2024Updated last year
- ☆17May 26, 2023Updated 2 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 10 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Fast and memory-efficient exact attention☆21Apr 10, 2026Updated 3 weeks ago
- Unofficial implementation of the paper "Exploring the Space of Key-Value-Query Models with Intention"☆12May 24, 2023Updated 2 years ago
- logit lens for VGGT☆27Dec 2, 2025Updated 5 months ago
- Awesome works based on SSM and Mamba☆16Apr 10, 2024Updated 2 years ago
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- ☆12Apr 9, 2025Updated last year
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆43Dec 1, 2025Updated 5 months ago
- ☆14Jan 20, 2025Updated last year
- Codebase for multilingual neural machine translation☆13Nov 24, 2022Updated 3 years ago
- lightsmile's personal mini general-purpose crawler framework for scraping publicly available corpus data from the web.☆13Sep 30, 2020Updated 5 years ago
- ☆15Jan 27, 2026Updated 3 months ago
- Agentic Virtual Lab☆19Nov 30, 2025Updated 5 months ago
- a collection of skills for vllm-omni☆57Apr 14, 2026Updated 2 weeks ago
- A Comprehensive Study Notes on Artificial Intelligence: dedicated to the exploration and understanding of AI concepts, algorithms, and ap…☆20Jan 14, 2026Updated 3 months ago
- ☆15Nov 18, 2025Updated 5 months ago
- Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding☆98Dec 2, 2025Updated 5 months ago
- ☆13Mar 14, 2026Updated last month
- PyCausalSim is a Python framework for discovering and validating causal relationships through simulation. Unlike traditional analytics th…☆32Dec 8, 2025Updated 4 months ago