Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
☆29Sep 12, 2025Updated 5 months ago
Alternatives and similar repositories for LessIsMore
Users that are interested in LessIsMore are comparing it to the libraries listed below
Sorting:
- ☆29Oct 3, 2022Updated 3 years ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆129Jun 24, 2025Updated 8 months ago
- Vocabulary Parallelism☆25Mar 10, 2025Updated 11 months ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆30Jun 14, 2024Updated last year
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- fine-tuning tutorial☆18Feb 20, 2026Updated last week
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- DOS Program Development☆13Nov 9, 2022Updated 3 years ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆27Nov 1, 2025Updated 4 months ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Prefix-Aware Attention for LLM Decoding☆29Jan 23, 2026Updated last month
- LightGBM for handling label-imbalanced data with focal and weighted loss functions in binary and multiclass classification☆21Jan 29, 2026Updated last month
- A simple MIPS CPU for BUAA CO course (and now NSCSCC).☆10May 15, 2021Updated 4 years ago
- ☆10Feb 10, 2022Updated 4 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Feb 14, 2026Updated 2 weeks ago
- ☆14Dec 12, 2022Updated 3 years ago
- A distributed stream querying engine that provides sub-millisecond stateful query at millions of queries per-second over fast-evolving li…☆10Jul 18, 2018Updated 7 years ago
- ☆15Jan 27, 2026Updated last month
- ☆11Dec 6, 2023Updated 2 years ago
- ☆13Jun 26, 2025Updated 8 months ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- ☆10Jan 9, 2024Updated 2 years ago
- This is the code of a agentic rag method with dynamic workflow.☆12Jan 22, 2026Updated last month
- Minimal code for extracting structured Insights from Sustainability Reports via Large Language Models☆12Jul 9, 2025Updated 7 months ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆55Feb 20, 2025Updated last year
- Dev Dive 2022 세션 "TDD: 내 코드의 품질을 높여주는 Type-Driven Development" 장표☆12Nov 20, 2022Updated 3 years ago
- ☆11Sep 12, 2023Updated 2 years ago
- Hunt Town is a web3 co-building community where builders come together to contribute to the expansion of web3 culture and products.☆14Jan 15, 2026Updated last month
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆13Jul 12, 2025Updated 7 months ago
- Clober Solidity Library☆10Jun 9, 2025Updated 8 months ago
- MyScale Vector Database Benchmark☆16Aug 20, 2024Updated last year
- This is a joint project between Helmholtz Imaging (located at DKFZ) and Lin Yang and Otmar Schmid (Helmholtz Munich).☆11Nov 6, 2024Updated last year
- On November 21, 1972, Kim Doo-han collapsed due to high blood pressure, an orange disease.☆11Jun 27, 2022Updated 3 years ago
- msx game development library ubox example☆11Apr 26, 2023Updated 2 years ago
- ☆12Feb 26, 2025Updated last year
- Amazon Bedrock 의 Nova, Claude 3.7 모델을 활용하여 pdf 도면을 파싱 합니다.☆12May 19, 2025Updated 9 months ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- ☆26Nov 7, 2022Updated 3 years ago