xiongsiheng / DHSALinks
[NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs
☆72Updated 3 weeks ago
Alternatives and similar repositories for DHSA
Users that are interested in DHSA are comparing it to the libraries listed below
Sorting:
- ☆79Updated 2 months ago
- ☆38Updated 2 months ago
- Training and evaluation code of EGTLM model.☆22Updated last year
- a demo but fun snake game created in https://aide.ink☆66Updated 10 months ago
- ☆104Updated 10 months ago
- ☆95Updated last week
- toolkit for WakenLLM framework☆47Updated 2 weeks ago
- ☆80Updated 6 months ago
- A Knowledge Base on Pre-made Dishes☆105Updated 5 months ago
- Concise Evaluation Benchmark for Large Language Models☆25Updated 4 months ago
- A system demo based on Retrival Argument Generation to answer buddism question☆84Updated last year
- Assignment, homework and everything in Northeastern University Miami☆32Updated this week
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆42Updated last month
- ☆48Updated 8 months ago
- Help you practice daily English speaking and conversation skills painlessly from easy to difficult☆64Updated 7 months ago
- 强化学习-大语言模型☆68Updated 5 months ago
- HACAN: Hybrid Attention-Driven Cross-Layer Alignment Network for Image-Text Retrieval☆79Updated 7 months ago
- Quick start with just one Python file for writing large models. No complex file structure or unnecessary explanations, perfect for beginn…☆41Updated 4 months ago
- This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced …☆59Updated last year
- Enable Agents to conduct web3 operations, support wider applications for cross-chain bridging☆43Updated 11 months ago
- ☆48Updated 7 months ago
- ☆41Updated 9 months ago
- Store and download PseudoMeta R Package☆28Updated 5 months ago
- ☆71Updated 3 years ago
- Imagine building a whole operating system around just your notes.☆80Updated 10 months ago
- 最终幻想14英文笔记☆96Updated last year
- The code and resources for the thesis project "Exploring Generative Adversarial Networks for Multivariate Time Series Data Imputation"☆53Updated 7 months ago
- MedSoft-Diffusion was early accepted to MICCAI 2025 (top 9%, scores: 5/4/4).☆41Updated 9 months ago
- my work☆26Updated 11 months ago
- `cryptor` is a Go package for secure encryption and decryption using NaCl's `secretbox` from `golang.org/x/crypto`☆60Updated 6 months ago