Fzkuji/swat-attention

Fzkuji / swat-attention

🚀 Sliding Window Attention Training for Efficient Large Language Models

☆19

Alternatives and similar repositories for swat-attention

Users that are interested in swat-attention are comparing it to the libraries listed below.

TsinghuaC3I / FS-GEN
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
☆13 · Nov 19, 2024 · Updated last year
Elvin-Yiming-Du / Memory-T1
This repository is used for the time reasoning task for multi-session dialogue systems.
☆16 · Feb 7, 2026 · Updated 5 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36 · Sep 24, 2024 · Updated last year
scalable-model-editing / unified-model-editing
We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.
☆29 · Dec 16, 2024 · Updated last year
Arvid-pku / ATOKE
[AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model
☆13 · Dec 17, 2023 · Updated 2 years ago
starrYYxuan / UniTE
☆17 · Nov 20, 2024 · Updated last year
LuckyyySTA / Fine-grained-Attribution
[ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models
☆21 · Oct 24, 2024 · Updated last year
Z1zs / MMNeuron
Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…
☆26 · Dec 20, 2024 · Updated last year
Graph-COM / Knowledge_Unlearning
☆16 · Oct 12, 2025 · Updated 9 months ago
zkzhou126 / AI-for-Research
From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems
☆19 · Jun 29, 2026 · Updated 3 weeks ago
PASSIONLab / distributed_sddmm
Distributed SDDMM Kernel
☆12 · Jul 8, 2022 · Updated 4 years ago
SooLab / MVTokenFlow
[ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow
☆27 · Apr 9, 2025 · Updated last year
LuckyyySTA / GOLF
☆18 · Mar 16, 2026 · Updated 4 months ago
dinobby / Skill-MoE
The code implementation of Skill-MoE
☆46 · May 22, 2026 · Updated last month
arch-simulator-sig / simulator-paper
☆12 · Sep 18, 2024 · Updated last year
Janghyun1230 / FastKVzip
Accurate and fast KV cache compression with a gating mechanism
☆26 · Apr 5, 2026 · Updated 3 months ago
SooLab / Part2Object
[ECCV 2024] The official PyTorch implementation of the "Part2Object: Hierarchical Unsupervised 3D Instance Segmentation".
☆26 · Sep 12, 2024 · Updated last year
EIT-NLP / Awesome-Streaming-LLMs
🔥This is a repository of paper list for streaming LLMs/MLLMs.
☆24 · Apr 19, 2026 · Updated 3 months ago
Greysahy / ipiguard
[EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
☆22 · Sep 16, 2025 · Updated 10 months ago
SuperLiaoXH / SystolicArray-2D-FP16
FP16-based 2D systolic array circuit design
☆13 · Feb 23, 2023 · Updated 3 years ago
kyaso / py-v
A cycle-accurate RISC-V CPU simulator + RTL modeling library in pure Python.
☆19 · Aug 27, 2025 · Updated 10 months ago
sagemathinc / cocalc-examples
Collection of example documents for use within CoCalc's library
☆17 · Sep 11, 2025 · Updated 10 months ago
renatoberlinghieri / Helmholtz-GP
☆11 · Mar 13, 2023 · Updated 3 years ago
LC1332 / Haruhi-2-Dev
Just for debug
☆57 · Feb 15, 2024 · Updated 2 years ago
MemTensor / text2mem
Text2Mem: A Unified Memory Operation Language for Memory Operating System
☆55 · Jan 7, 2026 · Updated 6 months ago
PASSIONLab / MaskedSpGEMM
☆10 · Jul 4, 2022 · Updated 4 years ago
Vision-CAIR / Infinibench
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
☆20 · Nov 4, 2025 · Updated 8 months ago
zjunlp / unlearn
[ACL 2025] Knowledge Unlearning for Large Language Models
☆49 · Sep 18, 2025 · Updated 10 months ago
CASR-HKU / AGNA-FCCM2023
☆12 · Nov 24, 2023 · Updated 2 years ago
qinjr / RankFlow
☆12 · Dec 15, 2022 · Updated 3 years ago
gty111 / SimpleUseGpgpuSim
GPGPU-SIM usage guide
☆14 · Nov 12, 2022 · Updated 3 years ago
VITA-Group / WeLore
[ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications
☆52 · Oct 30, 2025 · Updated 8 months ago
ChengShiest / Vision-Function-Layer
[NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM".
☆32 · Dec 18, 2025 · Updated 7 months ago
mbzuai-oryx / Video-R2
Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models
☆19 · Jan 21, 2026 · Updated 6 months ago
IoT-gamer / segment-anything-dinov3-onnx
A set of tools and examples for converting and utilizing powerful vision models, DINOv3 and EdgeTAM (SAM2), within the ONNX ecosystem.
☆15 · Nov 5, 2025 · Updated 8 months ago
runame / laplace-refinement
Posterior Refinement Improves Sample Efficiency in Bayesian Neural Networks
☆11 · Oct 21, 2022 · Updated 3 years ago
sfu-arch / SPAGHETTI
RTL generator for SpGEMM
☆12 · Feb 2, 2021 · Updated 5 years ago
taiki-e / easytime
Providing wrapper types for safely performing panic-free checked arithmetic on instants and durations.
☆17 · Jul 11, 2026 · Updated last week
tulasiram58827 / plot_top_losses_keras
This repo consists of code for plotting top loss images
☆13 · May 18, 2020 · Updated 6 years ago