chenyaofo / CCA-Attention
☆19 · Updated 4 months ago
Alternatives and similar repositories for CCA-Attention
Users interested in CCA-Attention are comparing it with the repositories listed below.
- ☆49 · Updated 6 months ago
- The official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation" ☆39 · Updated last year
- The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch… ☆24 · Updated 5 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24) ☆69 · Updated 5 months ago
- Official implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS 2024 Oral) ☆32 · Updated 11 months ago
- ☆26 · Updated 3 weeks ago
- ☆17 · Updated 4 months ago
- ☆31 · Updated 6 months ago
- Code for merging Large Language Models ☆34 · Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025) ☆31 · Updated 8 months ago
- The official GitHub page for the survey paper "A Survey of RWKV". ☆29 · Updated 11 months ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression." ☆17 · Updated last year
- Flash-Linear-Attention models beyond language ☆20 · Updated 3 months ago
- ☆18 · Updated 9 months ago
- ICLR 2025 ☆30 · Updated 7 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models ☆56 · Updated 6 months ago
- Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" ☆111 · Updated 2 weeks ago
- A repository for DenseSSMs ☆89 · Updated last year
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ☆61 · Updated 10 months ago
- User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice rou… ☆28 · Updated 7 months ago
- ☆62 · Updated 5 months ago
- Research work aimed at addressing the problem of modeling infinite-length context ☆29 · Updated this week
- ☆152 · Updated last year
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models ☆39 · Updated 11 months ago
- PyTorch implementation of StableMask (ICML'24) ☆14 · Updated last year
- ☆38 · Updated 4 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models ☆35 · Updated last year
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference ☆54 · Updated last year
- ☆112 · Updated 3 months ago
- A training-free approach to accelerate ViTs and VLMs by pruning redundant tokens based on similarity ☆40 · Updated 6 months ago