alexlioralexli / attention-transferView external linksLinks
☆22Nov 19, 2024Updated last year
Alternatives and similar repositories for attention-transfer
Users that are interested in attention-transfer are comparing it to the libraries listed below
Sorting:
- ☆23Nov 16, 2024Updated last year
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆21Oct 15, 2024Updated last year
- Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)☆34Jan 18, 2025Updated last year
- Official Code Repository for the paper "Key-value memory in the brain"☆31Feb 25, 2025Updated 11 months ago
- Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) ) 🍓☆36Apr 3, 2025Updated 10 months ago
- ☆36Oct 3, 2018Updated 7 years ago
- This is the official implementation of paper, as was used for the paper.☆20Jul 20, 2025Updated 6 months ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated 11 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- Gesture Recognition Based on ALTERA DE2-115 FPGA☆10Mar 18, 2014Updated 11 years ago
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 4 months ago
- ☆19Jan 16, 2026Updated last month
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models☆125Oct 14, 2025Updated 4 months ago
- Residual vector quantization for KV cache compression in large language model☆11Oct 22, 2024Updated last year
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆12Sep 6, 2024Updated last year
- 河海大学每日健康打卡☆12Dec 4, 2021Updated 4 years ago
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆12Oct 31, 2024Updated last year
- ☆10Aug 31, 2021Updated 4 years ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆26Apr 27, 2025Updated 9 months ago
- Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization☆12Jan 12, 2026Updated last month
- ☆11Sep 1, 2024Updated last year
- Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology☆22Jul 17, 2025Updated 7 months ago
- Resolution Asymmetric Metric Learning☆16Dec 10, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 10 months ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- HyperPose☆12Nov 6, 2025Updated 3 months ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year
- ☆23Updated this week
- ☆13Jul 20, 2024Updated last year
- Realtime feedback control of event-camera biases.☆11Dec 22, 2025Updated last month
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆13Dec 3, 2024Updated last year
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆17Oct 4, 2025Updated 4 months ago
- ☆13Jul 8, 2024Updated last year
- The source code for “Homophily-Related: Adaptive Hybrid Graph Filter for Multi-View Graph Clustering”☆10Apr 10, 2024Updated last year
- ☆11Jun 2, 2019Updated 6 years ago
- ☆10Feb 27, 2020Updated 5 years ago
- ☆10Oct 28, 2024Updated last year
- ☆36Jan 13, 2026Updated last month