Osilly / TokenExpansionLinks

[CVPR 2024] The official pytorch implementation of "A General and Efficient Training for Transformer via Token Expansion".

☆44

Alternatives and similar repositories for TokenExpansion

Users that are interested in TokenExpansion are comparing it to the libraries listed below

Sorting:

ZhengYu518 / VL-Mamba
Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"
☆82Updated last year
YuqiYang213 / MLoRE
Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"
☆82Updated 2 months ago
zhoujiahuan1991 / ICML2025-TCPA
☆18Updated 3 months ago
ChenhongyiYang / PlainMamba
[BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition
☆79Updated 4 months ago
WillDreamer / Aurora
[NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model
☆88Updated last year
lezhang7 / SAIL
[CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"
☆48Updated last month
EasonXiao-888 / MambaTree
[NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model
☆99Updated last year
xmed-lab / TAM
[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs
☆52Updated this week
YBZh / DMN
CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models
☆80Updated last year
winycg / CLIP-KD
[CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation
☆123Updated last year
wusize / F-LMM
[CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models
☆100Updated 2 months ago
UMass-Embodied-AGI / Mod-Squad
☆92Updated 2 years ago
leaves162 / CLIPtrase
cliptrase
☆41Updated 11 months ago
yayafengzi / LMM-HiMTok
HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model
☆58Updated 3 weeks ago
OliverRensu / ARM
[ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision
☆83Updated 2 months ago
zhengli97 / ATPrompt
[ICCV 2025] Official PyTorch Code for "Advancing Textual Prompt Learning with Anchored Attributes"
☆83Updated 3 weeks ago
HVision-NKU / Cascade-CLIP
Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
☆51Updated 11 months ago
wangf3014 / SCLIP
Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
☆161Updated 10 months ago
mc-lan / ClearCLIP
[ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference
☆86Updated 4 months ago
yongliu20 / SCAN
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆73Updated 10 months ago
Hoar012 / RAP-MLLM
[CVPR 2025] RAP: Retrieval-Augmented Personalization
☆64Updated last week
aim-uofa / SINE
[NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples
☆58Updated 9 months ago
JieShibo / MemVP
[ICML 2024] Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning
☆49Updated last year
OpenGVLab / Mono-InternVL
[CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
☆74Updated 3 weeks ago
Paranioar / UniPT
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
☆67Updated 9 months ago
Koorye / DePT
[CVPR 2024] Offical implemention of the paper "DePT: Decoupled Prompt Tuning"
☆107Updated 2 months ago
jiaosiyu1999 / MAFT
☆58Updated 11 months ago
DavidYanAnDe / ARC
☆35Updated last year
scale-lab / MTLoRA
The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)
☆58Updated last month
linyq2117 / TagCLIP
[AAAI 2024] TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training
☆100Updated last year