Yaxin9Luo/Gamma-MOD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yaxin9Luo/Gamma-MOD)

Yaxin9Luo / Gamma-MOD

[ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models

☆45

Alternatives and similar repositories for Gamma-MOD

Users that are interested in Gamma-MOD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MetaAgentX / NextGen-CAPTCHAs
View on GitHub
[ICML 2026]A defense framework against MLLM-based web GUI agents. This repository provides both the generative CAPTCHA system and tools f…
☆22May 1, 2026Updated 2 months ago
Jiacheng8 / CV-DD
View on GitHub
Dataset Distillation via Committee Voting
☆15Jul 28, 2025Updated last year
Tangshengku / Bi-Mamba
View on GitHub
The official implementation of Bi-Mamba
☆17Oct 22, 2025Updated 9 months ago
ChangyaoTian / ADDP
View on GitHub
The official implementation of ADDP (ICLR 2024)
☆12Mar 27, 2024Updated 2 years ago
shaoshitong / EDC
View on GitHub
Elucidated Dataset Condensation (NeurIPS 2024)
☆20Oct 5, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
VILA-Lab / OD3
View on GitHub
[ICLR 2026] Optimization-free Dataset Distillation for Object Detection. Paper at: https://arxiv.org/abs/2506.01942
☆31Jan 26, 2026Updated 6 months ago
MCG-NJU / p-MoD
View on GitHub
[ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
☆44Jun 26, 2025Updated last year
OpenGVLab / De-focus-Attention-Networks
View on GitHub
Learning 1D Causal Visual Representation with De-focus Attention Networks
☆35Jun 7, 2024Updated 2 years ago
VILA-Lab / DELT
View on GitHub
(CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…
☆28Aug 23, 2025Updated 11 months ago
zjr2000 / REVERIE
View on GitHub
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
☆20Jul 17, 2024Updated 2 years ago
fundamentalvision / UniGrad
View on GitHub
☆31Jun 29, 2022Updated 4 years ago
NIneeeeeem / LangDC
View on GitHub
[EMNLP 2025 Oral] Official codebase for Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors.
☆18Sep 7, 2025Updated 10 months ago
Tencent-QQMM / Video-CCAM
View on GitHub
A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.
☆74Oct 14, 2024Updated last year
swordlidev / Efficient-Multimodal-LLMs-Survey
View on GitHub
Efficient Multimodal Large Language Models: A Survey
☆386Apr 29, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
sanbuphy / computer-vision-reference
View on GitHub
Collected the world's best computer vision labs and lecture materials.
☆15Feb 23, 2025Updated last year
Bizilizi / VGGSounder
View on GitHub
VGGSounder, a multi-label audio-visual classification dataset with modality annotations.
☆17Jun 30, 2026Updated 3 weeks ago
yliu-cs / PiTe
View on GitHub
[ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model
☆17Feb 13, 2025Updated last year
Wang-Xiaodong1899 / Awesome-Multimodal-Large-Language-Models
View on GitHub
🔥Awesome Multimodal Large Language Models Paper List
☆154Mar 12, 2025Updated last year
inFaaa / Evolver
View on GitHub
[COLING 2025🔥] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
☆17Jan 21, 2025Updated last year
OpenGVLab / Siamese-Image-Modeling
View on GitHub
[CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning
☆41Jun 6, 2024Updated 2 years ago
XieZilongAI / E2E-AFG
View on GitHub
An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation
☆16Oct 27, 2024Updated last year
zijunwei / S2N-release
View on GitHub
☆13Dec 25, 2018Updated 7 years ago
42Shawn / LLaVA-PruMerge
View on GitHub
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
☆173Mar 8, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
astramind-ai / Mixture-of-depths
View on GitHub
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆175Jun 20, 2024Updated 2 years ago
SUSTechBruce / LOOK-M
View on GitHub
[EMNLP 2024 Findings🔥] Official implementation of ": LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…
☆103Nov 9, 2024Updated last year
showlab / MovieSeq
View on GitHub
[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences
☆46Mar 11, 2025Updated last year
Cooperx521 / PyramidDrop
View on GitHub
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
☆151Mar 6, 2025Updated last year
pkunlp-icler / FastV
View on GitHub
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…
☆591Jan 4, 2025Updated last year
mlvlab / vid-TLDR
View on GitHub
Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
☆55Oct 21, 2025Updated 9 months ago
szq0214 / CMC_with_Image_Mixture
View on GitHub
pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsup…
☆18Mar 23, 2020Updated 6 years ago
ludc506 / InternVL-X
View on GitHub
☆16Mar 26, 2025Updated last year
SHI-Labs / VisPer-LM
View on GitHub
[NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation
☆74Oct 17, 2025Updated 9 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
pilancilab / matrix-compressor
View on GitHub
Implementation of LPLR algorithm for matrix compression
☆33Nov 21, 2023Updated 2 years ago
keikeiqi / MGTTA
View on GitHub
AAAI2025
☆13Apr 18, 2025Updated last year
lose4578 / CircleRoPE
View on GitHub
☆15Sep 1, 2025Updated 10 months ago
LaVi-Lab / AIM
View on GitHub
[ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"
☆65Oct 9, 2025Updated 9 months ago
mit-han-lab / lpd
View on GitHub
[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆104May 8, 2026Updated 2 months ago
sramshetty / mixture-of-depths
View on GitHub
An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆35Jun 7, 2024Updated 2 years ago
sdc17 / CrossGET
View on GitHub
[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
☆34Dec 30, 2024Updated last year