MikeWangWZHL/dymu

☆29

Alternatives and similar repositories for dymu

Users who are interested in dymu are comparing it to the libraries listed below.

vbdi / divprune
[CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models
☆86 · Apr 16, 2026 · Updated 3 months ago
hanxunyu / VisionTrim
[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"
☆56 · Jun 17, 2026 · Updated last month
Sein-Kim / self_evolverec
☆19 · Updated this week
jamessealesmith / ConStruct-VL
PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"
☆13 · Feb 5, 2024 · Updated 2 years ago
NIneeeeeem / LangDC
[EMNLP 2025 Oral] Official codebase for Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors.
☆18 · Sep 7, 2025 · Updated 10 months ago
liuting20 / MustDrop
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model
☆36 · Jan 8, 2025 · Updated last year
Namkyeong / 3DMRL
The official source code for "3D Interaction Geometric Pre-training for Molecular Relational Learning"
☆23 · Sep 30, 2025 · Updated 9 months ago
wy1iu / OPT
Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21.
☆22 · Jul 16, 2021 · Updated 5 years ago
hyunsungkim-ds / ballradar
[KDD 2023] Ball Trajectory Inference from Multi-Agent Sports Contexts Using Set Transformer and Hierarchical Bi-LSTM
☆32 · Feb 3, 2026 · Updated 5 months ago
ByungKwanLee / Phantom
[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …
☆63 · Oct 9, 2024 · Updated last year
armenjeddi / saint
a training-free approach to accelerate ViTs and VLMs by pruning redundant tokens based on similarity
☆44 · May 24, 2025 · Updated last year
HeewoongNoh / DOSTransformer
The official source code for [2023 NeurIPS] "Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transfo…
☆30 · Oct 15, 2024 · Updated last year
pixeli99 / MixLN
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆30 · Jul 24, 2025 · Updated last year
42Shawn / LLaVA-PruMerge
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
☆173 · Mar 8, 2026 · Updated 4 months ago
yuexy / ST-AR
☆14 · Sep 22, 2025 · Updated 10 months ago
Tennine2077 / HiDe
[ICML 2026] HiDe: Rethinking The Zoom-IN method in High Resolution MLLMs via Hierarchical Decoupling
☆27 · May 2, 2026 · Updated 2 months ago
daixiangzi / Awesome-Token-Compress
A paper list of some recent works about Token Compress for Vit and VLM
☆944 · Updated this week
PiggyJerry / DC-Net
The code for paper: "DC-Net: Divide-and-Conquer for Salient Object Detection"
☆22 · Aug 30, 2024 · Updated last year
kyegomez / Mirasol
Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"
☆26 · Jan 27, 2025 · Updated last year
Adaxry / Unified_Layer_Skipping
☆15 · Apr 11, 2024 · Updated 2 years ago
BRZ911 / ViTCoT
[ACM MM 2025] ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models
☆18 · Jul 15, 2025 · Updated last year
showlab / MovieSeq
[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences
☆46 · Mar 11, 2025 · Updated last year
ant-research / long-context-modeling
Research work aimed at addressing the problem of modeling infinite-length context
☆50 · Dec 18, 2025 · Updated 7 months ago
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆20 · Jul 15, 2026 · Updated 2 weeks ago
blender-nlp / mCLM
☆18 · May 11, 2026 · Updated 2 months ago
GATECH-EIC / Linearized-LLM
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35 · Jun 12, 2024 · Updated 2 years ago
AIM-SKKU / ADAPT
[NeurIPS 2025] Backpropagation-Free Test-Time Adaptation via Probabilistic Gaussian Alignment
☆22 · Mar 18, 2026 · Updated 4 months ago
dibbla / Quantized-Evolution-Strategies
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost
☆21 · May 14, 2026 · Updated 2 months ago
ByungKwanLee / TroL
[EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…
☆99 · Jun 23, 2024 · Updated 2 years ago
MikeWangWZHL / Zemi
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆15 · May 3, 2023 · Updated 3 years ago
JulietChoo / VisionSelector
VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs
☆65 · Mar 24, 2026 · Updated 4 months ago
kensho-technologies / pathpiece
PathPiece tokenizer
☆14 · Nov 10, 2024 · Updated last year
HumanMLLM / LLaVA-Scissor
The official code for the paper: LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
☆122 · Jul 1, 2025 · Updated last year
XPR2004 / SpatialBench
Code and dataset for paper "SpatialBench: Benchmarking Multimodal Large Language Models for Spatial Cognition"
☆19 · Mar 17, 2026 · Updated 4 months ago
longrongyang / STGC
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13 · Feb 11, 2025 · Updated last year
iLearn-Lab / ACL25-PTQ1.61
☆15 · Apr 6, 2026 · Updated 3 months ago
merlresearch / SOCKET
Code for MERL's ECCV 2022 paper on Cross-Modal Knowledge Transfer Without Task-Relevant Source Data
☆11 · Jul 19, 2022 · Updated 4 years ago
Fantasyele / LLaVA-KD
[ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models
☆134 · Oct 14, 2025 · Updated 9 months ago
KD-TAO / DyCoke
[CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models
☆114 · Nov 22, 2025 · Updated 8 months ago