codefanw/FlashSloth

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/codefanw/FlashSloth)

codefanw / FlashSloth

[CVPR2025] FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression

☆64

Alternatives and similar repositories for FlashSloth

Users that are interested in FlashSloth are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DoubtedSteam / DyVTE
View on GitHub
The official implement of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"
☆18Dec 5, 2024Updated last year
ywh187 / FitPrune
View on GitHub
☆68Jan 23, 2026Updated 6 months ago
wysnzzzz / DIT
View on GitHub
☆18Nov 15, 2024Updated last year
Aria-Zhangjl / E3-FaceNet
View on GitHub
[ICML 2024] Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
☆23Dec 20, 2024Updated last year
lzhxmu / AccDiffusion_v2
View on GitHub
Code release for AccDiffusionV2 (TPAMI)
☆34Nov 4, 2025Updated 8 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
yu2hi13 / P2SAM
View on GitHub
Official implementation for P2SAM (ACM MM 2024)
☆14Dec 7, 2024Updated last year
DoubtedSteam / RoE
View on GitHub
The official implement of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"
☆17Mar 24, 2025Updated last year
lzhxmu / VTW
View on GitHub
Code release for VTW (AAAI 2025 Oral)
☆68Nov 4, 2025Updated 8 months ago
city1517 / FlexMem
View on GitHub
[CVPR2026 Highlight] FlexMem: Scaling the Long Video Understanding of MLLMs via Visual Memory Mechanism
☆28Apr 10, 2026Updated 3 months ago
ModelTC / MoDES
View on GitHub
[CVPR 2026] This is the official PyTorch implementation of "MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via D…
☆31Mar 16, 2026Updated 4 months ago
Theia-4869 / FasterVLM
View on GitHub
Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.
☆114Jun 29, 2025Updated last year
JiejiangWu / FaceG2E
View on GitHub
Official code for CVPR2024 paper "text-guided 3d face synthesis - from generation to editing"
☆44Aug 22, 2024Updated last year
yu2hi13 / TAO
View on GitHub
Official implementation for TAO (CVPR 2025)
☆21Jan 1, 2026Updated 6 months ago
JIA-Lab-research / VisionZip
View on GitHub
Official repository for VisionZip (CVPR 2025)
☆443Jul 21, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nini0919 / SemiRES
View on GitHub
[ICML2024]The official implementation of SemiRES in PyTorch.
☆33Jun 20, 2024Updated 2 years ago
xuyimin0926 / U-WADN
View on GitHub
☆27Jan 30, 2024Updated 2 years ago
dragonlzm / PAVE
View on GitHub
This repo holds the implementation of PAVE: Patching and Adapting Video Large Language Models (CVPR2025)
☆27Sep 6, 2025Updated 10 months ago
fperazzi / davis-2017
View on GitHub
☆15Aug 1, 2021Updated 4 years ago
songrise / MLLM4Art
View on GitHub
[ACM MM 2025] MLLMs for Aesthetics Reasoning
☆26Jan 5, 2026Updated 6 months ago
hanxunyu / VisionTrim
View on GitHub
[ICLR 2026] Official code repository for "⚡️VisionTrim: Unified Vision Token Compression for Training-Free MLLM Acceleration"
☆55Jun 17, 2026Updated last month
67L1 / SinkTrack
View on GitHub
[ICLR'26] SinkTrack: Attention Sink based Context Anchoring for Large Language Models
☆18Apr 23, 2026Updated 2 months ago
jasongzy / EG4D
View on GitHub
Official implementation of "EG4D: Explicit Generation of 4D Object without Score Distillation" (ICLR 2025)
☆37Feb 14, 2025Updated last year
kawhiiiileo / FiCoCo
View on GitHub
[AAAI 26'] This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acc…
☆47Nov 13, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ZYangChen / DC-SatMVS
View on GitHub
[IEEE JSTARS] The official implementation of "Surface Depth Estimation from Multi-view Stereo Satellite Images with Distribution Contrast…
☆11May 16, 2025Updated last year
InternLM / Spatial-SSRL
View on GitHub
[CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"
☆133Apr 7, 2026Updated 3 months ago
Tencent / VITA
View on GitHub
The official implement of VITA, VITA15, LongVITA, VITA-Audio, VITA-VLA, and VITA-E.
☆162Oct 28, 2025Updated 8 months ago
lep990816 / CrossHomo
View on GitHub
☆17May 18, 2024Updated 2 years ago
RoyZhao926 / InstructBrush
View on GitHub
Official repository of the paper InstructBrush: Learning Attention-based Instruction Optimization for Image Editing
☆16Apr 14, 2024Updated 2 years ago
catlab-team / latent3D_code
View on GitHub
☆20Aug 30, 2022Updated 3 years ago
Disguiser15 / RefTeacher
View on GitHub
RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.
☆14May 26, 2023Updated 3 years ago
michaelowenliu / awesome-interactive-segmentation
View on GitHub
A collection of AWESOME things about interactive segmentation.
☆25Jun 25, 2023Updated 3 years ago
JiahuaDong / CIFC
View on GitHub
[NeurIPS2024]
☆36Dec 18, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hyqyoung / RAMS-Trans
View on GitHub
RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition
☆11Dec 14, 2021Updated 4 years ago
ziplab / LongVLM
View on GitHub
☆108Jul 30, 2024Updated last year
xmu-xiaoma666 / SDATR
View on GitHub
Official Code for "Knowing what it is: Semantic-enhanced Dual Attention Transformer" (TMM2022)
☆19Oct 15, 2022Updated 3 years ago
OpenGVLab / InternVL-U
View on GitHub
InternVL-U is a 4B-parameter unified multimodal model (UMM) that brings multimodal understanding, reasoning, image generation, image edit…
☆291Mar 21, 2026Updated 4 months ago
AI-in-Health / PromptLLM
View on GitHub
Code for PromptNet
☆16Jan 29, 2025Updated last year
YaoXingbo / MagicCity
View on GitHub
ICCV 2025
☆16Mar 26, 2026Updated 3 months ago
mwoedlinger / ecsic
View on GitHub
Official code of our WACV paper "ECSIC: Epipolar Cross Attention for Stereo Image Compression"
☆15Dec 27, 2023Updated 2 years ago