sdc17/CrossGET

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sdc17/CrossGET)

sdc17 / CrossGET

[ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers

☆34

Alternatives and similar repositories for CrossGET

Users that are interested in CrossGET are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sdc17 / UPop
View on GitHub
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers
☆103Dec 30, 2024Updated last year
sdc17 / CopT
View on GitHub
CopT: Contrastive On-Policy Thinking with Continuous Spaces for General and Agentic Reasoning
☆18May 21, 2026Updated 2 months ago
Osilly / Heartbeat-sequence-prediction
View on GitHub
天池上的一场长期赛（心跳信号分类预测），非常简单朴素的实现，长期赛榜单第8名（258.9817分）
☆18Jan 30, 2022Updated 4 years ago
WalkerWorldPeace / MLLMerging
View on GitHub
ICLR 2026 "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".
☆57Jun 18, 2026Updated last month
menik1126 / UNComp
View on GitHub
[EMNLP 2025🔥] UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective
☆20Jan 7, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
menik1126 / Swing-Bench
View on GitHub
[ICLR2026🔥Oral] SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
☆15Feb 26, 2026Updated 4 months ago
ahmedssabir / Belief-Revision-Score
View on GitHub
Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022
☆11Apr 13, 2025Updated last year
OpenGVLab / DiffRate
View on GitHub
[ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…
☆103Jul 14, 2023Updated 3 years ago
42Shawn / LLaVA-PruMerge
View on GitHub
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
☆173Mar 8, 2026Updated 4 months ago
wenhaochai / UniAP
View on GitHub
[AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning
☆12Dec 10, 2023Updated 2 years ago
ywh187 / FitPrune
View on GitHub
☆68Jan 23, 2026Updated 5 months ago
fistyee / MixPro
View on GitHub
🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]
☆22Nov 3, 2023Updated 2 years ago
LorrinWWW / SkipBERT
View on GitHub
Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022
☆16Jun 22, 2022Updated 4 years ago
ZIB-IOL / SMS
View on GitHub
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12Oct 14, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tttyuntian / vlm_lexical_grounding
View on GitHub
PyTorch code for the Findings of EMNLP 2021 paper "Does Vision-and-Language Pretraining Improve Lexical Grounding?"
☆11Sep 26, 2021Updated 4 years ago
SpencerWhitehead / novelvqa
View on GitHub
☆27Oct 7, 2021Updated 4 years ago
OrigamiSL / OTETrack
View on GitHub
Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking
☆11Sep 3, 2024Updated last year
OpenGVLab / Multitask-Model-Selector
View on GitHub
[NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector
☆37Mar 7, 2024Updated 2 years ago
MCG-NJU / MMN
View on GitHub
[AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
☆91Nov 16, 2022Updated 3 years ago
Yaxin9Luo / Gamma-MOD
View on GitHub
[ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models
☆45Oct 28, 2025Updated 8 months ago
mlvlab / RPO
View on GitHub
Official Implementation of "Read-only Prompt Optimization for Vision-Language Few-shot Learning", ICCV 2023
☆54Aug 19, 2023Updated 2 years ago
Kurt232 / RLKV
View on GitHub
☆35Jun 8, 2026Updated last month
cchao0116 / CTSMA-ICML21
View on GitHub
Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"
☆12Feb 8, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wenhaochai / PoseDA
View on GitHub
[ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation
☆24Aug 26, 2023Updated 2 years ago
haoweiz23 / DistDiff
View on GitHub
[NeurIPS 2024] The official repository of "Distribution-Aware Data Expansion with Diffusion Models".
☆17Dec 15, 2025Updated 7 months ago
yancie-yjr / DBQ-SSD
View on GitHub
The official implementation of the paper DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection (ICLR 2023)
☆18Sep 17, 2023Updated 2 years ago
AIoT-MLSys-Lab / D2O
View on GitHub
[ICLR 2025🔥] D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models
☆27Jul 7, 2025Updated last year
MCG-NJU / p-MoD
View on GitHub
[ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
☆43Jun 26, 2025Updated last year
INK-USC / hypter
View on GitHub
Zero-shot Learning by Generating Task-specific Adapters
☆14Apr 2, 2021Updated 5 years ago
devaansh100 / CLIPTrans
View on GitHub
Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…
☆20Jun 3, 2024Updated 2 years ago
ylsung / ECoFLaP
View on GitHub
Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)
☆21Feb 16, 2024Updated 2 years ago
TengdaHan / TemporalAlignNet
View on GitHub
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆122Oct 9, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
DanielLin97 / FACT-AUDIT
View on GitHub
An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
☆18Feb 27, 2025Updated last year
lern-to-write / STC
View on GitHub
[CVPR 2026] Accelerating Streaming Video Large Language Models via Hierarchical Token Compression
☆70Jun 8, 2026Updated last month
aimagelab / PMA-Net
View on GitHub
[ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.
☆19Jun 7, 2024Updated 2 years ago
LeeSureman / Sequence-Labeling-Early-Exit
View on GitHub
Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit
☆28Aug 19, 2022Updated 3 years ago
xunull / read-RT-DETR
View on GitHub
☆14May 19, 2024Updated 2 years ago
Yondijr / NER_Transformer
View on GitHub
A transformer model that should be able to solve a simple NER task
☆11Mar 7, 2019Updated 7 years ago
huangzizheng01 / ShuffleMamba
View on GitHub
Code of paper 'Stochastic Layer-Wise Shuffle for Improving Vision Mamba Training'
☆21Jun 10, 2025Updated last year