MCG-NJU/ZeroI2V

[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video

☆23

Alternatives and similar repositories for ZeroI2V

Users who are interested in ZeroI2V are comparing it to the repositories listed below.

MCG-NJU / SPLAM
[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
☆24 · Nov 1, 2024 · Updated last year
MCG-NJU / AWT
[NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation
☆121 · Oct 5, 2024 · Updated last year
MCG-NJU / VideoEval
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
☆15 · Jul 31, 2025 · Updated 11 months ago
MCG-NJU / MGMAE
[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
☆26 · Oct 16, 2023 · Updated 2 years ago
MCG-NJU / TemporalPerceiver
[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
☆39 · Aug 29, 2023 · Updated 2 years ago
MCG-NJU / AMD
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
☆18 · Jan 11, 2026 · Updated 6 months ago
MCG-NJU / JoMoLD
[ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing
☆27 · Jul 15, 2022 · Updated 4 years ago
MCG-NJU / ViT-TAD
[CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
☆11 · Jun 11, 2024 · Updated 2 years ago
MCG-NJU / Dynamic-MDETR
[TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
☆29 · Sep 11, 2024 · Updated last year
leexinhao / ZeroI2V
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆20 · Jul 29, 2024 · Updated 2 years ago
MCG-NJU / CaReBench
A Fine-grained Benchmark for Video Captioning and Retrieval
☆30 · Jul 16, 2025 · Updated last year
MCG-NJU / TimeLens2
TimeLens2: Generalist Video Temporal Grounding with Multimodal LLMs
☆57 · Updated this week
MCG-NJU / BIVDiff
[CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models
☆78 · Sep 11, 2024 · Updated last year
MCG-NJU / CoMAE
[AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
☆38 · Aug 20, 2024 · Updated last year
wangzheallen / STL-VQA
The good practice in the VQA system such as pos-tag attention, structured triplet learning and triplet attention is very general and can be…
☆19 · Jan 23, 2018 · Updated 8 years ago
MCG-NJU / RTD-Action
[ICCV 2021] Relaxed Transformer Decoders for Direct Action Proposal Generation
☆92 · Apr 5, 2022 · Updated 4 years ago
MCG-NJU / MoG-VFI
Motion-Aware Generative Frame Interpolation
☆50 · Mar 11, 2025 · Updated last year
MCG-NJU / PointTAD
[NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points
☆48 · Nov 24, 2023 · Updated 2 years ago
MCG-NJU / TREG
Target Transformed Regression for Accurate Tracking
☆21 · Dec 5, 2021 · Updated 4 years ago
MCG-NJU / VFIMamba
[NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models
☆155 · Sep 26, 2024 · Updated last year
MCG-NJU / CGA-Net
[CVPR 2021] CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation
☆24 · Jan 30, 2022 · Updated 4 years ago
EnVision-Research / TASC
☆27 · Apr 28, 2025 · Updated last year
x4Cx58x54 / vistal
A visualization tool for temporal action localization (detection/segmentation).
☆13 · Mar 30, 2023 · Updated 3 years ago
MCG-NJU / FlowBack
[AAAI 2026] Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment
☆16 · Dec 9, 2025 · Updated 7 months ago
Egg-Hu / LoRA-Recycle
[CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
☆14 · Jun 20, 2025 · Updated last year
Sharpiless / Awesome-datafree-KD
A collection of papers on zero-shot/data-free knowledge distillation from 2019 to 2021
☆11 · Sep 8, 2021 · Updated 4 years ago
PKU-ICST-MIPL / FineParser_CVPR2024
☆27 · Oct 11, 2024 · Updated last year
pranoyr / large-scale-visual-relationship-understanding
Visual Relationship Understanding
☆10 · Oct 2, 2021 · Updated 4 years ago
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago
ZIB-IOL / SMS
Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12 · Oct 14, 2025 · Updated 9 months ago
huanranchen / SupContrast
A PyTorch implementation of Supervised Contrastive Learning with memory bank (queue)
☆15 · May 25, 2022 · Updated 4 years ago
tihbe / python-ebdataset
An event based dataset loader under one common python API.
☆10 · Mar 22, 2022 · Updated 4 years ago
Visual-AI / FROSTER
[ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition
☆101 · Jan 14, 2025 · Updated last year
wanglimin / improved_trajectory
Improved trajectories for action recognition
☆22 · Apr 10, 2016 · Updated 10 years ago
Richard-61 / FineAction
The official codebase of FineAction dataset. We will update the data and code of our FineAction.
☆24 · Apr 10, 2025 · Updated last year
tianyu139 / tangent-model-composition
Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…
☆14 · May 14, 2024 · Updated 2 years ago
haonanwang0522 / GTPT
[ECCV 2024] GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation
☆19 · Oct 5, 2024 · Updated last year
weinman / MapTextSynthesizer
A synthetic training data generator for a text recognition CNN
☆10 · Jul 8, 2019 · Updated 7 years ago
vivoutlaw / tcbp
Temporal Compact Bilinear Pooling (TCBP)
☆11 · May 27, 2020 · Updated 6 years ago