ExMorgan-Alter / NeighborNetLinks
This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection.
β28Updated 10 months ago
Alternatives and similar repositories for NeighborNet
Users that are interested in NeighborNet are comparing it to the libraries listed below
Sorting:
- π R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)β90Updated last year
- β80Updated last year
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"β92Updated 10 months ago
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Abilityβ105Updated last year
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)β34Updated 9 months ago
- β25Updated 2 months ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localizationβ39Updated 3 years ago
- β37Updated last year
- [CVPR 2024] Context-Guided Spatio-Temporal Video Groundingβ65Updated last year
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)β55Updated 2 years ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequencesβ42Updated 10 months ago
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Modelβ37Updated last year
- [CVPR 2024] Bridging the Gap: A Unified Video Comprehension Framework for Moment Retrieval and Highlight Detectionβ114Updated last year
- β73Updated last year
- [ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modelingβ143Updated 5 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videosβ145Updated last year
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grβ¦β148Updated last year
- [EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Modelsβ139Updated 5 months ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.β47Updated last year
- β120Updated last year
- β58Updated last year
- Unified layout planning and image generation, ICCV2025β40Updated 2 weeks ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"β32Updated 10 months ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".β53Updated 3 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"β77Updated last year
- β54Updated last year
- β106Updated last year
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)β40Updated 2 years ago
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanβ¦β40Updated last year
- [AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Groundingβ124Updated last year