This is an official PyTorch implementation of "Neighbor Relations Matter in Video Scene Detection".
☆28 · Mar 19, 2025 · Updated 11 months ago
Alternatives and similar repositories for NeighborNet
Users who are interested in NeighborNet are comparing it to the repositories listed below.
- ☆34 · Jun 2, 2023 · Updated 2 years ago
- Official Implementation of ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25) ☆23 · Nov 25, 2025 · Updated 3 months ago
- Agentic Keyframe Search for Video Question Answering ☆16 · Apr 7, 2025 · Updated 10 months ago
- ReNeg: Learning Negative Embedding with Reward Guidance ☆17 · Jan 17, 2025 · Updated last year
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning. ☆49 · Dec 10, 2025 · Updated 2 months ago
- Learning Situation Hyper-Graphs for Video Question Answering ☆22 · Feb 16, 2024 · Updated 2 years ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos' ☆35 · Feb 26, 2025 · Updated last year
- ☆18 · Jun 10, 2025 · Updated 8 months ago
- Official implementation for paper TEVAD: Improved video anomaly detection with captions ☆38 · Apr 5, 2023 · Updated 2 years ago
- A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios ☆13 · Jan 24, 2024 · Updated 2 years ago
- ☆13 · Aug 28, 2024 · Updated last year
- Edge Impulse FOMO Implementation from scratch ☆16 · Updated this week
- Project developed for AI Launch Lab's R&D program. TradeMind is a Machine Learning Stock Analysis tool aimed to give you more confidence … ☆13 · Aug 26, 2024 · Updated last year
- SKFAC Preconditioner for MindSpore ☆12 · Jul 2, 2021 · Updated 4 years ago
- CoS: Chain-of-Shot Prompting for Long Video Understanding ☆53 · Feb 13, 2025 · Updated last year
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval. ☆47 · Oct 14, 2024 · Updated last year
- Solving Logic Grid Puzzles with Part-of-Speech Tagging and First-Order Logic ☆11 · Dec 18, 2016 · Updated 9 years ago
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668. ☆12 · Jun 19, 2024 · Updated last year
- ☆15 · Feb 18, 2024 · Updated 2 years ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight) ☆20 · Aug 1, 2025 · Updated 7 months ago
- ☆16 · Oct 9, 2024 · Updated last year
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition ☆10 · Jul 1, 2022 · Updated 3 years ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination ☆27 · Nov 24, 2025 · Updated 3 months ago
- CVPR 2025 Accepted Papers ☆23 · Dec 20, 2025 · Updated 2 months ago
- ☆14 · Sep 11, 2025 · Updated 5 months ago
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection". ☆11 · Nov 13, 2024 · Updated last year
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021 ☆11 · May 16, 2021 · Updated 4 years ago
- quagga ☆10 · Apr 7, 2020 · Updated 5 years ago
- Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025 ☆25 · Nov 28, 2025 · Updated 3 months ago
- The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization" ☆10 · Dec 24, 2024 · Updated last year
- ☆19 · Jul 22, 2025 · Updated 7 months ago
- Code for MME-SID accepted to CIKM 2025 Full Research track. ☆27 · Oct 29, 2025 · Updated 4 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs ☆23 · Sep 21, 2025 · Updated 5 months ago
- [CVPR 2024] Official repository of ST_GT ☆10 · Sep 15, 2024 · Updated last year
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions ☆21 · Feb 11, 2026 · Updated 2 weeks ago
- ✨✨[NeurIPS 2025] This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehensi… ☆400 · Jan 14, 2026 · Updated last month
- VideoAuteur: Towards Long Narrative Video Generation ☆43 · Oct 22, 2025 · Updated 4 months ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization ☆39 · Jul 29, 2022 · Updated 3 years ago
- ☆48 · Sep 22, 2023 · Updated 2 years ago