yongliang-wu / Repurpose
[AAAI2025] Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
☆16 Updated 4 months ago
Alternatives and similar repositories for Repurpose
Users who are interested in Repurpose are comparing it to the repositories listed below.
- [CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga ☆117 Updated last week
- Accepted by CVPR 2024 ☆37 Updated last year
- R1-like Video-LLM for Temporal Grounding ☆115 Updated 2 months ago
- [NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning ☆41 Updated 9 months ago
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan… ☆36 Updated 7 months ago
- [CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online ☆63 Updated last week
- 🔥CVPR 2025 Multimodal Large Language Models Paper List ☆153 Updated 5 months ago
- [NeurIPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos ☆133 Updated 8 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to unified multimodal models. ☆281 Updated 3 weeks ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation ☆196 Updated 2 weeks ago
- Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025 ☆69 Updated 5 months ago
- Official implementation of MC-LLaVA. ☆139 Updated 2 weeks ago
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection ☆111 Updated last month
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing ☆88 Updated last week
- Collections of Papers and Projects for Multimodal Reasoning. ☆106 Updated 4 months ago
- [ICLR 2025] TRACE: Temporal Grounding Video LLM via Causal Event Modeling ☆117 Updated 2 weeks ago
- [ICCV 25] VMBench: A Benchmark for Perception-Aligned Video Motion Generation ☆59 Updated last month
- Official code for MotionBench (CVPR 2025) ☆56 Updated 6 months ago
- [CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments". ☆285 Updated last year
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation ☆144 Updated 3 weeks ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning" ☆23 Updated last week
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆86 Updated last year
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating… ☆121 Updated 3 weeks ago
- [AAAI 2025] AL-Ref-SAM 2: Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video… ☆86 Updated 8 months ago
- TStar is a unified temporal search framework for long-form video question answering ☆63 Updated 2 weeks ago
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding ☆98 Updated last week
- VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning ☆179 Updated 2 weeks ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning ☆110 Updated 2 weeks ago
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? ☆81 Updated last month
- 🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024) ☆86 Updated last year