daeunni/VideoRepair

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/daeunni/VideoRepair)

daeunni / VideoRepair

Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"

☆52

Alternatives and similar repositories for VideoRepair

Users that are interested in VideoRepair are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

daeunni / Video-Skill-CoT
View on GitHub
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18Aug 27, 2025Updated 10 months ago
Yui010206 / MEXA
View on GitHub
[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
☆15Aug 22, 2025Updated 11 months ago
jaehong31 / RACCooN
View on GitHub
(EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives
☆37Dec 20, 2025Updated 7 months ago
Ziyang412 / Video-RTS
View on GitHub
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24Feb 18, 2026Updated 5 months ago
daeunni / StreamGaze
View on GitHub
Code for "StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos"
☆26May 13, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Yui010206 / CREMA
View on GitHub
[ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
☆56Jul 1, 2025Updated last year
Yui010206 / Adaptive-Visual-Imagination-Control
View on GitHub
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18Jun 2, 2026Updated last month
wz0919 / DreamRunner
View on GitHub
[AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation
☆78Jun 11, 2025Updated last year
yahoojapan / srgd
View on GitHub
Official implementation of "Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion" [ACCV2024]
☆19Dec 9, 2024Updated last year
XingruiWang / 3D-Aware-VQA
View on GitHub
Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"
☆21Oct 17, 2024Updated last year
jialuli-luka / SELMA
View on GitHub
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆35Mar 12, 2024Updated 2 years ago
mincheoree / BEVMap
View on GitHub
[WACV 2024] This is the official implementation of BEVMap, a map-aware BEV modeling framework for multiview-camera detection
☆22Dec 28, 2023Updated 2 years ago
Yui010206 / VEGGIE-VidEdit
View on GitHub
[ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
☆34Aug 18, 2025Updated 11 months ago
mung3477 / SemanticControl
View on GitHub
[BMVC 2025] Official implementation of SemanticControl: A Training-Free Approach for Handling Loosely Aligned Visual Conditions in Contro…
☆18Aug 27, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zeyofu / Commonsense-T2I
View on GitHub
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
☆24Aug 13, 2024Updated last year
HuiGuanLab / RaTSG
View on GitHub
This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13Aug 22, 2025Updated 11 months ago
wz0919 / EPiC
View on GitHub
[ICML2026] Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
☆50Jun 2, 2025Updated last year
jialuli-luka / Video-MSG
View on GitHub
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆28Apr 14, 2025Updated last year
MS-LIMA / ImagePG
View on GitHub
[WACV 2026] Image-Guided Semantic Pseudo-LiDAR Point Generation for 3D Object Detection
☆18Feb 25, 2026Updated 4 months ago
ylsung / rsq
View on GitHub
Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"
☆23Mar 25, 2026Updated 3 months ago
Aurora-slz / MM-Verify
View on GitHub
☆19Oct 28, 2025Updated 8 months ago
jaehong31 / SAFREE
View on GitHub
[ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation
☆59Jan 22, 2025Updated last year
bigai-nlco / VideoLLaMB
View on GitHub
[ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
☆87Feb 27, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Yui010206 / Ego2Web
View on GitHub
[CVPR 2026] Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
☆29Mar 25, 2026Updated 3 months ago
daeunni / BECoTTA
View on GitHub
Code for "BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation [ICML2024]".
☆51Jun 16, 2024Updated 2 years ago
Ysz2022 / SEPC
View on GitHub
[TIM 2023] Multi-scale Synergism Ensemble Progressive and Contrastive Investigation for Image Restoration
☆14Jan 18, 2026Updated 6 months ago
linzhiqiu / CLIP-FlanT5
View on GitHub
Training code for CLIP-FlanT5
☆31Jul 29, 2024Updated last year
MetabrainAGI / Awaker2.5-VL
View on GitHub
☆35Jan 21, 2025Updated last year
gqk / HiCoM
View on GitHub
[NeurIPS 2024] HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting
☆44Dec 24, 2024Updated last year
linzhiqiu / visual_gpt_score
View on GitHub
VisualGPTScore for visio-linguistic reasoning
☆27Oct 7, 2023Updated 2 years ago
gqk / R-DFCIL
View on GitHub
R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning, ECCV2022 [PyTorch Code]
☆14Sep 19, 2022Updated 3 years ago
camenduru / AdvancedLivePortrait-jupyter
View on GitHub
☆11Sep 28, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
UNITES-Lab / Mew
View on GitHub
[ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network"
☆17Jul 27, 2024Updated last year
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
JarrentWu1031 / SingleInsert
View on GitHub
Official pytorch implementation for SingleInsert
☆29Apr 19, 2024Updated 2 years ago
TonyLianLong / LLM-groundedVideoDiffusion
View on GitHub
[ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper
☆172May 7, 2024Updated 2 years ago
KaiyueSun98 / T2I-ReasonBench
View on GitHub
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
☆37Sep 16, 2025Updated 10 months ago
LinglingCai0314 / FreeMask
View on GitHub
☆11Jan 18, 2025Updated last year
SJTU-DENG-Lab / LOVECon
View on GitHub
Official implementation for "LOVECon: Text-driven Training-free Long Video Editing with ControlNet"
☆43Oct 26, 2023Updated 2 years ago