Yui010206/VEGGIE-VidEdit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yui010206/VEGGIE-VidEdit)

Yui010206 / VEGGIE-VidEdit

[ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation

☆34

Alternatives and similar repositories for VEGGIE-VidEdit

Users that are interested in VEGGIE-VidEdit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Yui010206 / MEXA
View on GitHub
[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
☆15Aug 22, 2025Updated 11 months ago
Ziyang412 / Video-RTS
View on GitHub
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24Feb 18, 2026Updated 5 months ago
OpenVE-Team / OpenVE-3M
View on GitHub
OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing
☆51Apr 15, 2026Updated 3 months ago
jaehong31 / RACCooN
View on GitHub
(EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives
☆37Dec 20, 2025Updated 7 months ago
langmanbusi / InsViE
View on GitHub
Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”
☆34Apr 3, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Yui010206 / Adaptive-Visual-Imagination-Control
View on GitHub
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18Jun 2, 2026Updated last month
LinglingCai0314 / FreeMask
View on GitHub
☆11Jan 18, 2025Updated last year
UCSB-AI / via-video
View on GitHub
☆25May 12, 2026Updated 2 months ago
daeunni / Video-Skill-CoT
View on GitHub
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18Aug 27, 2025Updated 11 months ago
Yui010206 / Ego2Web
View on GitHub
[CVPR 2026] Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
☆29Mar 25, 2026Updated 4 months ago
knightyxp / VideoCoF
View on GitHub
[CVPR 2026 Highlight] VideoCoF: Unified Video Editing with Temporal Reasoner
☆205Jun 17, 2026Updated last month
daspartho / DiffEdit
View on GitHub
my attempt at implementing the DiffEdit paper (WIP)
☆16Oct 30, 2022Updated 3 years ago
daeunni / VideoRepair
View on GitHub
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"
☆52Apr 7, 2026Updated 3 months ago
CUC-MIPG / Edit-Transfer
View on GitHub
Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"
☆89Jun 6, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zibojia / SENORITA
View on GitHub
This is the official implementation of our Señorita-2M [Weights and Dataset] : A High-Quality Instruction-based Dataset for General Video…
☆112Apr 9, 2025Updated last year
bimsarapathiraja / refedit
View on GitHub
[ICCV 2025] Official Implementation of RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model for Referring …
☆20Jun 27, 2025Updated last year
MC-E / InstructX
View on GitHub
☆86Oct 10, 2025Updated 9 months ago
mingcv / TADiSR
View on GitHub
Official implementation for "Text-Aware Real-World Image Super-Resolution via Diffusion Model with Joint Segmentation Decoders"
☆20May 29, 2025Updated last year
Yui010206 / CREMA
View on GitHub
[ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
☆56Jul 1, 2025Updated last year
daeunni / StreamGaze
View on GitHub
Code for "StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos"
☆27May 13, 2026Updated 2 months ago
TinyTigerPan / tiger200k
View on GitHub
☆27Jul 5, 2025Updated last year
jialuli-luka / SELMA
View on GitHub
Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
☆35Mar 12, 2024Updated 2 years ago
mbzuai-oryx / Video-CoM
View on GitHub
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22Jun 17, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
knightyxp / VideoGrain
View on GitHub
[ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …
☆159Mar 24, 2025Updated last year
mbzuai-oryx / Video-R2
View on GitHub
Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models
☆19Jan 21, 2026Updated 6 months ago
leoisufa / ICVE
View on GitHub
[Preprint 2025] ICVE: In-Context Learning with Unpaired Clips for Instruction-based Video Editing
☆25Jun 2, 2026Updated last month
amandpkr / Efficient-3D-Aware-Facial-Image-Editing
View on GitHub
[ECCV 2024] Official code repository of paper titled "Efficient 3D-Aware Facial Image Editing Via Attribute-Specific Prompt Learning"
☆10Aug 2, 2024Updated last year
CeeZh / SILVR
View on GitHub
Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"
☆19Jan 18, 2026Updated 6 months ago
MinghanLi / FiVE-Bench
View on GitHub
[ICCV 2025] FiVE-Bench: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models
☆38Apr 2, 2026Updated 3 months ago
tetrzim / diffusion-human-feedback
View on GitHub
Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback
☆26Aug 27, 2023Updated 2 years ago
byeongjun-park / ReDirector
View on GitHub
[CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"
☆34Dec 17, 2025Updated 7 months ago
TencentARC / VideoPainter
View on GitHub
[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"
☆626Apr 8, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
UVA-Computer-Vision-Lab / FrameINO
View on GitHub
[NeurIPS 2025] Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
☆33May 1, 2026Updated 2 months ago
Yui010206 / MoPRL
View on GitHub
[TCSVT] Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
☆17Jul 22, 2023Updated 3 years ago
JIA-Lab-research / UnityVideo
View on GitHub
[CVPR 2026]UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
☆321Jul 14, 2026Updated 2 weeks ago
facebookresearch / EgoAVU
View on GitHub
[CVPR 2026 highlight] Official release of EgoAVU Egocentric Audio-Visual Understanding
☆33Jun 8, 2026Updated last month
WeChatCV / NovaEdit
View on GitHub
[CVPR26] Nova: Video Editing via single/multiple frame references
☆50Mar 4, 2026Updated 4 months ago
libaolu312 / VFXMaster
View on GitHub
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
☆65Apr 7, 2026Updated 3 months ago
wz0919 / EPiC
View on GitHub
[ICML2026] Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
☆50Jun 2, 2025Updated last year