jaehong31/RACCooN

(EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives

☆37

Alternatives and similar repositories for RACCooN

Users who are interested in RACCooN are comparing it to the repositories listed below.

daeunni / VideoRepair
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"
☆52 · Apr 7, 2026 · Updated 3 months ago
Yui010206 / MEXA
[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
☆15 · Aug 22, 2025 · Updated 11 months ago
Yui010206 / VEGGIE-VidEdit
[ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
☆34 · Aug 18, 2025 · Updated 11 months ago
daeunni / Video-Skill-CoT
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18 · Aug 27, 2025 · Updated 11 months ago
Yui010206 / Adaptive-Visual-Imagination-Control
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18 · Jun 2, 2026 · Updated last month
sunilhoho / EVEREST
Official PyTorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
☆31 · Jun 15, 2024 · Updated 2 years ago
SobeyMIL / MVOC
Code for "MVOC: a training-free multiple video object composition method with diffusion models"
☆23 · Jul 3, 2024 · Updated 2 years ago
DongkiKim95 / GDSS-Transformer
Code Repository for GDSS using Graph Transformer
☆17 · Nov 16, 2023 · Updated 2 years ago
jaehong31 / OCS
Online Coreset Selection for Rehearsal-based Continual Learning, ICLR 2022
☆23 · Oct 19, 2022 · Updated 3 years ago
Yui010206 / CREMA
[ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
☆56 · Jul 1, 2025 · Updated last year
ylsung / rsq
Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"
☆23 · Mar 25, 2026 · Updated 4 months ago
harryjo97 / GruM
Official Code Repository for the paper "Graph Generation with Diffusion Mixture" (ICML 2024).
☆35 · May 20, 2024 · Updated 2 years ago
DongkiKim95 / Mol-LLaMA
Official Code Repository for the paper "Mol-LLaMA: Towards General Understanding of Molecules in Large Molecular Language Model"
☆32 · Sep 30, 2025 · Updated 9 months ago
RUCAIBox / Event-Bench
Official code of *Towards Event-oriented Long Video Understanding*
☆12 · Jul 26, 2024 · Updated 2 years ago
agwmon / MuDI
[NeurIPS 2024] MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
☆96 · Jan 17, 2025 · Updated last year
JinheonBaek / KALMV
Official Code Repository for Knowledge-Augmented Language Model Verification (EMNLP 2023)
☆28 · Dec 22, 2023 · Updated 2 years ago
Nardien / KALA
Official Code Repository for the paper "KALA: Knowledge-Augmented Language Model Adaptation" (NAACL 2022)
☆35 · Oct 17, 2023 · Updated 2 years ago
CownowAn / DiffusionNAG
Official PyTorch implementation of "DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models" (ICLR 2024)
☆42 · Mar 20, 2024 · Updated 2 years ago
Ground-A-Video / Ground-A-Video
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)
☆140 · May 21, 2024 · Updated 2 years ago
CeeZh / SILVR
Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"
☆19 · Jan 18, 2026 · Updated 6 months ago
dan64 / vs-propainter
Vapoursynth filter using ProPainter: Improving Propagation and Transformer for Video Inpainting
☆19 · Jun 26, 2026 · Updated last month
wz0919 / EPiC
[ICML2026] Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance
☆50 · Jun 2, 2025 · Updated last year
Papple-F / csg
☆17 · Aug 8, 2024 · Updated last year
harryjo97 / riemannian-diffusion-mixture-torch
PyTorch implementation for "Generative Modeling on Manifolds Through Mixture of Riemannian Diffusion Processes" (ICML 2024).
☆13 · Jul 21, 2024 · Updated 2 years ago
doc-doc / NExT-GQA
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
☆89 · Jul 1, 2024 · Updated 2 years ago
videodreamer23 / videodreamer23.github.io
☆31 · Nov 7, 2023 · Updated 2 years ago
jianzongwu / Language-Driven-Video-Inpainting
(CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"
☆99 · Apr 17, 2024 · Updated 2 years ago
Flowerfan / VistaLLaMA
☆15 · Dec 11, 2024 · Updated last year
jialuli-luka / Video-MSG
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆28 · Apr 14, 2025 · Updated last year
qiujihao19 / Artemis
[NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos
☆27 · Apr 8, 2025 · Updated last year
VQAssessment / BVQI
[ICME 2023 Oral, Extended to TIP (UR)] The best zero-shot VQA approach that even outperforms several fully-supervised methods.
☆41 · Jul 11, 2023 · Updated 3 years ago
zhang-zx / AVID
This repository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.
☆177 · Feb 27, 2024 · Updated 2 years ago
PhysGame / PhysGame
PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos
☆49 · Jul 3, 2025 · Updated last year
harryjo97 / RDLM
Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025).
☆74 · Sep 25, 2025 · Updated 10 months ago
CNVid / CNVid-3.5M
This repository contains the dataset, codebase, and benchmarks for our paper: <CNVid-3.5M: Build, Filter, and Pre-train the Large-scale P…
☆26 · Nov 28, 2023 · Updated 2 years ago
ddehun / DEnsity
Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"
☆11 · May 23, 2023 · Updated 3 years ago
Ziyang412 / UCoFiA
PyTorch code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
☆66 · Jun 7, 2024 · Updated 2 years ago
zcai0612 / InstantBooth
My implementation of InstantBooth
☆14 · Sep 11, 2023 · Updated 2 years ago
Yui010206 / Ego2Web
[CVPR 2026] Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
☆29 · Mar 25, 2026 · Updated 4 months ago