jaehong31 / RACCooNView external linksLinks
(EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives
☆37Dec 20, 2025Updated last month
Alternatives and similar repositories for RACCooN
Users that are interested in RACCooN are comparing it to the libraries listed below
Sorting:
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆52Dec 5, 2024Updated last year
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"☆15Aug 27, 2025Updated 5 months ago
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Aug 18, 2025Updated 5 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Mar 12, 2024Updated last year
- code for "MVOC:atraining-free multiple video object composition method with diffusion models"☆23Jul 3, 2024Updated last year
- Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].☆30Jun 15, 2024Updated last year
- ☆30Nov 7, 2023Updated 2 years ago
- Online Coreset Selection for Rehearsal-based Continual Learning, ICLR 2022☆24Oct 19, 2022Updated 3 years ago
- Official code of *Towards Event-oriented Long Video Understanding*☆12Jul 26, 2024Updated last year
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 3 weeks ago
- Official Code Repository for the paper - Personalized Subgraph Federated Learning (ICML 2023)☆53Jul 2, 2023Updated 2 years ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆56Jul 1, 2025Updated 7 months ago
- ☆27Apr 8, 2025Updated 10 months ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆83Jul 1, 2024Updated last year
- This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.☆176Feb 27, 2024Updated last year
- ☆15May 9, 2024Updated last year
- Code Repository for GDSS using Graph Transformer☆17Nov 16, 2023Updated 2 years ago
- Vapoursynth filter using ProPainter: Improving Propagation and Transformer for Video Inpainting☆15Jan 2, 2026Updated last month
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024)☆97Jan 17, 2025Updated last year
- (CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"☆99Apr 17, 2024Updated last year
- Official Code Repository for the paper "Graph Generation with Diffusion Mixture" (ICML 2024).☆36May 20, 2024Updated last year
- Official PyTorch implementation of paper “InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction”☆33Jul 28, 2025Updated 6 months ago
- ☆17Aug 8, 2024Updated last year
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis☆21Mar 23, 2025Updated 10 months ago
- Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)☆140May 21, 2024Updated last year
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Nov 23, 2023Updated 2 years ago
- Statistics and Visualization of acceptance rate, main keyword of NeurIPS 2020 accepted papers☆16Oct 12, 2020Updated 5 years ago
- ☆18Nov 25, 2023Updated 2 years ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆24Apr 14, 2025Updated 10 months ago
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆162Apr 7, 2024Updated last year
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆75Apr 2, 2025Updated 10 months ago
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆48Jul 3, 2025Updated 7 months ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆53Jan 22, 2025Updated last year
- Official Code Repository for the paper "Graph Self-supervised Learning with Accurate Discrepancy Learning" (NeurIPS 2022)☆18Oct 10, 2022Updated 3 years ago
- [CVPR 2024] VidToMe: Video Token Merging for Zero-Shot Video Editing☆20Feb 29, 2024Updated last year
- Official PyTorch implementation of "DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models" (ICLR 2024)☆43Mar 20, 2024Updated last year
- ☆55Updated this week
- [NeurIPS 2024] Official Code for the Paper "Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning"☆27Apr 8, 2025Updated 10 months ago
- VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)☆198Mar 29, 2024Updated last year