agwmon / frame-guidance
Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (ICLR 2026)
☆41 · Jul 10, 2025 · Updated 7 months ago
Alternatives and similar repositories for frame-guidance
Users interested in frame-guidance are comparing it to the repositories listed below.
- [ACM MM 2025] MLLMs for Aesthetics Reasoning ☆23 · Jan 5, 2026 · Updated last month
- [AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation ☆77 · Jun 11, 2025 · Updated 8 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025). ☆12 · Sep 22, 2025 · Updated 4 months ago
- [TMM 2025] Official Implementation of DreamJourney: Perpetual View Generation with Video Diffusion Models ☆16 · Jun 24, 2025 · Updated 7 months ago
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP ☆16 · Apr 17, 2025 · Updated 10 months ago
- The official repository of EffiVED ☆19 · Jun 5, 2024 · Updated last year
- On Path to Multimodal Generalist: General-Level and General-Bench ☆18 · Jul 11, 2025 · Updated 7 months ago
- ☆23 · Jul 20, 2025 · Updated 6 months ago
- ☆16 · Feb 21, 2025 · Updated 11 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval ☆16 · Jul 6, 2024 · Updated last year
- Official repository for the paper: "Controlling Geometric Abstraction and Texture for Artistic Images" ☆15 · Aug 2, 2023 · Updated 2 years ago
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability". ☆47 · Dec 25, 2025 · Updated last month
- lite attention implemented over flash attention 3 ☆45 · Updated this week
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling* ☆31 · Dec 13, 2025 · Updated 2 months ago
- Official repo for StyleMe3D ☆28 · Apr 22, 2025 · Updated 9 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models" ☆48 · Aug 19, 2024 · Updated last year
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework ☆44 · Jan 25, 2026 · Updated 3 weeks ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation' ☆37 · Dec 30, 2025 · Updated last month
- DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking ☆49 · Dec 18, 2025 · Updated last month
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement" ☆52 · Dec 5, 2024 · Updated last year
- 🎮Manipulates mobile phones just like how you would. Official code for "MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficien… ☆27 · Oct 10, 2025 · Updated 4 months ago
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing" ☆76 · Dec 12, 2025 · Updated 2 months ago
- Code implementation for "Feedforward 3D Editing via Text-Steerable Image-to-3D" ☆44 · Dec 23, 2025 · Updated last month
- A token-based 3D rendering engine for ComfyUI. ☆33 · Aug 8, 2025 · Updated 6 months ago
- The code of the paper "Free-Lunch Color-Texture Disentanglement for Stylized Image Generation" ☆36 · Sep 18, 2025 · Updated 4 months ago
- MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models (NeurIPS 2024) ☆97 · Jan 17, 2025 · Updated last year
- Official Code Release of NeurIPS 2025 Paper: HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video ☆89 · Oct 8, 2025 · Updated 4 months ago
- Transform ComfyUI into a Universal AI Vibe Coding Agent — From Code to Productivity Automation ☆32 · Jun 5, 2025 · Updated 8 months ago
- ☆21 · Oct 31, 2024 · Updated last year
- [NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions ☆84 · Jul 14, 2025 · Updated 7 months ago
- [IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts ☆26 · Feb 28, 2025 · Updated 11 months ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models ☆57 · Oct 7, 2025 · Updated 4 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆112 · Dec 4, 2025 · Updated 2 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation ☆39 · Aug 3, 2025 · Updated 6 months ago
- ComfyUI-AniSora is now available in ComfyUI, Index-AniSora is the most powerful open-source animated video generation model. It enables o… ☆48 · May 27, 2025 · Updated 8 months ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory ☆414 · Jul 25, 2025 · Updated 6 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆13 · Jun 28, 2025 · Updated 7 months ago
- A Text2SQL benchmark for evaluation of Large Language Models ☆41 · Feb 8, 2026 · Updated last week
- [ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis ☆110 · Nov 3, 2025 · Updated 3 months ago