UCSC-VLAA / story-adapter
A Training-free Iterative Framework for Long Story Visualization
☆877Updated 3 months ago
Alternatives and similar repositories for story-adapter:
Users that are interested in story-adapter are comparing it to the libraries listed below
- StoryMaker: Towards consistent characters in text-to-image generation☆688Updated 4 months ago
- SD变现宝:一键把comfyui工作流转换成小程序。☆1,369Updated 2 months ago
- ☆530Updated this week
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆2,487Updated last month
- This is a study aim to transfer the single concept by using DIT model self-attention capablity☆710Updated 5 months ago
- Project Page repo of OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication☆223Updated last week
- Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI☆868Updated last month
- JoyHallo: Digital human model for Mandarin☆479Updated 5 months ago
- [TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"☆549Updated 3 months ago
- The fastest digital human algorithm, now on your desktop.☆499Updated 3 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆793Updated 2 months ago
- 一个超轻量级、可以在移动端实时运行的数字人模型☆1,835Updated last month
- The official HelloMeme GitHub site☆593Updated 3 weeks ago
- You can using EchoMimic in ComfyUI☆608Updated 2 weeks ago
- Diffusion-based Portrait and Animal Animation☆748Updated last month
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,010Updated last month
- talking-face video editing☆307Updated last month
- Taming Stable Diffusion for Lip Sync!☆3,725Updated this week
- ☆235Updated 7 months ago
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆246Updated 2 weeks ago
- Code Implementation of "PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data"☆377Updated last month
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…☆702Updated last week
- AI ContentCraft is an all-in-one content creation suite that helps creators generate stories, podcast scripts, and multimedia content usi…☆330Updated 3 months ago
- AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation☆438Updated last week
- Official repository of In-Context LoRA for Diffusion Transformers☆1,809Updated 4 months ago
- gradio WebUI for AdvancedLivePortrait☆480Updated last month
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆207Updated last week
- CogView4, CogView3-Plus and CogView3(ECCV 2024)☆1,003Updated 3 weeks ago
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆3,584Updated last month
- ☆572Updated 5 months ago