UCSC-VLAA / story-adapterLinks
A Training-free Iterative Framework for Long Story Visualization
☆908Updated 7 months ago
Alternatives and similar repositories for story-adapter
Users that are interested in story-adapter are comparing it to the libraries listed below
Sorting:
- ☆597Updated 3 weeks ago
- StoryMaker: Towards consistent characters in text-to-image generation☆706Updated 8 months ago
- This is a study aim to transfer the single concept by using DIT model self-attention capablity☆766Updated 9 months ago
- Project Page repo of OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication☆376Updated 3 weeks ago
- ☆264Updated 11 months ago
- SD变现宝:一键把comfyui工作流转换成小程序。☆1,430Updated 6 months ago
- JoyHallo: Digital human model for Mandarin☆503Updated 9 months ago
- The fastest digital human algorithm, now on your desktop.☆544Updated 2 months ago
- CogView4, CogView3-Plus and CogView3(ECCV 2024)☆1,083Updated 4 months ago
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.☆427Updated last month
- SEED-Story: Multimodal Long Story Generation with Large Language Model☆865Updated 10 months ago
- Open CapCut API.☆817Updated this week
- [TPAMI under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"☆575Updated 7 months ago
- Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"☆845Updated 6 months ago
- Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"☆3,004Updated last month
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆216Updated 4 months ago
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆284Updated 2 months ago
- (SIGGRAPH Asia 2024) This is the official PyTorch implementation of SIGGRAPH Asia 2024 paper: DrawingSpinUp: 3D Animation from Single Cha…☆640Updated 3 months ago
- The official HelloMeme GitHub site☆620Updated last month
- Fogsight is an AI agent and animation engine powered by Large Language Models.☆870Updated 2 weeks ago
- AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation☆445Updated 4 months ago
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,250Updated 5 months ago
- gradio WebUI for AdvancedLivePortrait☆507Updated 5 months ago
- AutoClip: AI-powered video clipping and highlight generation · 一款智能高光提取与剪辑的二创工具☆413Updated this week
- You can using EchoMimic in ComfyUI☆663Updated this week
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆557Updated 2 months ago
- [ICCV 2025] Code Implementation of "ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples"☆413Updated 3 months ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆454Updated 9 months ago
- ☆591Updated 9 months ago
- talking-face video editing☆374Updated 5 months ago