TencentARC / SEED-StoryLinks

SEED-Story: Multimodal Long Story Generation with Large Language Model

☆874

Alternatives and similar repositories for SEED-Story

Users that are interested in SEED-Story are comparing it to the libraries listed below

Sorting:

FireRedTeam / StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
☆713Updated 11 months ago
AIGText / Glyph-ByT5
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and…
☆615Updated 2 months ago
donahowe / AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
☆447Updated 7 months ago
kongzhecn / OMG
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
☆697Updated last year
williamyang1991 / FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
☆779Updated last year
TencentARC / BrushEdit
[under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
☆582Updated 2 months ago
MyNiuuu / MOFA-Video
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
☆755Updated 11 months ago
mayuelala / FollowYourClick
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via S…
☆907Updated 2 months ago
UCSC-VLAA / story-adapter
A Training-free Iterative Framework for Long Story Visualization
☆929Updated 10 months ago
Jeff-LiangF / streamv2v
Official Pytorch implementation of StreamV2V.
☆520Updated 9 months ago
AILab-CVC / SEED-X
Multimodal Models in Real World
☆548Updated 8 months ago
zai-org / CogView4
CogView4, CogView3-Plus and CogView3(ECCV 2024)
☆1,091Updated 7 months ago
sail-sg / CLoT
CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative H…
☆321Updated last year
Picsart-AI-Research / StreamingT2V
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
☆1,611Updated 7 months ago
byliutao / 1Prompt1Story
🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
☆306Updated last month
modelscope / scepter
SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models.
☆545Updated 7 months ago
TIGER-AI-Lab / AnyV2V
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]
☆634Updated last year
AILab-CVC / VideoGen-Eval
VideoGen-Eval: Agent-based System for Video Generation Evaluation
☆250Updated 7 months ago
jianzongwu / DiffSensei
Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
☆877Updated 9 months ago
Vchitect / Vlogger
[CVPR2024] Make Your Dream A Vlog
☆428Updated 6 months ago
zhoudaquan / ChatAnything
Official Repo for the Paper: CHATANYTHING: FACETIME CHAT WITH LLM-ENHANCED PERSONAS
☆382Updated last year
Xiaojiu-z / Stable-Hair
Pytorch Implementation of: "Stable-Hair: Real-World Hair Transfer via Diffusion Model" (AAAI 2025)
☆516Updated 8 months ago
Boese0601 / MagicDance
[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion
☆771Updated last year
open-mmlab / PIA
[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos…
☆974Updated last year
Deaddawn / DreamFrame-code
☆184Updated 3 months ago
Vchitect / LaVie
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
☆939Updated last year
Aria-Zhangjl / StoryWeaver
[AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
☆222Updated 7 months ago
cangcz / AnchorCrafter
☆634Updated 3 months ago
dvlab-research / ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
☆1,621Updated last year
design-edit / DesignEdit
DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework
☆356Updated 11 months ago