xiaoqian-shen/StoryGPT-V

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xiaoqian-shen/StoryGPT-V)

xiaoqian-shen / StoryGPT-V

[CVPR 2025] Official PyTorch implementation of StoryGPT-V

☆42

Alternatives and similar repositories for StoryGPT-V

Users that are interested in StoryGPT-V are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ubc-vision / Make-A-Story
View on GitHub
Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023
☆43Jun 27, 2023Updated 3 years ago
YeLuoSuiYou / openstorypp
View on GitHub
We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.
☆18Aug 30, 2024Updated last year
xichenpan / ARLDM
View on GitHub
Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
☆203Jul 9, 2023Updated 3 years ago
volkancirik / refer360
View on GitHub
Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"
☆15Jun 26, 2021Updated 5 years ago
MSiam / motion_adaptation
View on GitHub
☆11Jun 2, 2019Updated 7 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
tingyaohsu / VIST-Edit
View on GitHub
Visual Storytelling post-edit dataset
☆18Sep 27, 2019Updated 6 years ago
zomian87x / selectable-palette-editor
View on GitHub
☆18Nov 20, 2024Updated last year
shunk031 / training-free-structured-diffusion-guidance
View on GitHub
🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…
☆120Mar 29, 2023Updated 3 years ago
Kilichbek / artemis-speaker-tools-b
View on GitHub
Artemis Speaker Tools B
☆24Apr 4, 2021Updated 5 years ago
CyberAgentAILab / sprite-decompose
View on GitHub
[ECCV2024] Fast Sprite Decomposition from Animated Graphics
☆31Sep 26, 2024Updated last year
HuiGuanLab / RaTSG
View on GitHub
This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13Aug 22, 2025Updated 11 months ago
Vision-CAIR / saai-factory-tutorial-creative-ai
View on GitHub
Creative AI for Visual Art and Music slides and demos.
☆11May 2, 2023Updated 3 years ago
Vision-CAIR / iMotion-LLM
View on GitHub
☆15Apr 23, 2026Updated 3 months ago
Maryeon / whiten_mtd
View on GitHub
Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"
☆11Dec 20, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
twelvelabs-io / pegasus-1-eval
View on GitHub
Repository for evaluating Pegasus-1 and video-language foundation models
☆14Nov 12, 2024Updated last year
jacklxc / ParagraphJointModel
View on GitHub
Implementation of the AAAI-21 Workshop on Scientific Document Understanding paper "A Paragraph-level Multi-task Learning Model for Scient…
☆15Oct 9, 2023Updated 2 years ago
LingjieKong-fdu / CustAny
View on GitHub
Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)
☆47Apr 10, 2025Updated last year
haoningwu3639 / StoryGen
View on GitHub
[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
☆270Dec 2, 2024Updated last year
akashrajkn / dependency-parser
View on GitHub
Neural graph-based dependency parser
☆13Dec 20, 2017Updated 8 years ago
muzishen / RCDMs
View on GitHub
[AAAI 2025] 🎬RCDMs🎬: Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models. RCDMs improve story…
☆70Sep 30, 2025Updated 9 months ago
LiBingyu01 / U3M
View on GitHub
[Pattern Recognition 2025 🌟]Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation
☆10Jun 12, 2024Updated 2 years ago
tobran / StoryImager
View on GitHub
[ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
☆40Jul 5, 2024Updated 2 years ago
yhlleo / DWC-GAN
View on GitHub
[ACM MM 2020] DWC-GAN.
☆33Jun 29, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
adymaharana / storydalle
View on GitHub
☆336Feb 14, 2023Updated 3 years ago
discus0434 / evaluate-images-to-feed-diffusion
View on GitHub
Small notebook to preprocess and evaluate images.
☆14Nov 11, 2022Updated 3 years ago
Sheldonmao / MatSparse3D
View on GitHub
This repository contains the code for CVPRW 2024 paper: Generating Material-Aware 3D Models from Sparse Views
☆13Jun 11, 2024Updated 2 years ago
babajide07 / Redundant-Feature-Pruning-Pytorch-Implementation
View on GitHub
☆16Apr 20, 2020Updated 6 years ago
Vision-CAIR / dochaystacks
View on GitHub
Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025
☆26Jan 25, 2025Updated last year
WildVision-AI / LMM-Engines
View on GitHub
☆17Oct 22, 2024Updated last year
OpenGVLab / MM-Interleaved
View on GitHub
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
☆255Apr 3, 2024Updated 2 years ago
wenhaochai / claude-plugins
View on GitHub
Personal Claude Code plugin marketplace
☆16Updated this week
xiangyu-mm / EasyGen
View on GitHub
The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"
☆73Nov 21, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
WilliamYi96 / HGR-Net
View on GitHub
Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification. ECCV 2022.
☆18Jul 12, 2022Updated 4 years ago
Nihukat / Concept-Conductor
View on GitHub
☆17Feb 21, 2025Updated last year
ErgastiAlex / SCA-DM
View on GitHub
☆15Apr 29, 2025Updated last year
RUCAIBox / Event-Bench
View on GitHub
Official code of *Towards Event-oriented Long Video Understanding*
☆12Jul 26, 2024Updated last year
Iniquitatis / sd-webui-temporal
View on GitHub
A "loopback on steroids" type of extension for Stable Diffusion Web UI.
☆31Oct 10, 2025Updated 9 months ago
isikmustafa / face-tracking
View on GitHub
RGB Face Tracking and Reconstruction on GPU using CUDA
☆16Feb 8, 2020Updated 6 years ago
Logeswaran123 / Stable-Diffusion-Playground
View on GitHub
An application that generates images or videos using Stable Diffusion models.
☆22Nov 2, 2022Updated 3 years ago