xiaoqian-shen / StoryGPT-VView external linksLinks
[CVPR 2025] Official PyTorch implementation of StoryGPT-V
☆40Jun 14, 2025Updated 8 months ago
Alternatives and similar repositories for StoryGPT-V
Users that are interested in StoryGPT-V are comparing it to the libraries listed below
Sorting:
- Official Pytorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models☆202Jul 9, 2023Updated 2 years ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆16Aug 30, 2024Updated last year
- Implementation of the AAAI-21 Workshop on Scientific Document Understanding paper "A Paragraph-level Multi-task Learning Model for Scient…☆15Oct 9, 2023Updated 2 years ago
- Visual Storytelling post-edit dataset☆18Sep 27, 2019Updated 6 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Structured Diffusion Guidance for Compositional Text…☆121Mar 29, 2023Updated 2 years ago
- An application that generates images or videos using Stable Diffusion models.☆22Nov 2, 2022Updated 3 years ago
- A PyTorch implementation of TVC☆24Dec 18, 2023Updated 2 years ago
- ☆22Sep 28, 2023Updated 2 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Jun 28, 2021Updated 4 years ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models☆262Dec 2, 2024Updated last year
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]☆34Dec 12, 2023Updated 2 years ago
- MLPs for Vision and Langauge Modeling (Coming Soon)☆27Dec 9, 2021Updated 4 years ago
- ☆31Mar 24, 2022Updated 3 years ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Nov 21, 2024Updated last year
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆45Jul 1, 2025Updated 7 months ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- 🤗 Unofficial huggingface/diffusers-based implementation of the paper "Training-Free Layout Control with Cross-Attention Guidance".☆42May 24, 2023Updated 2 years ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- [ECCV2024] StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion☆40Jul 5, 2024Updated last year
- Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding☆11May 23, 2024Updated last year
- Basic template for using Flan-t5 on Banana's serverless GPU platform. Ready for 1-Click deploy☆11Jan 30, 2023Updated 3 years ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- ☆11Feb 18, 2022Updated 4 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Nov 13, 2023Updated 2 years ago
- ☆11May 24, 2024Updated last year
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Apr 22, 2021Updated 4 years ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.☆86Jun 22, 2023Updated 2 years ago
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆48Apr 10, 2025Updated 10 months ago
- ☆88Jul 4, 2024Updated last year
- AI NPCs that can control their actions along with dialogue. For instance, if I ask an NPC to tell me its favorite magic spell, it not onl…☆48Oct 31, 2023Updated 2 years ago
- ☆10Feb 22, 2022Updated 3 years ago
- Takes a list of vertices and faces, giving you back an array of individual triangles.☆11Nov 18, 2015Updated 10 years ago
- This repository is created on top of two repositories i.e., yolov7 face detection and yolov7 blurring object☆15Jan 21, 2023Updated 3 years ago
- ☆10Jul 20, 2020Updated 5 years ago