google/storybench

☆55

Alternatives and similar repositories for storybench

Users who are interested in storybench compare it with the repositories listed below.

google / video-localized-narratives
☆60 • Aug 10, 2023 • Updated 2 years ago
vivoutlaw / tcbp
Temporal Compact Bilinear Pooling (TCBP)
☆11 • May 27, 2020 • Updated 6 years ago
AILab-CVC / TaleCrafter
[SIGGRAPH Asia 2023] An interactive story visualization tool that supports multiple characters
☆268 • Mar 22, 2024 • Updated 2 years ago
CrickWu / Clevr-for-StoryGAN
StoryGAN clevr dataset
☆25 • Apr 15, 2019 • Updated 7 years ago
leexinhao / ZeroI2V
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆20 • Jul 29, 2024 • Updated last year
YunzeMan / Lexicon3D
[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
☆102 • Feb 2, 2025 • Updated last year
WalBouss / GEM
[CVPR24] Official Implementation of GEM (Grounding Everything Module)
☆139 • Apr 10, 2025 • Updated last year
JuanFMontesinos / PyNVIdeoReader
GPU-accelerated video decoder
☆20 • May 18, 2021 • Updated 5 years ago
m-bain / webvid
Large-scale text-video dataset. 10 million captioned short videos.
☆685 • Aug 14, 2024 • Updated last year
DAMO-NLP-SG / SSTuning
Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"
☆29 • Sep 25, 2023 • Updated 2 years ago
cambridgeltl / visual-spatial-reasoning
[TACL'23] VSR: A probing benchmark for spatial understanding of vision-language models.
☆149 • Mar 25, 2023 • Updated 3 years ago
facebookresearch / dualformer
implementation of dualformer
☆25 • Mar 1, 2025 • Updated last year
MCG-NJU / JoMoLD
[ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing
☆27 • Jul 15, 2022 • Updated 4 years ago
icoz69 / StableLLAVA
Official repo for StableLLAVA
☆94 • Dec 22, 2023 • Updated 2 years ago
adymaharana / storydalle
☆336 • Feb 14, 2023 • Updated 3 years ago
haochenheheda / LVVIS
Large-Vocabulary Video Instance Segmentation dataset
☆99 • Jul 5, 2024 • Updated 2 years ago
buxiangzhiren / VD-IT
Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024
☆48 • Sep 28, 2024 • Updated last year
MikeWangWZHL / Paxion
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" (NeurIPS 23 Spotlight)
☆38 • May 23, 2023 • Updated 3 years ago
YWolfeee / InfoTok
Codebase for InfoTok: Adaptive Discrete Video Tokenizer via Information-Theoretic Compression
☆53 • Mar 18, 2026 • Updated 4 months ago
Lzq5 / Video-Text-Alignment
☆28 • Jul 18, 2025 • Updated last year
gogoduan / GoT-R1
[ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning
☆106 • Jan 27, 2026 • Updated 5 months ago
SilentView / LVD-2M
[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"
☆79 • Oct 15, 2024 • Updated last year
facebookresearch / NeuralMemory
A Data Source for Reasoning Embodied Agents
☆20 • Sep 18, 2023 • Updated 2 years ago
mugen-org / MUGEN_coinrun
A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …
☆13 • Jul 13, 2022 • Updated 4 years ago
HKU-MMLab / EVATok
[CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"
☆61 • Mar 13, 2026 • Updated 4 months ago
technion-cs-nlp / ReFACT
☆13 • Apr 3, 2024 • Updated 2 years ago
pro2nit / STREAM
official implementation of 'STREAM : Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models'
☆28 • Dec 24, 2025 • Updated 7 months ago
Hritikbansal / jpo
☆13 • Jul 2, 2025 • Updated last year
songweige / TATS
Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV …
☆288 • May 1, 2024 • Updated 2 years ago
nirat1606 / OADis
Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022
☆34 • Aug 4, 2023 • Updated 2 years ago
soCzech / GenHowTo
Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024
☆54 • Mar 3, 2024 • Updated 2 years ago
AILab-CVC / FreeNoise
[ICLR 2024] Code for FreeNoise based on VideoCrafter
☆429 • Aug 25, 2025 • Updated 11 months ago
akakzia / decstr
☆15 • Aug 9, 2021 • Updated 4 years ago
zihuixue / ProgCaptioner
Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)
☆26 • Jul 16, 2025 • Updated last year
yonseivnl / cmota
☆10 • Sep 12, 2024 • Updated last year
Tony-Lowe / RotationDrag
☆35 • Jan 23, 2024 • Updated 2 years ago
sjenni / temporal-ssl
Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.
☆49 • Mar 18, 2021 • Updated 5 years ago
bpiyush / TestOfTime
Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time
☆46 • Jun 11, 2024 • Updated 2 years ago
jianzongwu / MotionBooth
[NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
☆138 • Oct 8, 2024 • Updated last year