[CVPR 2026] ViStoryBench: AI Story Visualization Benchmark
☆144Mar 4, 2026Updated last month
Alternatives and similar repositories for vistorybench
Users that are interested in vistorybench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Omni Controllable Video Diffusion☆45Dec 22, 2025Updated 3 months ago
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…☆115Feb 10, 2026Updated 2 months ago
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation☆185Dec 1, 2025Updated 4 months ago
- ☆454Aug 10, 2025Updated 8 months ago
- [ICLR 2026] The official implementation of "RegionE: Adaptive Region-Aware Generation for Efficient Image Editing"☆98Feb 3, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation of "NoiseAR: AutoRegressing Initial Noise Prior for Diffusion Models"☆17Jun 3, 2025Updated 10 months ago
- DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models☆181Jan 4, 2026Updated 3 months ago
- ☆53Dec 10, 2025Updated 4 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆313Mar 26, 2025Updated last year
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆2,182Dec 29, 2025Updated 3 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 5 months ago
- Google Scholar自搜小脚本,每次开启命令行即显示当前citation。Small Script displaying current citation count each time the shell is opened.☆21Mar 3, 2025Updated last year
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆112Sep 19, 2025Updated 6 months ago
- Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets☆855Sep 8, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction' (ICCV 2025)☆64Nov 8, 2025Updated 5 months ago
- YOLO-TLP: detected and classified tiny objects with bounding box dimensions smaller than 15 pixels, outperforming other one-stage detecto…☆24Oct 6, 2025Updated 6 months ago
- The heartbeat animation indicates that the BGM is loading, please be patient and wait util the envelope appears.☆32Feb 16, 2026Updated 2 months ago
- Official Repository of "OmniTry: Virtual Try-On Anything without Masks"☆253Aug 29, 2025Updated 7 months ago
- ☆27Jan 28, 2026Updated 2 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆32Apr 20, 2025Updated 11 months ago
- PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning☆337Feb 5, 2026Updated 2 months ago
- The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".☆112Jul 13, 2022Updated 3 years ago
- A clumsy video auto duplication remove and frame interpolate script (mainly for 24fps cfr animation with dup-frames)☆36Apr 9, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆25Jul 3, 2025Updated 9 months ago
- base: https://github.com/Sense-GVT/Fast-BEV , delete time sequence,update mm releated ,add onnx export for tensorrt☆12May 12, 2023Updated 2 years ago
- ☆11Jan 12, 2023Updated 3 years ago
- Spatial Aptitude Training for Multimodal Langauge Models☆27Feb 8, 2026Updated 2 months ago
- This is a summary of recent video frame interpolation (VFI) methods☆30Apr 17, 2023Updated 3 years ago
- The official repository of paper "Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark"☆20Jun 20, 2025Updated 9 months ago
- OmniSVG: A Unified Scalable Vector Graphics Generation Model,you can try it in ComfyUI☆28Dec 5, 2025Updated 4 months ago
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 6 months ago
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆64May 26, 2025Updated 10 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation☆131Feb 23, 2025Updated last year
- 这是一个专为动漫视频优化和剪辑设计的高级抽帧工具。本工具结合多种图像处理与分析算法,能够智能地识别并去除冗余或相似的视频帧,显著优化动画的动态效果,或为动漫 AMV/MAD 创作者提供更高效的补帧与素材处理方案。☆35May 4, 2025Updated 11 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- (CVPR 2026 Highlight) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject co…☆29Apr 9, 2026Updated last week
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"☆42Oct 19, 2025Updated 5 months ago
- Visual Spatial Tuning☆193Mar 25, 2026Updated 3 weeks ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆29Apr 15, 2025Updated last year