WorldModelBench-Team / WorldModelBench
☆21Updated 2 months ago
Alternatives and similar repositories for WorldModelBench
Users that are interested in WorldModelBench are comparing it to the libraries listed below
Sorting:
- A list of works on video generation towards world model☆58Updated last week
- ☆47Updated 5 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆81Updated 3 weeks ago
- Sora Generates Videos with Stunning Geometrical Consistency☆49Updated last year
- ☆15Updated last month
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆99Updated last month
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆38Updated 3 weeks ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 6 months ago
- A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.☆29Updated last month
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆73Updated 2 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆48Updated last month
- Memory Efficient Training Framework for Large Video Generation Model☆25Updated last year
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆40Updated last month
- ☆39Updated last year
- ☆61Updated 5 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated 6 months ago
- ☆29Updated 5 months ago
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation☆21Updated 2 weeks ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆59Updated 2 months ago
- ☆126Updated 4 months ago
- ☆80Updated last month
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆67Updated 2 months ago
- VideoAuteur: Towards Long Narrative Video Generation☆38Updated 4 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆60Updated 7 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆77Updated last year
- ☆30Updated 3 months ago
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025)☆38Updated 2 weeks ago
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Updated last year
- Program synthesis for 3D spatial reasoning☆31Updated 2 months ago
- Code for paper Background Prompting for Improved Object Depth☆29Updated last year