lichao-sun / SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
☆497Updated last year
Alternatives and similar repositories for SoraReview:
Users that are interested in SoraReview are comparing it to the libraries listed below
- SEED-Voken: A Series of Powerful Visual Tokenizers☆863Updated last month
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆579Updated 6 months ago
- Implementation of MagViT2 Tokenizer in Pytorch☆599Updated 3 months ago
- 🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).☆458Updated 2 weeks ago
- [CVPR2024 Highlight] VBench - We Evaluate Video Generation☆914Updated this week
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆592Updated 5 months ago
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks☆386Updated 9 months ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆807Updated last year
- [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,348Updated 2 weeks ago
- This repo contains the code for 1D tokenizer and generator☆821Updated 3 weeks ago
- A reading list of video generation☆545Updated last week
- [TMLR 2025🔥] A survey for the autoregressive models in vision.☆504Updated this week
- [ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation☆432Updated 4 months ago
- A collection of awesome video generation studies.