seervideodiffusion / SeerVideoLDM
[ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models
☆31Updated 11 months ago
Alternatives and similar repositories for SeerVideoLDM
Users that are interested in SeerVideoLDM are comparing it to the libraries listed below
Sorting:
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆48Updated last week
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated 2 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆86Updated last year
- Official code for MotionBench (CVPR 2025)☆37Updated 2 months ago
- ☆22Updated 6 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆29Updated this week
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment☆76Updated 3 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆82Updated last week
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆103Updated 6 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆99Updated last week
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆112Updated 7 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆50Updated 10 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆67Updated 2 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆68Updated 3 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu