[NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models
☆164Jan 7, 2026Updated last month
Alternatives and similar repositories for VideoREPA
Users that are interested in VideoREPA are comparing it to the libraries listed below
Sorting:
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 4 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- [CVPR’25] PIVRG & ConsMTL☆21Oct 21, 2025Updated 4 months ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆62Jul 31, 2025Updated 7 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆15Jun 1, 2025Updated 9 months ago
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆79Jul 10, 2025Updated 7 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆35May 23, 2024Updated last year
- [ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?☆221Dec 15, 2025Updated 2 months ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆629Feb 3, 2026Updated last month
- ☆53Dec 10, 2025Updated 2 months ago
- ☆16May 27, 2025Updated 9 months ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image☆22Sep 15, 2025Updated 5 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆71Jul 13, 2025Updated 7 months ago
- ☆122Feb 4, 2026Updated last month
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆53May 8, 2025Updated 9 months ago
- ☆66Jul 8, 2025Updated 7 months ago
- [CVPR-2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding☆37Aug 15, 2025Updated 6 months ago
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 5 months ago
- Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling☆30Dec 3, 2025Updated 3 months ago
- ☆39Oct 29, 2025Updated 4 months ago
- [CVPR 2026] TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models☆64Feb 21, 2026Updated last week
- [NeurIPS 2024 Spotlight (Top 2.5%🏆)] PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders☆51Sep 1, 2025Updated 6 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆116Feb 22, 2026Updated last week
- Official implementation for the paper "Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI"☆40Oct 26, 2025Updated 4 months ago
- ☆16Oct 12, 2025Updated 4 months ago
- ☆30Dec 18, 2025Updated 2 months ago
- Minute-long video generation at 24FPS.☆50Feb 2, 2026Updated last month
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆300Apr 23, 2025Updated 10 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆88Feb 15, 2025Updated last year
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking☆56Jan 5, 2025Updated last year
- [ICML 2025] Diff-MoE: Diffusion Transformer with Time-Aware and Space-Adaptive Experts☆26Nov 10, 2025Updated 3 months ago
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing☆25Nov 20, 2025Updated 3 months ago
- Galaxea's first diffusion policy release☆38Aug 18, 2025Updated 6 months ago
- [ICRA 2026] 🌠 DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation☆29Jan 14, 2026Updated last month
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 7 months ago
- Open-Pandora: On-the-fly Control Video Generation☆35Nov 28, 2024Updated last year
- Towards Scalable Pre-training of Visual Tokenizers for Generation☆445Dec 16, 2025Updated 2 months ago
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆134Apr 4, 2025Updated 11 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆267Feb 8, 2026Updated 3 weeks ago