[NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models
☆193Mar 6, 2026Updated 3 months ago
Alternatives and similar repositories for VideoREPA
Users that are interested in VideoREPA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆60Apr 24, 2026Updated 2 months ago
- [CVPR’25] PIVRG & ConsMTL☆23Oct 21, 2025Updated 8 months ago
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆80Jul 10, 2025Updated 11 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆13Jun 1, 2025Updated last year
- ☆20Oct 12, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆75Jul 13, 2025Updated 11 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 6 months ago
- ☆71Jul 8, 2025Updated 11 months ago
- ☆138Feb 4, 2026Updated 5 months ago
- [CVPR 2026] TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models☆67Feb 21, 2026Updated 4 months ago
- [ECCV 2026] Freqformer: Image-Demoiréing Transformer via Efficient Frequency Decomposition☆16May 27, 2025Updated last year
- [NeurIPS 2024 Spotlight (Top 2.5%🏆)] PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders☆53Sep 1, 2025Updated 10 months ago
- instruction-following benchmark for large reasoning models☆48Apr 19, 2026Updated 2 months ago
- VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization☆20Jan 17, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆92Feb 15, 2025Updated last year
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆68Jul 31, 2025Updated 11 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆35May 23, 2024Updated 2 years ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention☆685Mar 6, 2026Updated 3 months ago
- The official code repository for the FullFront benchmark☆27May 16, 2025Updated last year
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆57May 8, 2025Updated last year
- [ECCV 2026] Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Trans…☆33Mar 17, 2026Updated 3 months ago
- Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models"☆248Jun 24, 2026Updated last week
- Official implementation for the paper "Aligning Perception, Reasoning, Modeling and Interaction: A Survey on Physical AI"☆64Oct 26, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆93Dec 3, 2024Updated last year
- Open-Pandora: On-the-fly Control Video Generation☆35Nov 28, 2024Updated last year
- [ICLR 2026] Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?☆253Dec 15, 2025Updated 6 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆274Sep 22, 2025Updated 9 months ago
- ☆55Dec 10, 2025Updated 6 months ago
- [ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma…☆69Jul 17, 2025Updated 11 months ago
- Pixel-Space Generative Models☆316May 11, 2025Updated last year
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing☆24Nov 20, 2025Updated 7 months ago
- [ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image☆22Sep 15, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICML 2025] Official Implementation of GLIDER☆74Oct 9, 2025Updated 8 months ago
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆20Jul 16, 2024Updated last year
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆308Apr 23, 2025Updated last year
- [CVPR-2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding☆46Aug 15, 2025Updated 10 months ago
- This repository is the official code for the paper "AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation" (NeurIPS 2024).☆14Sep 17, 2025Updated 9 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆316Jun 23, 2026Updated last week
- Public implementation of Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling☆31Jun 24, 2026Updated last week