Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)
☆55May 8, 2025Updated 11 months ago
Alternatives and similar repositories for pisa-experiments
Users that are interested in pisa-experiments are comparing it to the libraries listed below.
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆288Apr 2, 2026Updated 2 weeks ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆191Jan 30, 2026Updated 2 months ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆64Jul 31, 2025Updated 8 months ago
- MR. Video: MapReduce is the Principle for Long Video Understanding☆31Apr 23, 2025Updated 11 months ago
- ☆174Jan 6, 2025Updated last year
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Mar 31, 2026Updated 2 weeks ago
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆157Oct 25, 2024Updated last year
- Benchmarking physical understanding in generative video models☆273Apr 8, 2026Updated last week
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆133May 16, 2025Updated 11 months ago
- [NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆50Aug 5, 2025Updated 8 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆312Mar 12, 2025Updated last year
- [CVPR 2026 Oral] PAI-Bench: A Comprehensive Benchmark for Physical AI☆64Apr 9, 2026Updated last week
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆648Jul 1, 2025Updated 9 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆183Mar 6, 2026Updated last month
- ☆28Apr 8, 2025Updated last year
- [ICLR2026] Video-GPT via Next Clip Diffusion.☆44Jun 2, 2025Updated 10 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆234Feb 13, 2026Updated 2 months ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆35Jun 30, 2025Updated 9 months ago
- ☆68Aug 16, 2024Updated last year
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆316Oct 12, 2025Updated 6 months ago
- VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization☆20Jan 17, 2025Updated last year
- [ICLR 2026] NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics☆127Mar 17, 2026Updated 3 weeks ago
- Source code for my homepage.☆15Nov 25, 2025Updated 4 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆448Sep 24, 2025Updated 6 months ago
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆276Dec 23, 2025Updated 3 months ago
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆766Feb 10, 2026Updated 2 months ago
- ☆37Dec 16, 2025Updated 4 months ago
- [CVPR-2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding☆42Aug 15, 2025Updated 8 months ago
- ☆24May 23, 2025Updated 10 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆221Oct 12, 2025Updated 6 months ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆47Mar 25, 2026Updated 3 weeks ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆34Nov 1, 2025Updated 5 months ago
- ☆22Sep 26, 2024Updated last year
- ☆55Sep 21, 2025Updated 6 months ago
- [CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video☆71Mar 22, 2026Updated 3 weeks ago
- Finetuning Offline World Models in the Real World☆66Oct 25, 2023Updated 2 years ago
- A simple example for using `DDIMInverseScheduler` for inverting an input image to StableDiffusion's latent space☆88Mar 16, 2026Updated last month