Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)
☆56May 8, 2025Updated 11 months ago
Alternatives and similar repositories for pisa-experiments
Users that are interested in pisa-experiments are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆293Apr 2, 2026Updated last month
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆192Jan 30, 2026Updated 3 months ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆64Jul 31, 2025Updated 9 months ago
- MR. Video: MapReduce is the Principle for Long Video Understanding☆31Apr 23, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆173Jan 6, 2025Updated last year
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Mar 31, 2026Updated last month
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆158Oct 25, 2024Updated last year
- Benchmarking physical understanding in generative video models☆287Updated this week
- [ICLR 2026][Ultra Fast&Powerful Diffusion RL] Reinforcing Diffusion Models by Direct Group Preference Optimization☆66Apr 25, 2026Updated last week
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆134May 16, 2025Updated 11 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆273Sep 22, 2025Updated 7 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆50Aug 5, 2025Updated 9 months ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆57Feb 2, 2026Updated 3 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆314Mar 12, 2025Updated last year
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆659Jul 1, 2025Updated 10 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆187Mar 6, 2026Updated 2 months ago
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆457Sep 24, 2025Updated 7 months ago
- ☆29Apr 8, 2025Updated last year
- [ICLR2026] Video-GPT via Next Clip Diffusion.☆45Jun 2, 2025Updated 11 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆242Feb 13, 2026Updated 2 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆317Oct 12, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Source code for my homepage.☆15Apr 24, 2026Updated last week
- Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…☆283Dec 23, 2025Updated 4 months ago
- [ICLR 2026] NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics☆130Mar 17, 2026Updated last month
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆806Feb 10, 2026Updated 2 months ago
- ☆38Dec 16, 2025Updated 4 months ago
- [CVPR-2025] GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding☆43Aug 15, 2025Updated 8 months ago
- ☆24May 23, 2025Updated 11 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆225Apr 13, 2026Updated 3 weeks ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆47Mar 25, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Massive Multi-Discipline Lecture Understanding Benchmark☆34Apr 20, 2026Updated 2 weeks ago
- ☆22Sep 26, 2024Updated last year
- ☆55Sep 21, 2025Updated 7 months ago
- [CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video☆73Mar 22, 2026Updated last month
- Finetuning Offline World Models in the Real World☆66Oct 25, 2023Updated 2 years ago
- A simple example for using `DDIMInverseScheduler` for inverting an input image to StableDiffusion's latent space☆88Mar 16, 2026Updated last month
- Official implementation of "Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals" (NeurIPS 202…☆155Apr 29, 2026Updated last week