vision-x-nyu/pisa-experiments

Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)

☆59

Alternatives and similar repositories for pisa-experiments

Users who are interested in pisa-experiments are comparing it to the repositories listed below.

Hritikbansal / videophy
Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics
☆207 · Jan 30, 2026 · Updated 5 months ago
minnie-lin / Awesome-Physics-Cognition-based-Video-Generation
A comprehensive list of work on physical cognition in video generation, including papers, code, and related websites.
☆321 · Jun 23, 2026 · Updated last month
zeyofu / Commonsense-T2I
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
☆24 · Aug 13, 2024 · Updated last year
google-deepmind / physics-IQ-benchmark
Benchmarking physical understanding in generative video models
☆323 · Jun 22, 2026 · Updated last month
pittisl / PhyT2V
Official code repo of the CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
☆68 · Jul 31, 2025 · Updated 11 months ago
360CVGroup / WISA
World Simulator Assistant for Physics-Aware Text-to-Video Generation
☆278 · Sep 22, 2025 · Updated 10 months ago
zlab-princeton / UEval
UEval: A Benchmark for Unified Multimodal Generation
☆24 · Apr 20, 2026 · Updated 3 months ago
phyworld / phyworld
☆175 · Jan 6, 2025 · Updated last year
OpenGVLab / PhyGenBench
[ICML 2025] Code and data for the paper "Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation"
☆163 · Oct 25, 2024 · Updated last year
SHI-Labs / physical-ai-bench
[CVPR 2026 Oral] PAI-Bench: A Comprehensive Benchmark for Physical AI
☆91 · Jun 23, 2026 · Updated last month
tang-bd / fuse-dit
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
☆140 · May 16, 2025 · Updated last year
wenhaochai / claude-plugins
Personal Claude Code plugin marketplace
☆16 · Updated this week
brown-palm / force-prompting
Official implementation of "Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals" (NeurIPS 202…
☆161 · Apr 29, 2026 · Updated 2 months ago
solaris-wm / solaris-engine
Scalable Minecraft multiplayer data collection engine
☆139 · Updated this week
aHapBean / VideoREPA
[NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models
☆196 · Mar 6, 2026 · Updated 4 months ago
mihirp1998 / VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…
☆315 · Mar 12, 2025 · Updated last year
pandayuanyu / NewtonGen
[ICLR 2026] NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics
☆145 · Jun 9, 2026 · Updated last month
xizaoqu / MOFT
[NeurIPS 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller
☆51 · Aug 5, 2025 · Updated 11 months ago
vision-x-nyu / vstat
Evaluation code for "Benchmarking Visual State Tracking in Multimodal Video Understanding"
☆38 · Jun 3, 2026 · Updated last month
KlingAIResearch / VideoAlign
[NeurIPS 2025] Improving Video Generation with Human Feedback
☆487 · Sep 24, 2025 · Updated 10 months ago
ziqipang / MR-Video
MR. Video: MapReduce is the Principle for Long Video Understanding
☆31 · Jun 18, 2026 · Updated last month
cvlab-stonybrook / NewtonRewards
☆16 · Jul 17, 2026 · Updated last week
Jialuo-Li / Science-T2I
[CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis
☆62 · Mar 31, 2026 · Updated 3 months ago
solaris-wm / solaris
The first multiplayer video world model in Minecraft
☆219 · Mar 3, 2026 · Updated 4 months ago
freemty / labmate
Research Harness for Claude Code. Keep your agent grounded in context, not lost in vibe coding.
☆25 · Jun 23, 2026 · Updated last month
ZitengWangNYU / Scale-RAE
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
☆255 · Feb 13, 2026 · Updated 5 months ago
SihanXU / nepa
PyTorch implementation of NEPA
☆338 · Feb 9, 2026 · Updated 5 months ago
NJU-PCALab / OpenVid-1M
[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
☆452 · May 30, 2025 · Updated last year
yliu-cs / PiTe
[ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model
☆17 · Feb 13, 2025 · Updated last year
xizaoqu / WorldMem
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
☆381 · Feb 21, 2026 · Updated 5 months ago
BestJunYu / Awesome-Physics-aware-Generation
Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and…
☆296 · Dec 23, 2025 · Updated 7 months ago
XinyaChen21 / TeFF
☆22 · Sep 26, 2024 · Updated last year
kwsong0113 / diffusion-forcing-transformer
[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"
☆705 · Jul 1, 2025 · Updated last year
RERV / VDT
[ICLR 2024] The official implementation of the paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxi…
☆256 · May 5, 2024 · Updated 2 years ago
thuml / MiniVeo3-Reasoner
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…
☆230 · Apr 13, 2026 · Updated 3 months ago
WayneJin0918 / SOTA-paper-rating.io
A tiny paper-rating website
☆41 · Mar 19, 2025 · Updated last year
facebookresearch / metaquery
Official implementation of the paper "Transfer between Modalities with MetaQueries"
☆325 · Oct 12, 2025 · Updated 9 months ago
willisma / diffuse_nnx
A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its vari…
☆152 · Oct 16, 2025 · Updated 9 months ago
facebookresearch / metamorph
Code for "MetaMorph: Multimodal Understanding and Generation via Instruction Tuning"
☆235 · Jan 22, 2026 · Updated 6 months ago