SHI-Labs / physical-ai-bench
PAI-Bench: A Comprehensive Benchmark for Physical AI
☆43Updated 2 months ago
Alternatives and similar repositories for physical-ai-bench
Users who are interested in physical-ai-bench compare it to the repositories listed below.
- Official implementation of the paper "Transfer between Modalities with MetaQueries"☆296Updated 3 months ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆98Updated 11 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆53Updated 8 months ago
- Visual Spatial Tuning☆171Updated this week
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?☆144Updated 11 months ago
- ☆162Updated last year
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Updated 11 months ago
- [NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"☆253Updated last month
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆172Updated last month
- [CVPR 2025] Program synthesis for 3D spatial reasoning☆56Updated 7 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆88Updated last year
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆60Updated last month
- A list of works on video generation towards world model☆334Updated this week
- [NeurIPS 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆126Updated 6 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆209Updated last month
- [ICML 2025] Code and data for the paper "Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation"☆148Updated last year
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆157Updated 3 weeks ago
- Cambrian-S: Towards Spatial Supersensing in Video☆482Updated last month
- ☆38Updated 11 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆86Updated 11 months ago
- ☆66Updated 2 months ago
- ☆51Updated 5 months ago
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆246Updated last week
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆137Updated 5 months ago
- Public release of the code for "Accelerating Vision Transformers with Adaptive Patches"☆90Updated 2 months ago
- ☆115Updated 2 months ago
- ☆40Updated 7 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆94Updated 2 years ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Updated 3 weeks ago
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆75Updated 3 weeks ago