facebookresearch / IntPhys2

This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning models.

☆78

Alternatives and similar repositories for IntPhys2

Users who are interested in IntPhys2 are comparing it to the libraries listed below.

facebookresearch / jepa-intuitive-physics
This repo contains the code for the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos"
☆187 Updated 8 months ago
myscience / open-genie
Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
☆208 Updated last year
google-deepmind / physics-IQ-benchmark
Benchmarking physical understanding in generative video models
☆205 Updated 2 weeks ago
gaoyuezhou / dino_wm
☆283 Updated 6 months ago
lucidrains / dreamer4
Implementation of Danijar's latest iteration for his Dreamer line of work
☆78 Updated this week
NVlabs / TokenBench
A Video Tokenizer Evaluation Dataset
☆135 Updated 9 months ago
Little-Podi / AdaWorld
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
☆166 Updated 4 months ago
lorenmt / clarity-template
Clarity: A Minimalist Website Template for AI Research
☆149 Updated 9 months ago
nvidia-cosmos / cosmos-predict1
Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…
☆363 Updated last month
wilson1yan / teco
☆121 Updated 7 months ago
NVIDIA / GR00T-Dreams
Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
☆332 Updated last month
nvidia-cosmos / cosmos-predict2
Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…
☆633 Updated last week
phyworld / phyworld
☆143 Updated 9 months ago
video-language-planning / vlp_code
☆77 Updated 4 months ago
bdaiinstitute / theia
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
☆253 Updated 6 months ago
USC-GVL / PhysBench
[ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …
☆72 Updated 4 months ago
allenai / molmoact
Official Repository for MolmoAct
☆212 Updated last month
GenEx-world / genex
Generative World Explorer
☆157 Updated 4 months ago
thuml / iVideoGPT
Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223
☆153 Updated 3 weeks ago
behavior-vision-suite / behavior-vision-suite.github.io
☆167 Updated 7 months ago
LostXine / LLaRA
[ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
☆225 Updated 6 months ago
LargeWorldModel / ElasticTok
ElasticTok: Adaptive Tokenization for Image and Video
☆80 Updated 11 months ago
akanazawa / fpo
Implementation of Flow Policy Optimization (FPO)
☆257 Updated 3 weeks ago
DannyTran123 / egopet
Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".
☆27 Updated last year
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆385 Updated 8 months ago
a1600012888 / LaCT
Code release for paper "Test-Time Training Done Right"
☆295 Updated last month
pairlab / SlotFormer
Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models
☆114 Updated 2 years ago
lucidrains / TRI-LBM
Implementation of the Large Behavioral Model architecture for dexterous manipulation from Toyota Research Institute
☆66 Updated last month
JeffWang987 / EgoVid
[Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
☆119 Updated 2 months ago
embodiedreasoning / ERQA
Embodied Reasoning Question Answer (ERQA) Benchmark
☆229 Updated 7 months ago