facebookresearch / IntPhys2Links
This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning models.
☆73Updated 3 months ago
Alternatives and similar repositories for IntPhys2
Users that are interested in IntPhys2 are comparing it to the libraries listed below
Sorting:
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆183Updated 7 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆155Updated 3 months ago
- Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).☆201Updated last year
- ☆270Updated 6 months ago
- Benchmarking physical understanding in generative video models☆197Updated 4 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆70Updated 3 months ago
- ☆121Updated 7 months ago
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆146Updated 4 months ago
- A Video Tokenizer Evaluation Dataset☆133Updated 8 months ago
- Clarity: A Minimalist Website Template for AI Research☆143Updated 8 months ago
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆252Updated 5 months ago
- Generative World Explorer☆155Updated 3 months ago
- [ICLR'25] LLaRA: Supercharging Robot Learning Data for Vision-Language Policy☆224Updated 5 months ago
- Nvidia GEAR Lab's initiative to solve the robotics data problem using world models☆309Updated last month
- ☆142Updated 8 months ago
- ☆37Updated 7 months ago
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces☆80Updated 3 months ago
- Scaling Vision Pre-Training to 4K Resolution☆205Updated 3 weeks ago
- ☆78Updated 4 months ago
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆355Updated last month
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆375Updated 8 months ago
- Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆592Updated 3 weeks ago
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆113Updated 2 years ago
- Code for "Scaling Language-Free Visual Representation Learning" paper (Web-SSL).☆185Updated 4 months ago
- Implementation of Flow Policy Optimization (FPO)☆240Updated 3 weeks ago
- Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".☆27Updated last year
- ☆167Updated 7 months ago
- Official Implementation for our NeurIPS 2024 paper, "Don't Look Twice: Run-Length Tokenization for Faster Video Transformers".☆224Updated 5 months ago
- Implementation of the Large Behavioral Model architecture for dexterous manipulation from Toyota Research Institute☆61Updated last week
- ElasticTok: Adaptive Tokenization for Image and Video☆77Updated 10 months ago