This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"
☆244Feb 17, 2025Updated last year
Alternatives and similar repositories for jepa-intuitive-physics
Users that are interested in jepa-intuitive-physics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,616Feb 27, 2025Updated last year
- Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset☆425Jun 3, 2025Updated 9 months ago
- A repository for paper Joint Embedding Predictive Architectures Focus on Slow Features☆25Oct 27, 2022Updated 3 years ago
- Benchmarking physical understanding in generative video models☆267Feb 2, 2026Updated last month
- Experiments in Joint Embedding Predictive Architectures (JEPAs).☆48Jan 5, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [WACV 2024] Code release for "VEATIC: Video-based Emotion and Affect Tracking in Context Dataset"☆21Jan 14, 2026Updated 2 months ago
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)☆26Nov 27, 2024Updated last year
- Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image☆12May 10, 2025Updated 10 months ago
- Implementation of I-JEPA from "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture"☆284Jan 3, 2025Updated last year
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆2,212Mar 12, 2026Updated 2 weeks ago
- [NeurIPS 2023] Self-supervised Object-Centric Learning for Videos☆32Nov 28, 2024Updated last year
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆3,309Mar 17, 2026Updated last week
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆131Jul 24, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A Shortcut-aware Video-QA Benchmark for Physical Understanding via Minimal Video Pairs☆36Sep 22, 2025Updated 6 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆87Jan 21, 2026Updated 2 months ago
- Personal reading list for learning-based long-horizon goal reaching methods☆17Nov 26, 2020Updated 5 years ago
- Official PyTorch codebase for the Modeling Caption Diversity in ContrastiveVision-Language Pretraining paper.☆18Mar 28, 2025Updated 11 months ago
- ☆171Jan 6, 2025Updated last year
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year
- ☆15May 4, 2025Updated 10 months ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆15Feb 13, 2026Updated last month
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆95May 17, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Memory-Bounded GPU Acceleration for Vector Search☆33Dec 29, 2025Updated 2 months ago
- ☆26Oct 15, 2024Updated last year
- [T-RO '25, ICRA '23] Official repository for AUTO-IceNav: A Local Navigation Strategy for Autonomous Surface Ships in Broken Ice Fields☆16Oct 24, 2025Updated 5 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Jun 7, 2024Updated last year
- GUI to compute and explore receptive fields, primarily from calcium imaging recordings☆13Jun 26, 2021Updated 4 years ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆153May 2, 2025Updated 10 months ago
- ☆52Dec 13, 2024Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆39Dec 2, 2025Updated 3 months ago
- Official Implementation of wd1☆24Sep 25, 2025Updated 6 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆95Nov 13, 2025Updated 4 months ago
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Nov 20, 2024Updated last year
- The official UniVerse-1 code.☆123Oct 13, 2025Updated 5 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆109Mar 19, 2025Updated last year
- ☆63Jul 1, 2025Updated 8 months ago
- Library that provides metrics to assess representation quality☆26Feb 5, 2025Updated last year
- ROSA-Tuning☆71Feb 4, 2026Updated last month