IsshikiHugh / ExpOvenLinks
Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments!
☆85Updated this week
Alternatives and similar repositories for ExpOven
Users that are interested in ExpOven are comparing it to the libraries listed below
Sorting:
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆44Updated 7 months ago
- Collection of forcing related autoregressive video Gen☆43Updated this week
- A curated list of Awesome 3D Vision, including 3D Gaussian Splatting, SLAM, Neural Radiance Fields.☆22Updated last year
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆101Updated 4 months ago
- ☆124Updated 3 months ago
- An easy way for debug python for Slurm HPC users.☆28Updated 10 months ago
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆27Updated 7 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆221Updated 3 months ago
- Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models☆85Updated 3 weeks ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆427Updated last month
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆128Updated 11 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆197Updated 2 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆60Updated 10 months ago
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆497Updated this week
- Unifying 2D and 3D Vision-Language Understanding☆121Updated 6 months ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆49Updated last month
- [CVPR 2024 Highlight] GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding☆27Updated last year
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆203Updated 9 months ago
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆94Updated 7 months ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆128Updated 3 months ago
- Official Code Release of NeurIPS 2025 Paper: HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video☆86Updated 4 months ago
- SPAgent, a spatial intelligence agent designed to operate in the physical and spatial world.☆98Updated 2 weeks ago
- [ICLR26] Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆144Updated 2 weeks ago
- 清华大学飞跃数据库☆31Updated this week
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆212Updated 2 months ago
- Code snippets for understanding common techniques for virtual humans.☆109Updated this week
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆78Updated 2 months ago
- Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.…☆38Updated 2 weeks ago
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆77Updated last month
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆167Updated 4 months ago