IsshikiHugh / ExpOvenLinks
Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments!
☆83Updated 2 months ago
Alternatives and similar repositories for ExpOven
Users that are interested in ExpOven are comparing it to the libraries listed below
Sorting:
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆43Updated 6 months ago
- ☆116Updated 2 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆206Updated 2 months ago
- Spatial Reasoning with Vision-Language Models☆31Updated last month
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆98Updated 3 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆189Updated last month
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆409Updated 2 weeks ago
- (🛠️ *WIP*) Code snippets for understanding common techniques for virtual humans.☆103Updated last week
- [NeurIPS 2025 Spotlight] MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning☆70Updated 3 months ago
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆27Updated 6 months ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆85Updated 6 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆180Updated last month
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆68Updated last week
- Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆124Updated last month
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆164Updated 2 months ago
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆465Updated 3 weeks ago
- An easy way for debug python for Slurm HPC users.☆28Updated 9 months ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆121Updated 2 months ago
- GenWorld: Towards Detecting AI-generated Real-world Simulation Videos☆34Updated 6 months ago
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆72Updated 3 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆198Updated 8 months ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆23Updated 3 weeks ago
- Thinking in 360°: Humanoid Visual Search in the Wild☆105Updated last month
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆419Updated last month
- A curated list of Awesome 3D Vision, including 3D Gaussian Splatting, SLAM, Neural Radiance Fields.☆22Updated last year
- [TCSVT 2025] Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View☆103Updated 3 weeks ago
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆105Updated 10 months ago
- ☆48Updated last month
- Automatically hold idle GPU.☆78Updated 2 months ago
- Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model"☆42Updated last week