IsshikiHugh / ExpOvenLinks
Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments!
☆78Updated 3 months ago
Alternatives and similar repositories for ExpOven
Users that are interested in ExpOven are comparing it to the libraries listed below
Sorting:
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆26Updated 3 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆179Updated last month
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆85Updated last week
- ☆90Updated 2 weeks ago
- A curated list of Awesome 3D Vision, including 3D Gaussian Splatting, SLAM, Neural Radiance Fields.☆22Updated last year
- (🛠️ *WIP*) Code snippets for understanding common techniques for virtual humans.☆72Updated 2 weeks ago
- This project is my attempt at automating work in Notion.☆17Updated last month
- A paper list for spatial reasoning☆143Updated 4 months ago
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆27Updated 4 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆131Updated this week
- Automatically hold idle GPU.☆78Updated last month
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆81Updated 3 months ago
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆374Updated last week
- ☆44Updated 4 months ago
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆59Updated last week
- Unifying 2D and 3D Vision-Language Understanding☆109Updated 2 months ago
- SpatialVID: A Large-Scale Video Dataset with Spatial Annotations☆385Updated this week
- ☆34Updated 2 years ago
- Trace Anything: Representing Any Video in 4D via Trajectory Fields☆175Updated this week
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆106Updated 3 weeks ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆155Updated last week
- [CVPR 2024 Highlight] GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding☆27Updated last year
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆35Updated 3 weeks ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆141Updated last week
- Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆85Updated 2 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆102Updated 6 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 6 months ago
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆246Updated last month
- Code release for paper "Test-Time Training Done Right"☆295Updated last month
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆190Updated 5 months ago