IsshikiHugh / ExpOvenLinks
Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments!
β75Updated last month
Alternatives and similar repositories for ExpOven
Users that are interested in ExpOven are comparing it to the libraries listed below
Sorting:
- β85Updated last month
- (π οΈ *WIP*) Code snippets for understanding common techniques for virtual humans.β59Updated 5 months ago
- This project is my attempt at automating work in Notion.β17Updated this week
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".β20Updated 2 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.β80Updated 2 months ago
- InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.β84Updated last month
- [ARXIVβ25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Controlβ76Updated last month
- [NeurIPSβ24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attentionβ26Updated 2 months ago
- β43Updated 2 months ago
- A paper list for spatial reasoningβ134Updated 2 months ago
- Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignmentβ27Updated last month
- From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3Dβ56Updated 3 months ago
- Unifying 2D and 3D Vision-Language Understandingβ100Updated last month
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generationβ128Updated last month
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligenceβ47Updated 3 weeks ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, β¦β187Updated 3 months ago
- OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understandingβ59Updated last month
- A curated list of Awesome 3D Vision, including 3D Gaussian Splatting, SLAM, Neural Radiance Fields.β22Updated last year
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Modelsβ147Updated 3 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'β111Updated last month
- Code release for paper "Test-Time Training Done Right"β272Updated last month
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenesβ97Updated 5 months ago
- β38Updated 5 months ago
- β101Updated 5 months ago
- An ML research template with good documentation by Boyuan Chen, an MIT PhD studentβ80Updated 5 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering andβ¦β56Updated 5 months ago
- GenWorld: Towards Detecting AI-generated Real-world Simulation Videosβ32Updated 2 months ago
- The official implementation of The paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs"β55Updated 3 months ago
- [COLING 2025] Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputsβ51Updated 7 months ago
- SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesisβ35Updated 2 months ago