[ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".
☆36Mar 5, 2026Updated 2 months ago
Alternatives and similar repositories for ArtHOI
Users that are interested in ArtHOI are comparing it to the libraries listed below.
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM☆71Mar 18, 2026Updated 2 months ago
- [CVPR 2025 Oral] FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video☆74Mar 22, 2026Updated 2 months ago
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.☆48Oct 16, 2024Updated last year
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆59Mar 13, 2026Updated 2 months ago
- OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models☆20Feb 20, 2025Updated last year
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆50Mar 24, 2026Updated 2 months ago
- Official implementation of paper "Stratified Avatar Generation from Sparse Observations"☆27Aug 30, 2024Updated last year
- [ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)☆48Apr 7, 2026Updated last month
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆47Mar 26, 2026Updated 2 months ago
- Open Ended Medical Reinforcement Learning☆55Mar 15, 2026Updated 2 months ago
- ☆73Apr 14, 2026Updated last month
- 🎥 A website-based video player for sharing videos remotely and synchronously.☆13Jul 11, 2022Updated 3 years ago
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆86May 21, 2026Updated last week
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆104Feb 9, 2026Updated 3 months ago
- [CVPR 2025] Instruct-4DGS: Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation☆31Sep 21, 2025Updated 8 months ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆39Feb 4, 2026Updated 3 months ago
- ☆36Jan 30, 2026Updated 3 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆100Mar 15, 2026Updated 2 months ago
- [ICML 2026] The official implementation of paper "Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation…☆72Updated this week
- [RSS 2026] The first framework enabling humanoid robots to learn whole-body loco-manipulation from egocentric human demos☆140Apr 10, 2026Updated last month
- ☆44May 15, 2026Updated 2 weeks ago
- The official implementation of “MonoArt: Progressive Structural Reasoning for Monocular Articulated 3D Reconstruction”☆63Mar 20, 2026Updated 2 months ago
- SPAgent, a foundation agent for understanding, reasoning over, and operating within the physical and spatial world.☆185May 20, 2026Updated last week
- The official code of our CVPR 2024 paper, "3D Human Pose Perception from Egocentric Stereo Videos".☆27Dec 12, 2025Updated 5 months ago
- [ACL 2026 Findings] "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆62Updated this week
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 4 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated 2 months ago
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models☆19Jun 12, 2024Updated last year
- ☆25Jul 22, 2025Updated 10 months ago
- TBD☆57Mar 13, 2026Updated 2 months ago
- ☆40Oct 29, 2025Updated 7 months ago
- Green-VLA: Staged Vision-Language-Action Model for Generalist Robots☆133Mar 5, 2026Updated 2 months ago
- ☆69Feb 6, 2026Updated 3 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆58Mar 25, 2026Updated 2 months ago
- [CVPR 2025] Official implementation of PhysFlow: Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dyna…☆38Jul 28, 2025Updated 10 months ago
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆51Jun 30, 2025Updated 10 months ago
- Official Implementation of ARM4R ICML 2025☆53Sep 18, 2025Updated 8 months ago
- Official implementation of Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse Videos☆52May 2, 2026Updated 3 weeks ago
- [ICLR 2025] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation☆69Mar 19, 2025Updated last year