β47Aug 26, 2025Updated 8 months ago
Alternatives and similar repositories for embodied-videoagent
Users that are interested in embodied-videoagent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- γICLR 2025 π₯γMMKE-Bench, a challenging benchmark for evaluating diverse semantic editing in real-world scenarios.β22Apr 19, 2025Updated last year
- β55Oct 3, 2024Updated last year
- β258Aug 6, 2025Updated 9 months ago
- A powerful automation agent for macOS that enables natural language control of various system applications and services. This agent allowβ¦β60Jun 5, 2025Updated 11 months ago
- β12Dec 4, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β30Jun 19, 2024Updated last year
- An automatic pipeline to generate highquality BIMs by learning floor plan arrangements using a graph neural network. -SIGGRAPH 2023β39Jul 18, 2023Updated 2 years ago
- FSD Tesla Open-source. Real-Time Environment Reconstruction System for Autonomous Vehiclesβ20Jan 8, 2026Updated 4 months ago
- β13Nov 5, 2024Updated last year
- β14Jan 5, 2022Updated 4 years ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awarenessβ69Jul 22, 2025Updated 9 months ago
- β13Apr 24, 2023Updated 3 years ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Groundingβ223Apr 21, 2025Updated last year
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Modelsβ54Jun 12, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for ICML2018 - Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction.β36Apr 20, 2019Updated 7 years ago
- β17Apr 17, 2025Updated last year
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptioβ¦β86Jan 5, 2026Updated 4 months ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"β76Apr 7, 2026Updated last month
- [AAAI 2025] Official data and code for "TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances"β15Sep 11, 2025Updated 8 months ago
- β14Jul 23, 2024Updated last year
- Code accompanying paper "SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation"β26May 8, 2026Updated last week
- [ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generationβ11Aug 13, 2024Updated last year
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understandingβ61Mar 16, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Theano implementation of neuraltalk code by karpathy (https://github.com/karpathy/neuraltalk)β11Jul 9, 2015Updated 10 years ago
- β13Nov 7, 2021Updated 4 years ago
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025β16Dec 25, 2025Updated 4 months ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pixβ¦β12Jun 23, 2018Updated 7 years ago
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignmentβ37Oct 5, 2025Updated 7 months ago
- PyTorch implementation of the paper: CASAGPT: Cuboid Arrangement and Scene Assembly for Interior Design [CVPR 2025]β14Apr 5, 2025Updated last year
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023β11Oct 5, 2023Updated 2 years ago
- Code for the SIGGRAPH Asia 2025 paper Gradient-Weighted Feature Back-Projection: A Fast Alternative to Feature Distillation in 3D Gaussiaβ¦β22Dec 23, 2025Updated 4 months ago
- β78Apr 3, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [NeurIPS 2024] MSR3D: Advanced Situated Reasoning in 3D Scenesβ72Dec 2, 2025Updated 5 months ago
- This repository is an implementation of the ICCV 2025 paper "LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models β¦β27Apr 23, 2026Updated 3 weeks ago
- Skipping Recurrent Neural Networksβ13Aug 3, 2018Updated 7 years ago
- Independent implementation of DBCA method from http://arxiv.org/abs/1912.09713β11Nov 25, 2020Updated 5 years ago
- β14Jul 11, 2024Updated last year
- β36May 29, 2025Updated 11 months ago
- Xfce Desktop container designed for direct access to the GPU with EGL using VirtualGL for GPUs. Does not require /tmp/.X11-unix host sockβ¦β10Jul 25, 2022Updated 3 years ago