Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"
☆63Jan 19, 2026Updated last month
Alternatives and similar repositories for RoboTracer
Users that are interested in RoboTracer are comparing it to the libraries listed below
Sorting:
- Official Implementation of "Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach"☆29Dec 3, 2025Updated 3 months ago
- ☆18Sep 25, 2025Updated 5 months ago
- Documents for fourierN1☆20Dec 29, 2025Updated 2 months ago
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆233Dec 16, 2025Updated 2 months ago
- Reasoning in Space via Grounding in the World☆50Nov 3, 2025Updated 4 months ago
- [World-Model-Survey-2024] Paper list and projects for World Model☆15Oct 31, 2024Updated last year
- [ICLR 2025] Layout-Your-3D: Controllable and Precise 3D Generation with 2D Blueprint☆19Dec 22, 2025Updated 2 months ago
- [NeurIPS 2025] AutoSeg3D, online real-time 3D segmentation as instance tracking with long-short term query memory for embodied perception☆42Dec 18, 2025Updated 2 months ago
- ☆35Jul 19, 2025Updated 7 months ago
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 9 months ago
- ☆28Aug 6, 2024Updated last year
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces☆88Jun 6, 2025Updated 8 months ago
- Code for the paper "DraftRec: Personalized Draft Recommendation for Winning in Multi-Player Online Battle Arena Games" (WWW 2022)☆18Aug 11, 2023Updated 2 years ago
- [CVPR 2025] Program synthesis for 3D spatial reasoning☆58Jun 16, 2025Updated 8 months ago
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".☆51Feb 25, 2026Updated last week
- Official PyTorch implementation of the paper ‘CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Und…☆55Apr 25, 2024Updated last year
- ☆50Sep 18, 2025Updated 5 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ICCV2023, NeRDF: Efficient View Synthesis with Neural Radiance Distribution Field☆23Dec 4, 2023Updated 2 years ago
- PanSt3R: Multi-view Consistent Panoptic Segmentation (official code)☆58Feb 13, 2026Updated 2 weeks ago
- FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.☆62Dec 9, 2025Updated 2 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆226Oct 17, 2025Updated 4 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆433Feb 25, 2026Updated last week
- ☆81Jan 11, 2026Updated last month
- VisPlay: Self-Evolving Vision-Language Models☆47Feb 25, 2026Updated last week
- Implementation of Discovery of Complex Behaviors Through Contact-Invariant Optimization☆28Dec 10, 2015Updated 10 years ago
- Scaling Spatial Intelligence with Multimodal Foundation Models☆177Feb 6, 2026Updated 3 weeks ago
- Official code for paper: N3D-VLM: Native 3D Grounding Enables Accurate Spatial Reasoning in Vision-Language Models☆87Jan 14, 2026Updated last month
- ☆55Feb 2, 2026Updated last month
- ☆88Feb 14, 2026Updated 2 weeks ago
- [AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks☆40Nov 24, 2025Updated 3 months ago
- ☆84Oct 4, 2025Updated 5 months ago
- ☆35Apr 4, 2024Updated last year
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Sep 10, 2020Updated 5 years ago
- Orient Anything V2, NeurIPS 2025 Spotlight☆202Jan 19, 2026Updated last month
- ☆89Sep 23, 2025Updated 5 months ago
- This repo contains code for Ctrl-Room.☆40Sep 26, 2025Updated 5 months ago
- [ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping☆11Feb 7, 2025Updated last year
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆291Jan 6, 2026Updated last month