jetteezhou / PhysVLMView external linksLinks
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability
☆37Mar 18, 2025Updated 10 months ago
Alternatives and similar repositories for PhysVLM
Users that are interested in PhysVLM are comparing it to the libraries listed below
Sorting:
- Source code of "Point Set Voting for Partial Point Clouds Analysis"☆14Jan 5, 2021Updated 5 years ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆16Oct 27, 2024Updated last year
- ☆20Jul 5, 2024Updated last year
- [RA-L + IROS2024] Learning to place unseen objects stably using large-scale simulation☆21Jun 30, 2024Updated last year
- The paper list of multilingual pre-trained models (Continual Updated).☆24Jun 18, 2024Updated last year
- Click to Grasp takes calibrated RGB-D images of a tabletop and user-defined part instances in diverse source images as input, and produce…☆21Apr 4, 2024Updated last year
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆83Jan 21, 2026Updated 3 weeks ago
- point cloud viz☆30Dec 16, 2023Updated 2 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- [NeurIPS 2025] A multimodal agent that can interact with its own PC in a multimodal manner.☆36Nov 10, 2025Updated 3 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- A task sequencer framework for achieving a GPT-to-action system in robotics.☆16Mar 6, 2025Updated 11 months ago
- [NeurIPS 2025] VIKI‑R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning☆72Dec 14, 2025Updated 2 months ago
- XmodelLM☆38Nov 19, 2024Updated last year
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆28Oct 23, 2025Updated 3 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- ☆11Jun 22, 2025Updated 7 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 4 months ago
- [ICRA 2025] Next Best Sense: Autonomously reconstructing a 3D Gaussian Splatting scene for robotic manipulators.☆50Feb 1, 2025Updated last year
- [ICCV 2023] Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models☆44Jul 30, 2024Updated last year
- [ICPR 2022] Data, code and pretrained models for Deep Surface Reconstruction from Point Clouds with Visibility Information☆38Dec 7, 2022Updated 3 years ago
- ☆12Jun 11, 2025Updated 8 months ago
- Integrating opencv with mujoco.☆11Mar 25, 2025Updated 10 months ago
- ☆14Mar 20, 2025Updated 10 months ago
- A Practical Zoom-in GUI Grounding and Behavior-Based Evaluation method.☆19Dec 8, 2025Updated 2 months ago
- ☆11Dec 27, 2022Updated 3 years ago
- 使用Qt+librviz+ros设计点云显示界面☆11Jan 5, 2022Updated 4 years ago
- ☆20Jul 29, 2025Updated 6 months ago
- ☆16Sep 17, 2024Updated last year
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆19Jul 3, 2025Updated 7 months ago
- Official Implementation of HIMA (COLM'25)☆19Nov 25, 2025Updated 2 months ago
- ☆14Apr 14, 2025Updated 10 months ago
- ☆11Dec 15, 2025Updated last month
- Continuous Pipelined Speculative Decoding☆16Jan 4, 2026Updated last month
- ☆24Aug 19, 2025Updated 5 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆22Jun 23, 2025Updated 7 months ago
- ☆12Apr 1, 2025Updated 10 months ago
- [CVPR 2024] Dataset and Code for "Language-driven Grasp Detection."☆48Feb 9, 2025Updated last year