jetteezhou / PhysVLM
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability
☆34 · Updated 9 months ago
Alternatives and similar repositories for PhysVLM
Users interested in PhysVLM are comparing it to the repositories listed below.
- InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation ☆90 · Updated 3 months ago
- [CoRL 2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` ☆121 · Updated last year
- ✨✨ [NeurIPS 2025] Official implementation of BridgeVLA ☆163 · Updated 3 months ago
- ☆64 · Updated last year
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" ☆116 · Updated 4 months ago
- ☆130 · Updated 3 months ago
- [NeurIPS 2025] VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning ☆65 · Updated 3 weeks ago
- ☆126 · Updated 4 months ago
- Official implementation of the paper "WMPO: World Model-based Policy Optimization for Vision-Language-Action Models" ☆96 · Updated last week
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning ☆113 · Updated 3 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ☆79 · Updated 7 months ago
- Official implementation of "Chain-of-Action: Trajectory Autoregressive Modeling for Robotic Manipulation", accepted at NeurIPS 2025 ☆93 · Updated 3 weeks ago
- Official repository for SAM2Act ☆219 · Updated 4 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization ☆154 · Updated 9 months ago
- F1: A Vision Language Action Model Bridging Understanding and Generation to Actions ☆153 · Updated last week
- ICCV 2025 ☆145 · Updated last month
- Official repository of LIBERO-plus, a generalized benchmark for in-depth robustness analysis of vision-language-action models ☆167 · Updated 3 weeks ago
- MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation ☆56 · Updated 2 months ago
- ☆86 · Updated 3 months ago
- ☆79 · Updated 4 months ago
- ☆63 · Updated 10 months ago
- [NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation ☆218 · Updated 6 months ago
- [IROS 2024 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models ☆98 · Updated last year
- The repo of the paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation` ☆148 · Updated last year
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning" ☆205 · Updated 7 months ago
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge ☆273 · Updated this week
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation ☆130 · Updated 4 months ago
- ☆89 · Updated last year
- Code for the CoRL 2025 paper "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation" ☆42 · Updated last month
- [CVPR 2025] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation ☆93 · Updated 7 months ago