VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
☆322Jun 18, 2026Updated 2 weeks ago
Alternatives and similar repositories for VLM-FO1
Users that are interested in VLM-FO1 are comparing it to the libraries listed below.
- Rui Qian, Xin Yin, Chuanhang Deng, et al.: UGround: Towards Unified Visual Grounding with Unrolled Transformers (ICML 2026)☆26Jun 18, 2026Updated 2 weeks ago
- [AAAI 2026] Empowering DINO Representations for Underwater Instance Segmentation via Aligner and Prompter☆52Feb 3, 2026Updated 4 months ago
- ☆62Apr 4, 2026Updated 2 months ago
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation☆51Mar 20, 2025Updated last year
- [CVPR2026] Detect Anything via Next Point Prediction☆1,464Feb 22, 2026Updated 4 months ago
- VisualAD: Language-Free Zero-Shot Anomaly Detection via Vision Transformer (CVPR 2026)☆101Jun 7, 2026Updated 3 weeks ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD☆10Mar 31, 2021Updated 5 years ago
- ☆115Aug 14, 2025Updated 10 months ago
- Official repo of Griffon series including v1(ECCV 2024), v2(ICCV 2025), G, and R, and also the RL tool Vision-R1(CVPR 2026).☆249Apr 17, 2026Updated 2 months ago
- Alignment-Free RGB-T Salient Object Detection: A Large-scale Dataset and Progressive Correlation Network☆19Apr 2, 2026Updated 3 months ago
- ☆31Jan 18, 2026Updated 5 months ago
- [ACM MM 2025] This repository is the official implementation of the paper "Motion Matters: Motion-guided Modulation Network for Skeleton-…☆22Nov 28, 2025Updated 7 months ago
- [TIP 2026 🎉] 3D Underwater Novel View Synthesis and Image Restoration!☆34Jun 5, 2026Updated 3 weeks ago
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆19Mar 18, 2026Updated 3 months ago
- Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation☆16Mar 28, 2026Updated 3 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆59Mar 4, 2025Updated last year
- Run GroundingDINO inference with TensorRT for a speedup of more than 3x!☆61Oct 17, 2024Updated last year
- A Gradio component that can be used to annotate images with bounding boxes.☆67Jun 2, 2026Updated 3 weeks ago
- EKF for Radar and Lidar measurements to estimate the position and velocity of an object, for example a pedestrian☆12Jun 18, 2020Updated 6 years ago
- A lightweight and real-time DETR for aerial images detection☆48Mar 22, 2025Updated last year
- 🙌 OpenHands: Code Less, Make More☆11Jan 8, 2025Updated last year
- A computer vision project to predict pedestrian crossing intention☆10Dec 4, 2021Updated 4 years ago
- Region Encoder Network☆21Oct 2, 2025Updated 8 months ago
- ☆29Apr 23, 2020Updated 6 years ago
- The official repository of "MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description". [ECCV Oral 2024.]☆23Sep 24, 2024Updated last year
- [RSS 2026] Code for RISE: Self-Improving Robot Policy with Compositional World Model☆299Jun 4, 2026Updated 3 weeks ago
- Galaxea's first diffusion policy release☆39Aug 18, 2025Updated 10 months ago
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆29May 27, 2025Updated last year
- ☆13Jul 11, 2025Updated 11 months ago
- Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts☆46May 20, 2026Updated last month
- UGRoadUpd: An unchanged-guided road updating framework based on remotely sensed imagery☆12Mar 15, 2023Updated 3 years ago
- [TGRS 2023] An official implementation of Multitype Feature Perception and Refined Network for Spaceborne Infrared Ship Detection☆11May 23, 2024Updated 2 years ago
- ☆10Nov 15, 2023Updated 2 years ago
- [TGRS 2021] TEBCF: Real-World Underwater Image Texture Enhancement Model Based on Blurriness and Color Fusion☆11Jun 9, 2022Updated 4 years ago
- yolov5: pytorch->onnx->caffe->hisi3559☆23Jun 5, 2024Updated 2 years ago
- [AAAI 2026 Oral] LENS: Learning to Segment Anything with Unified Reinforced Reasoning☆134Dec 3, 2025Updated 6 months ago
- [ICLR-2026] Rex-Thinker: Grounded Object Referring via Chain-of-Thought Reasoning☆150Jun 30, 2025Updated last year
- ☆10Oct 26, 2023Updated 2 years ago
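- Deployment version of yolov8n object detection, easy to port to different platforms (onnx, tensorRT, rknn, Horizon); the simplest and fastest deployment approach available online.☆50Mar 11, 2024Updated 2 years ago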