ControlNet / NAVER
[ICCV] NAVER: A Neuro-Symbolic Compositional Automaton for Visual Grounding with Explicit Logic Reasoning
☆28Jan 15, 2026Updated 3 weeks ago
Alternatives and similar repositories for NAVER
Users who are interested in NAVER are comparing it to the libraries listed below.
- Segment This Thing is an efficient image segmentation model that uses a biologically-inspired foveated tokenization to reduce inference …☆56Jun 16, 2025Updated 7 months ago
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆17Dec 25, 2025Updated last month
- A real-time LiDAR-Visual-Inertial object-level semantic SLAM for forest environments☆13Dec 2, 2024Updated last year
- Implementation code for the paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- 🤖 Dataset for TextSLAM: Visual SLAM with Semantic Planar Text Features. (ICRA2020 & TPAMI2023)☆42Jan 4, 2024Updated 2 years ago
- Hypergraph Vision Transformers: Images are More than Nodes, More than Edges☆17Jul 25, 2025Updated 6 months ago
- Dur360BEV: (ICRA 2025) A Real-world 360-degree Single Camera Dataset and Benchmark for Bird-Eye View Mapping in Autonomous Driving☆23Feb 2, 2026Updated last week
- RA-LLO: Robust Adaptive Legged-LiDAR Odometry with Gaussian Process Motion Prior☆16Jul 5, 2025Updated 7 months ago
- Ubuntu configuration script with full-featured theming and one-click installation. Linux Auto Configuration Script for Ubuntu 14.04 to 22.04☆17Jul 8, 2025Updated 7 months ago
- ☆18Nov 10, 2025Updated 3 months ago
- Code for I-RAVEN-X generation and experiments☆19Sep 18, 2025Updated 4 months ago
- The official PyTorch implementation of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".☆14Mar 26, 2025Updated 10 months ago
- Sea clutter☆13May 27, 2025Updated 8 months ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- ☆13Jul 8, 2024Updated last year
- Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"☆20Jan 14, 2026Updated last month
- Official Implementation of the topograph method for topology-preserving image segmentation.☆21Oct 2, 2024Updated last year
- PRODeep: A Platform for Robustness Verification of Deep Neural Networks☆12Nov 11, 2020Updated 5 years ago
- ☆10Mar 24, 2025Updated 10 months ago
- ☆12Oct 30, 2024Updated last year
- [MICCAI 2025] FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation.☆21Sep 24, 2025Updated 4 months ago
- Module for Pickling objects in C++.☆14May 2, 2021Updated 4 years ago
- RLCNet: A Novel Deep Feature-Based Method for Online Target-Free Radar-LiDAR Calibration☆12Sep 16, 2024Updated last year
- ☆13Jul 6, 2024Updated last year
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆59Jan 23, 2025Updated last year
- Source code of NA-LOAM: Normal-based Adaptive LiDAR Odometry and Mapping☆16Aug 17, 2024Updated last year
- TaGAT For Multi-modal Retinal Image Fusion☆10Jul 31, 2024Updated last year
- A multimodal model bridging vision and genomics for biodiversity monitoring at scale.☆16Sep 18, 2025Updated 4 months ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆18Feb 1, 2026Updated last week
- Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation☆15Sep 24, 2025Updated 4 months ago
- Semantic2D: Enabling Semantic Scene Understanding with 2D Lidar Alone☆17Feb 2, 2026Updated last week
- Dolphin is a Python package that enables scalable neurosymbolic learning by performing probabilistic computations over the GPU.☆16Feb 6, 2026Updated last week
- Code for our paper: TransRAD: Retentive Vision Transformer for Enhanced Radar Object Detection☆20Sep 24, 2025Updated 4 months ago
- [MICCAI'24] Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring☆12Aug 2, 2024Updated last year
- [RA-L] SHeRLoc: Synchronized Heterogeneous Radar Place Recognition for Cross-Modal Localization☆29Nov 24, 2025Updated 2 months ago
- 🌳 [ICRA'25] Hier-SLAM: Semantic Gaussian Splatting SLAM with Hierarchical Categorical Representation☆159Jun 21, 2025Updated 7 months ago
- The official implementation for paper: Vision-Language Models are Strong Noisy Label Detectors☆15Mar 31, 2025Updated 10 months ago
- ☆17Jun 23, 2021Updated 4 years ago