本项目结合了YOLO的目标检测与分割能力,以及CLIP微调分类能力,YOLO初步框选物体,CLIP对框选物体进行细致化分类,能够在复杂场景下实现物体的精准定位与属性提取。通过对目标物体的检测、分割和细粒度分类,项目特别适用于商品分类、智能货柜管理等任务
☆23Nov 26, 2024Updated last year
Alternatives and similar repositories for YOLO_CLIP_targetDetection
Users that are interested in YOLO_CLIP_targetDetection are comparing it to the libraries listed below
Sorting:
- 整理与红外可见光图像融合的相关期刊会议☆11Mar 25, 2024Updated last year
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆17Jan 5, 2026Updated 2 months ago
- ☆15Jun 14, 2025Updated 8 months ago
- [ACM MM 2025] LIDAR: Lightweight Adaptive Cue-Aware Fusion Vision Mamba for Multimodal Segmentation of Structural Cracks☆22Nov 18, 2025Updated 3 months ago
- Code for RA-L paper "One-shot Learning for Task-oriented Grasping"☆12May 9, 2024Updated last year
- Classify Traffic Signs.☆10Jan 31, 2017Updated 9 years ago
- Code for REACT: Real-time Efficient Attribute Clustering and Transfer for Updatable 3D Scene Graph☆16Feb 12, 2026Updated 3 weeks ago
- Implementation of Manuscript "Training large-scale optoelectronic neural networks with dual-neuron optical-artificial learning"☆11Oct 19, 2023Updated 2 years ago
- ☆15Oct 13, 2024Updated last year
- Loop Clousure Detector☆14Feb 2, 2018Updated 8 years ago
- ☆14Aug 31, 2025Updated 6 months ago
- 桥梁病害检测分割系 统后端项目☆14Oct 13, 2025Updated 4 months ago
- 3D_lut generate for surround view☆13Jul 31, 2019Updated 6 years ago
- The official repository for the paper "Statler: State-Maintaining Language Models for Embodied Reasoning"☆13Jun 10, 2024Updated last year
- This sample shows how to deploy an industrial computer vision model to detect real world analog pointer meters and extract corresponding …☆12Sep 23, 2022Updated 3 years ago
- Temporal memory system for AI assistants with human-like forgetting curves. All data stored locally in human-readable formats: JSONL for …☆29Updated this week
- A novel lightweight monocular depth estimation method☆32Nov 17, 2025Updated 3 months ago
- Code for paper: "Few-Shot In-Context Imitation Learning via Implicit Graph Alignment"☆22Apr 5, 2024Updated last year
- MiDaS(Multiple Depth Estimation Accuracy with Single Network)单目深度估计,部署 rk3588。☆17Jul 10, 2024Updated last year
- [Information Fusion 2025] Official implementation for "MMIF-INet: Multimodal medical image fusion by invertible network"☆19Feb 4, 2026Updated last month
- Simulate 6 DOF robotic arms using PyQt5 and OpenGL☆14Apr 16, 2023Updated 2 years ago
- Unbiased Directed Object Attention Graph for Object Navigation☆15Nov 28, 2022Updated 3 years ago
- 一个基于 bilibili 用户行为数据和深度学习的个性化视频推荐系统。☆29Mar 12, 2025Updated 11 months ago
- lidar-imu-cam-GPS时间戳硬件同步方案☆15Jul 21, 2021Updated 4 years ago
- ☆22Jan 19, 2023Updated 3 years ago
- NeoNav: Improving the Generalization of Visual Navigation via Generating Next Expected Observations☆17Jul 26, 2020Updated 5 years ago
- rgbdslam repos from OpenSLAM.org☆14May 15, 2018Updated 7 years ago
- code for A Multi-scale Information Integration Framework for Infrared and Visible Image Fusion☆17Jul 15, 2024Updated last year
- DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models☆21Apr 16, 2025Updated 10 months ago
- Rudimentary custom dataset collector for OnePose☆17Apr 26, 2023Updated 2 years ago
- reading obj and multithread loading texture☆16Apr 6, 2017Updated 8 years ago
- An implementation of NLMap with additional utilities for integration with Boston Dynamics Spot☆17Apr 26, 2023Updated 2 years ago
- ☆20Jun 3, 2020Updated 5 years ago
- EasyMocap中文文档☆17Jun 2, 2022Updated 3 years ago
- A lightweight and real-time DETR for aerial images detection☆43Mar 22, 2025Updated 11 months ago
- 这是一个不基于任何框架实现的从0到1的VLM finetune(包括Pre-train和SFT)☆37Aug 22, 2025Updated 6 months ago
- An Incremental SfM program. 增量式SfM程序☆18Apr 2, 2020Updated 5 years ago
- Code for the paper: "U2Net: A General Framework with Spatial-Spectral-Integrated Double U-Net for Image Fusion", ACM MM 2023☆23Nov 14, 2023Updated 2 years ago
- streamlit一些样例以及相关的博文收集☆25Feb 18, 2021Updated 5 years ago