ZillaRU / AnnoAnything
Detect-anything (zero-shot detection + recognition) demo for SG2300X [Recognize Anything + GroundingDINO]
☆21Updated last year
Alternatives and similar repositories for AnnoAnything
Users who are interested in AnnoAnything are comparing it to the libraries listed below
- Research on GOT-OCR: speeding up project deployment, not limited to any language☆61Updated 9 months ago
- UMatcher: A modern template matching model☆59Updated 2 months ago
- 💡💡💡awesome computer vision app in gradio☆54Updated last year
- Accelerate Segment Anything Model inference using TensorRT 8.6.1.6☆97Updated last year
- Text2speech & tone color conversion demo running on SG2300x: TTS + instant voice cloning combining OpenVoice and EmotiVoice☆22Updated 9 months ago
- A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.☆223Updated 3 months ago
- Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆37Updated 2 months ago
- Splices the SmolVLM2 vision head onto the Qwen3-0.6B model and fine-tunes the combination☆226Updated 2 weeks ago
- The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.☆258Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- ☆122Updated 2 years ago
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆149Updated 6 months ago
- Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆359Updated 5 months ago
- ☆173Updated 6 months ago
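- The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves Chinese text recognition and understanding of the meaning behind memes, approaching the recognition level of gpt4o and claude-3.5-sonnet!☆24Updated last year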
- run chatglm3-6b in BM1684X☆40Updated last year
- Deploying the DeepSeek-R1 distilled 1.5B model on Android phones☆22Updated 6 months ago
- Flawless face swapping using SG2300X☆30Updated 11 months ago
- ☆57Updated last year
- Official Repository for VELM, featured in CVPRW 2025 paper: "Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal LL…☆42Updated last month
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆91Updated 5 months ago
- The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".☆109Updated 7 months ago
- Video classification annotation and video spatio-temporal annotation☆41Updated last year
- For learning GOT/Qwen/OnnxLLm☆53Updated 10 months ago
- Official code for "No time to train! Training-Free Reference-Based Instance Segmentation"☆184Updated 2 weeks ago
- Local knowledge-base question answering for sophon bm1684x, based on Langchain and language models such as ChatGLM☆13Updated last year
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆245Updated 5 months ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
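- YOLOv9 paper walkthrough, training on your own dataset, end-to-end TensorRT deployment, and NCNN deployment on Android phones☆65Updated last year
- Deploys Chinese CLIP with OpenCV + onnxruntime for text-to-image search: give a one-sentence description of the image you want, and matching images are retrieved from the gallery. Includes both C++ and Python versions☆77Updated last year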