ZillaRU / AnnoAnything
Detect-anything (zero-shot detection + recognition) demo for SG2300X [Recognize Anything + GroundingDINO]
☆22Updated last year
Alternatives and similar repositories for AnnoAnything
Users that are interested in AnnoAnything are comparing it to the libraries listed below
- Research on GOT-OCR: accelerating project deployment, any language☆61Updated 10 months ago
- A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.☆233Updated 4 months ago
- Text2speech & tone color conversion demo running on SG2300x: TTS + instant voice cloning combining openvoice and emotivoice☆22Updated 10 months ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆100Updated last year
- 💡💡💡awesome computer vision app in gradio☆54Updated last year
- Splices the SmolVLM2 vision head onto the Qwen3-0.6B model and fine-tunes the result☆311Updated last month
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆248Updated 3 weeks ago
- ☆57Updated last year
- Accelerate segment anything model inference using Tensorrt 8.6.1.6☆98Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆43Updated last month
- UMatcher: A modern template matching model☆64Updated 3 months ago
- Handwritten Text Recognition and Character Detection☆156Updated 3 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated 11 months ago
- Video understanding: Qwen video multimodal model & Dify☆64Updated last year
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆59Updated 3 months ago
- The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.☆259Updated last year
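- This project uses the LLaVA 1.6 multimodal model to implement text-to-image and image-to-image search.☆25Updated last year
- Trains a LLaVA model with better Chinese support and open-sources the training code and data.☆69Updated 11 months ago
- The llava-Qwen2-7B-Instruct-Chinese-CLIP model improves Chinese text recognition and the understanding of meme connotations, approaching the recognition level of gpt4o and claude-3.5-sonnet!☆24Updated last year
- Local knowledge-base Q&A for sophon bm1684x, based on Langchain and language models such as ChatGLM☆14Updated last year
- Flawless face swapping on the SG2300X☆30Updated last year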
- Code for using the ONNX version of DuGuang OCR in both Chinese and English☆41Updated 3 months ago
- run chatglm3-6b in BM1684X☆40Updated last year
- Florence-2☆69Updated 6 months ago
- ☆174Updated 6 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29Updated last year
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆13Updated this week
- Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆39Updated 2 months ago
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆236Updated this week
- Deploys GroundingDINO open-world object detection with onnxruntime, with both C++ and Python versions of the program☆73Updated last year