ZillaRU / AnnoAnything
万物检测(零样本检测+识别) demo for SG2300X 【Recognize Anything + GroundingDINO】
☆17Updated 8 months ago
Alternatives and similar repositories for AnnoAnything:
Users that are interested in AnnoAnything are comparing it to the libraries listed below
- 研究GOT-OCR-项目落地加速,不限语言☆57Updated 3 months ago
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆11Updated 7 months ago
- 使用SG2300X实现无瑕疵换脸☆26Updated 4 months ago
- An auto-annotation demo running on SG2300X (YOLOv8/GroundingDINO + MobileSAM)☆10Updated last year
- Text2speech & tone color conversion demo running on SG2300x 结合openvoice和emotivoice的TTS+即时克隆☆21Updated 3 months ago
- 💡💡💡awesome compute vision app in gradio☆51Updated 8 months ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆46Updated 9 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆19Updated 6 months ago
- Stable Diffusion+LCM在SG2300X上,纵享丝滑一秒出图☆17Updated 2 months ago
- run ChatGLM2-6B in BM1684X☆49Updated 10 months ago
- 使用OpenCV+onnxruntime部署中文clip做以文搜图,给出一句话来描述想要的图片,就能从图库中搜出来符合要求的图片。包含C++和Python两个版本的程序☆62Updated last year
- qwen2 and llama3 cpp implementation☆39Updated 7 months ago
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆13Updated last year
- 使用ONNXRuntime部署Detic检测2万1千种类别的物体,包含C++和Python两个版本的程序☆17Updated last year
- Multimodal chatbot with computer vision capabilities integrated☆100Updated 8 months ago
- ☆37Updated 6 months ago
- Explore LLM model deployment based on AXera's AI chips☆71Updated last month
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆197Updated last week
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆109Updated 2 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆44Updated 4 months ago
- ☆25Updated 3 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆36Updated 4 months ago
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated 11 months ago
- ☆56Updated last year
- Accelerate segment anything model inference using Tensorrt 8.6.1.6☆85Updated last year
- GLM Series Edge Models☆127Updated 3 weeks ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆26Updated last week
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆26Updated 4 months ago
- segment anything(SAM) for CPP Inference☆31Updated 7 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆11Updated last month