breezedeus / Coin-CLIPLinks
Coin-CLIP: fine-tuned with a vast collection of coin images from CLIP using contrastive learning. It enhances feature extraction for coins, boosting image search accuracy. This model merges Visual Transformer (ViT) with CLIP's multimodal learning, optimized for numismatic applications.
☆22Updated last year
Alternatives and similar repositories for Coin-CLIP
Users that are interested in Coin-CLIP are comparing it to the libraries listed below
Sorting:
- Chinese CLIP models with SOTA performance.☆58Updated 2 years ago
- ☆57Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- Our 2nd-gen LMM☆34Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆60Updated last year
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated 11 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆24Updated last year
- A light proxy solution for HuggingFace hub.☆46Updated last year
- ☆79Updated last year
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆22Updated 8 months ago
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Updated 2 years ago
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆49Updated 4 months ago
- ☆30Updated last year
- ☆27Updated 11 months ago
- 💡💡💡awesome compute vision app in gradio☆55Updated last year
- ☆10Updated 3 years ago
- Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆40Updated 4 months ago
- ☆183Updated last year
- 结合 mmdetection 、 label studio 实现数据集自动标注、模型自动迭代的 AI 闭环☆19Updated 2 years ago
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆14Updated last year
- SPRINT: Script-agnostic Structure Recognition in Tables☆13Updated 6 months ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆32Updated 2 years ago
- Submodule for Grounded-SAM☆12Updated 2 years ago
- ICDAR 2024 Table OCR Model☆38Updated 2 months ago
- 集成了LLM与SDXL的AIGC应用程序☆29Updated last year
- 使用OpenCV+onnxruntime部署中文clip做以文搜图,给出一句话来描述想要的图片,就能从图库中搜出来符合要求的图片。包含C++和Python两个版本的程序☆79Updated last year
- 可以成功Lora微调的Qwen-VL模型☆16Updated last year
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆47Updated last year
- ☆15Updated 2 years ago