breezedeus / Coin-CLIPLinks
Coin-CLIP: fine-tuned with a vast collection of coin images from CLIP using contrastive learning. It enhances feature extraction for coins, boosting image search accuracy. This model merges Visual Transformer (ViT) with CLIP's multimodal learning, optimized for numismatic applications.
☆23Updated 2 years ago
Alternatives and similar repositories for Coin-CLIP
Users that are interested in Coin-CLIP are comparing it to the libraries listed below
Sorting:
- Chinese CLIP models with SOTA performance.☆60Updated 2 years ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- A light proxy solution for HuggingFace hub.☆49Updated 2 years ago
- ☆57Updated 2 years ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- 用于学习GOT/Qwen/OnnxLLm☆53Updated last year
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Updated 2 years ago
- 结合 mmdetection 、 label studio 实现数据集自动标注、模型自动迭代的 AI 闭环☆20Updated 3 years ago
- Our 2nd-gen LMM☆34Updated last year
- 读光中英文OCR onnx 版本模型使用 | Code for using the ONNX version of DuGuang OCR in both Chinese and English☆56Updated 2 months ago
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆23Updated last year
- 研究GOT-OCR-项目落地加速,不限语言☆62Updated last year
- segment anything(SAM) for CPP Inference☆31Updated last year
- 使用onnxruntime部署实时视频帧插值,包含C++和Python两个版本的程序☆28Updated last year
- ☆31Updated last year
- 💡💡💡awesome compute vision app in gradio☆55Updated last year
- 陆续开源医疗行业的深度学习模型及数据集☆13Updated 4 years ago
- ☆79Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Updated last year
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆62Updated last year
- 集成了LLM与SDXL的AIGC应用程序☆29Updated 2 years ago
- ICDAR 2024 Table OCR Model☆39Updated last week
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆27Updated last year
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆14Updated 2 years ago
- segment anything model (SAM) infer by ncnn on Android mobile phone☆29Updated 2 years ago
- 使用ONNXRuntime部署鲁棒性视频抠图,包含C++和Python两种版本的程序☆46Updated 4 years ago
- Submodule for Grounded-SAM☆12Updated 2 years ago
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆23Updated last month
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Updated 2 years ago