Pillars-Creation / Visualglm-image-to-text
Adds some files missing from Visualglm so that Visualglm can be trained; the example performs physiognomy (face-reading) recognition on human faces
☆13Updated 2 years ago
Alternatives and similar repositories for Visualglm-image-to-text
Users who are interested in Visualglm-image-to-text compare it to the repositories listed below
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated 2 years ago
- A Qwen-VL model that can be successfully fine-tuned with LoRA☆17Updated last year
- Deep learning models and datasets for the medical industry, open-sourced progressively☆13Updated 3 years ago
- A curated collection of news on data competitions in China and abroad☆18Updated 3 years ago
- Barcode detection with DBNet, with both C++ and Python versions of the program☆37Updated 4 years ago
- ☆28Updated 11 months ago
- Deploying the 1.5B model distilled from DeepSeek-R1 on Android phones☆22Updated 5 months ago
- ☆17Updated 3 years ago
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization competition: third-place solution in the preliminary round☆49Updated last year
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆32Updated last year
- Accelerated deployment of Bert models with TensorRT☆9Updated 3 years ago
- ☆19Updated 4 years ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆23Updated 2 years ago
- Chinese license plate recognition☆29Updated 3 years ago
- Top-1 solution for retail product recognition, implemented with the MindSpore AI framework☆41Updated 3 years ago
- deploy onnx models with TensorRT and LibTorch☆17Updated 3 years ago
- An AIGC application that integrates an LLM with SDXL☆29Updated last year
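- The llava-Qwen2-7B-Instruct-Chinese-CLIP model, with enhanced recognition of Chinese text and of the implied meaning of memes, approaching the recognition level of gpt4o and claude-3.5-sonnet!☆23Updated 11 months ago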
- Chinese CLIP models with SOTA performance.☆55Updated last year
- PP-PicoDet-Android-Demo☆30Updated 3 years ago
- A C++ implementation of yolov5 on rknn; a complete project that bundles its dependency libraries and can be compiled and run directly☆20Updated 3 years ago
- Building a VLM model starts from the basic module.☆16Updated last year
- ☆23Updated 2 years ago
- Tensorflow implementation for Dash☆32Updated 2 years ago
- ☆28Updated 3 years ago
- This project covers experiments and applications with Yi's multimodal model series, such as Yi-VL-6B/34B.☆13Updated last year
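- A dynamic data dashboard built with django + pyecharts + PP-Human, with practical features such as collecting and storing pedestrian-flow data, alerts for events like fights and falls, and mask detection. The edge version uses onnx inference for better efficiency; the server version supports pushing and pulling video streams☆32Updated 2 years ago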
- ☆64Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- Xtuner Factory☆33Updated last year