percent4 / yi_vl_experimentLinks
本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。
☆14Updated last year
Alternatives and similar repositories for yi_vl_experiment
Users that are interested in yi_vl_experiment are comparing it to the libraries listed below
Sorting:
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- Chinese CLIP models with SOTA performance.☆59Updated 2 years ago
- 集成了LLM与SDXL的AIGC应用程序☆29Updated last year
- 国内外数据竞赛资讯整理☆18Updated 4 years ago
- ☆57Updated last year
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Updated 2 years ago
- Music large model based on InternLM2-chat.☆22Updated last year
- 基于baichuan-7b的开源多模态大语言模型☆72Updated 2 years ago
- 可以成功Lora微调的Qwen-VL模型☆16Updated 2 years ago
- 使用DBNet检测条形码,包含C++和Python两种版本的程序☆37Updated 4 years ago
- 💡💡💡awesome compute vision app in gradio☆55Updated last year
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆17Updated 3 months ago
- Large Multimodal Model☆15Updated last year
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆26Updated last year
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Updated 2 years ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Updated 2 years ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- Our 2nd-gen LMM☆34Updated last year
- Next Gen Face detection based on YOLOv7☆62Updated 3 years ago
- 陆续开源医疗行业的深度学习模型及数据集☆13Updated 4 years ago
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated 2 years ago
- Code of AAAI2025 Paper 《VIoTGPT: Learning to Schedule Vision Tools in LLMs towards Intelligent Video Internet of Things》☆15Updated 11 months ago
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆28Updated last year
- 结合 mmdetection 、 label studio 实现数据集自动标注、模型自动迭代的 AI 闭环☆20Updated 3 years ago
- ChineseOcr Lite Mnn,超轻量级中文OCR PC Demo,使用MNN推理☆28Updated 4 years ago
- ☆14Updated 6 years ago
- 万物检测(零样本检测+识别) demo for SG2300X 【Recognize Anything + GroundingDINO】☆24Updated last year
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆35Updated 6 months ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆62Updated last year
- Xtuner Factory☆35Updated last year