percent4 / yi_vl_experiment
本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。
☆12Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for yi_vl_experiment
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated 9 months ago
- Stable Diffusion in TensorRT 8.5+☆14Updated last year
- Chinese CLIP models with SOTA performance.☆48Updated last year
- Building a VLM model starts from the basic module.☆10Updated 7 months ago
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆22Updated 9 months ago
- Music large model based on InternLM2-chat.☆21Updated 3 months ago
- Bert TensorRT模型加速部署☆9Updated 2 years ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated last year
- ☆27Updated 5 months ago
- Whisper in TensorRT-LLM☆14Updated last year
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated last year
- 可以成功Lora微调的Qwen-VL模型☆16Updated last year
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆32Updated last month
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆12Updated last year
- ☆35Updated 4 months ago
- Tensorflow implementation for Dash☆26Updated 2 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆34Updated last year
- 微信公众号:机器感知 | Tracking the Latest Arxiv Papers☆37Updated 9 months ago
- 用Python封装飞书文档API,直接读写文档☆9Updated 11 months ago
- 使用onnxruntime 部署实时视频帧插值,包含C++和Python两个版本的程序☆22Updated 8 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆15Updated this week
- ☆11Updated 2 months ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆12Updated 6 months ago
- Facebook Image Similarity Challenge 2021☆19Updated 2 years ago
- Large Multimodal Model☆15Updated 7 months ago
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆13Updated 3 months ago
- Taiyi-Diffusion-XL训练代码☆21Updated 5 months ago