percent4 / yi_vl_experiment
本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。
☆12Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for yi_vl_experiment
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- Chinese CLIP models with SOTA performance.☆48Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated 9 months ago
- ☆27Updated 6 months ago
- 中文原生多层次文生视频测评基准☆17Updated 4 months ago
- Stable Diffusion in TensorRT 8.5+☆14Updated last year
- Code for paper <Explain Me the Painting: Multi-Topic Knowledgeable Art Description Generation> in ICCV 2021.☆12Updated 3 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆34Updated last year
- Music large model based on InternLM2-chat.☆21Updated 4 months ago
- ☆68Updated last week
- ☆35Updated 5 months ago
- ☆55Updated 9 months ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆26Updated last month
- Bert TensorRT模型加速部署☆9Updated 2 years ago
- ☆22Updated 3 months ago
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆22Updated 9 months ago
- General Image Classification Code base☆21Updated 3 years ago
- 集成了LLM与SDXL的AIGC应用程序☆25Updated 10 months ago
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated last year
- Sparse Multilabel Categorical Crossentropy☆9Updated last year
- 模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力,接近gpt4o、claude-3.5-sonnet的识别水平!☆13Updated 3 months ago
- ☆30Updated 6 months ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆22Updated 10 months ago
- ☆13Updated last year
- Whisper in TensorRT-LLM☆14Updated last year
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆38Updated 4 months ago