percent4 / yi_vl_experiment
本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。
☆13Updated last year
Alternatives and similar repositories for yi_vl_experiment
Users that are interested in yi_vl_experiment are comparing it to the libraries listed below
Sorting:
- 国内外数据竞赛资讯整理☆18Updated 3 years ago
- 集成了LLM与SDXL 的AIGC应用程序☆27Updated last year
- Our 2nd-gen LMM☆33Updated 11 months ago
- Chinese CLIP models with SOTA performance.☆55Updated last year
- ☆25Updated 4 months ago
- Whisper in TensorRT-LLM☆15Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆17Updated 8 months ago
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated 2 years ago
- Large Multimodal Model☆15Updated last year
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- Code for "An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought"☆13Updated 9 months ago
- Music large model based on InternLM2-chat.☆22Updated 4 months ago
- ☆28Updated last year
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆23Updated last year
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- Taiyi-Diffusion-XL训练代码☆22Updated 11 months ago
- 本项目使用LLaVA 1.6多模态模型实现以文搜图和以图搜图功能。☆21Updated last year
- 中文原生文生图测评基准☆9Updated 10 months ago
- FSANet: 1 Mb!! Head Pose Estimation with MNN、TNN and ONNXRuntime C++.☆17Updated 3 years ago
- Stable Diffusion in TensorRT 8.5+☆14Updated 2 years ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Updated last year
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated 7 months ago
- ☆34Updated 2 years ago
- Submodule for Grounded-SAM☆11Updated 2 years ago
- [CVPR Challenge Rank 2nd] The codes and related files to reproduce the results for Video Similarity Challenge Descriptor Track.☆19Updated last month
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆100Updated 11 months ago
- ☆38Updated 6 months ago
- ☆67Updated last year
- Here is a demo for PDF parser (Including OCR, object detection tools)☆34Updated 7 months ago