codefuse-ai / CodeFuse-MFT-VLM
☆36Updated 5 months ago
Alternatives and similar repositories for CodeFuse-MFT-VLM:
Users who are interested in CodeFuse-MFT-VLM are comparing it to the repositories listed below
- MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval☆131Updated 2 weeks ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆130Updated 9 months ago
- An open-source multimodal large language model based on baichuan-7b☆73Updated last year
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆23Updated last year
- ☆29Updated 7 months ago
- GLM Series Edge Models☆131Updated last month
- ☆56Updated last year
- SUS-Chat: Instruction tuning done right☆48Updated last year
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆36Updated 10 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆226Updated last month
- ☆78Updated 10 months ago
- Our 2nd-gen LMM☆33Updated 10 months ago
- A simple MLLM that surpasses QwenVL-Max using only open-source data, built on a 14B LLM.☆37Updated 6 months ago
- An open-source LLM based on an MoE structure.☆58Updated 9 months ago
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆100Updated 10 months ago
- A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Based on TinyLLaVA_Factory.☆46Updated 2 weeks ago
- zero: zero-training LLM parameter tuning☆31Updated last year
- ☆225Updated 10 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆60Updated 5 months ago
- ☆67Updated last year
- LongQLoRA: Extend Context Length of LLMs Efficiently☆164Updated last year
- Just for debug☆56Updated last year
- We are the first role-play large model that is fully available for commercial use.☆39Updated 7 months ago
- MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer☆220Updated last year
- ☆27Updated 10 months ago
- Train a LLaVA model with better Chinese support, and open-source the training code and data.☆54Updated 6 months ago
- Delta-CoMe can achieve near-lossless 1-bit compression and has been accepted by NeurIPS 2024☆55Updated 4 months ago
- ☆166Updated 8 months ago
- Chinese CLIP models with SOTA performance.☆54Updated last year
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆118Updated 4 months ago