BUAADreamer / MLLM-Finetuning-Demo
使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory
☆21Updated 4 months ago
Alternatives and similar repositories for MLLM-Finetuning-Demo:
Users that are interested in MLLM-Finetuning-Demo are comparing it to the libraries listed below
- ☆76Updated 8 months ago
- This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training …☆16Updated 3 months ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆116Updated 6 months ago
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.☆38Updated 3 months ago
- ☆27Updated 2 weeks ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆39Updated 3 weeks ago
- ☆78Updated 8 months ago
- LLM+RAG for QA☆21Updated last year
- A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆40Updated 2 months ago
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆55Updated last month
- ☆20Updated 3 months ago
- Build a simple basic multimodal large model from scratch. 从零搭建一个简单的基础多模态大模型🤖☆27Updated 7 months ago
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆78Updated 3 weeks ago
- Code and Data for Our NeurIPS 2024 paper "AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback"☆29Updated 2 months ago
- Xtuner Factory☆32Updated 10 months ago
- Happy experimenting with MLLM and LLM models!☆75Updated 3 months ago
- The first Chinese medical large vision-language model designed to integrate the analysis of textual and visual data☆58Updated last year
- 智海三乐-教育大模型☆38Updated last year
- An Easy-to-use Hallucination Detection Framework for LLMs.☆55Updated 8 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆74Updated this week
- Useful resources on data quality for machine learning and artificial intelligence.☆18Updated 2 weeks ago
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆11Updated last month
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆28Updated 3 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆106Updated 2 months ago
- ☆36Updated 4 months ago
- ☆26Updated 8 months ago
- 使用煤矿历史事故案例,事故处理报告、安全规程规章制度、技术文档、煤矿从业人员入职考试题库等数据,微调internlm2模型实现针对煤矿事故和煤矿安全知识的智能问答。☆43Updated last week
- ☆18Updated 4 months ago
- 眼科问诊大模型☆83Updated 6 months ago
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆22Updated 11 months ago