XuRui314 / GLM4v-FinetuneLinks
Support finetuning GLM4v with zero2
☆15Updated last year
Alternatives and similar repositories for GLM4v-Finetune
Users that are interested in GLM4v-Finetune are comparing it to the libraries listed below
Sorting:
- ☆50Updated last year
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆104Updated 7 months ago
- ☆110Updated last month
- ☆57Updated last year
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆63Updated 7 months ago
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆60Updated last year
- transformers结构的中文OFA模型☆136Updated 2 years ago
- LongQLoRA: Extent Context Length of LLMs Efficiently☆167Updated 2 years ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆90Updated last year
- 基于baichuan-7b 的开源多模态大语言模型☆72Updated 2 years ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆38Updated last year
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆75Updated last year
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Updated last year
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆65Updated last year
- A Toolkit for Table-based Question Answering☆115Updated 2 years ago
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆77Updated 2 months ago
- ☆75Updated last year
- ☆29Updated last year
- ☆187Updated 11 months ago
- ☆36Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- ☆57Updated 5 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆67Updated 2 years ago
- ☆54Updated last year
- ☆79Updated last year
- [MM 2025] CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models☆49Updated last year
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆86Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆114Updated 3 months ago
- ☆142Updated last year
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"☆103Updated 2 years ago