XuRui314 / GLM4v-Finetune
Support finetuning GLM4v with zero2
☆13Updated 10 months ago
Alternatives and similar repositories for GLM4v-Finetune
Users that are interested in GLM4v-Finetune are comparing it to the libraries listed below
Sorting:
- transformers结构的中文OFA模型☆134Updated 2 years ago
- Research Code for Multimodal-Cognition Team in Ant Group☆144Updated this week
- 基于baichuan-7b的开源多模态大语言模型☆73Updated last year
- ☆47Updated 11 months ago
- ☆56Updated last year
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆88Updated 3 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆136Updated 5 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated last year
- 多轮共情对话模型PICA☆92Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆77Updated 6 months ago
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆117Updated last year
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆56Updated 8 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆65Updated 2 years ago
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆33Updated 4 months ago
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆56Updated last month
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆55Updated last year
- SUS-Chat: Instruction tuning done right☆48Updated last year
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55Updated 2 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆47Updated last year
- LLM+RAG for QA☆22Updated last year
- LoRA☆19Updated 2 years ago
- ☆26Updated 6 months ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆104Updated 2 years ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- ☆94Updated last year
- [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning☆23Updated last year
- GOT的vLLM加速实现 并结合 MinerU 实现RAG中的pdf 解析☆56Updated 6 months ago