deepspeed+trainer简单高效实现多卡微调大模型
☆133May 27, 2023Updated 2 years ago
Alternatives and similar repositories for ChatGLM_mutli_gpu_tuning
Users that are interested in ChatGLM_mutli_gpu_tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to reproduce THUIR‘s submissions for COLIEE 2023 Task1 and Task2☆28May 12, 2023Updated 2 years ago
- SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search☆23May 24, 2023Updated 2 years ago
- The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval☆98May 9, 2023Updated 2 years ago
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 2 years ago
- ☆13Jul 25, 2025Updated 7 months ago
- LLM with LuXun (鲁迅) style☆89May 15, 2023Updated 2 years ago
- Large Language Models as Evaluators for Recommendation Explanations (RecSys 2024 Reproducibility)☆20Aug 13, 2025Updated 7 months ago
- Code for KERM: Incorporating Explicit Knowledge in Pre-trained Language Models for Passage Re-ranking, accepted at SIGIR 2022.☆19Oct 31, 2022Updated 3 years ago
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- LexiLaw - 中文法律大模型☆985Mar 12, 2026Updated last week
- chatglm多gpu用deepspeed和☆408Jul 8, 2024Updated last year
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- A Large-Scale Chinese Legal Case Retrieval Dataset☆85Dec 29, 2024Updated last year
- An evaluation framework to test AI in a trial-and-error process. It is a simplified Natural Selection test.☆22Mar 11, 2025Updated last year
- chatglm3-6b, 微调/LORA/推理/单机多卡/deepspeed/支持多轮对话☆17Nov 30, 2023Updated 2 years ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆162Jul 3, 2023Updated 2 years ago
- BLOOM 模型的指令微调☆24Jun 15, 2023Updated 2 years ago
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆21Jul 24, 2023Updated 2 years ago
- Pytorch implementation of CACM (WSDM'20)☆28Dec 27, 2021Updated 4 years ago
- 一套代码指令微调大模型☆39Aug 1, 2023Updated 2 years ago
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,730Oct 12, 2023Updated 2 years ago
- This is our implementation of IntEL-Intent-aware Ranking Ensemble for Personalized Recommendation (SIGIR2023)☆23Nov 17, 2023Updated 2 years ago
- 中文对话数据清洗☆32Nov 8, 2022Updated 3 years ago
- Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction"☆12Oct 20, 2023Updated 2 years ago
- Code for MBGE-recognition: Emotion recognition based on multi-view body gestures, accepted at ICIP 2019.☆12Apr 6, 2023Updated 2 years ago
- wide deep ctr model by pytorch☆27Sep 25, 2019Updated 6 years ago
- ☆43Dec 15, 2023Updated 2 years ago
- ☆26Jul 25, 2025Updated 7 months ago
- Open ChatGLM Eyes to See the World☆13Mar 30, 2023Updated 2 years ago
- 基于ChatGLM-6B + LoRA的Fintune方案☆3,758Nov 25, 2023Updated 2 years ago
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B 模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,781Dec 12, 2023Updated 2 years ago
- 中文法律LLaMA (LLaMA for Chinese legel domain)☆986Aug 28, 2024Updated last year
- llama,chatglm 等模型的微调☆91Jul 18, 2024Updated last year
- ☆22Apr 22, 2025Updated 11 months ago
- Repo. for RLCF.☆15Apr 1, 2024Updated last year
- An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks☆2,076Nov 16, 2023Updated 2 years ago
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆55May 17, 2023Updated 2 years ago
- LegalOne: A Family of Foundation Models for Reliable Legal Reasoning☆43Feb 3, 2026Updated last month
- ☆84Sep 9, 2023Updated 2 years ago