Simple, efficient multi-GPU fine-tuning of large models with deepspeed + trainer
☆133May 27, 2023Updated 3 years ago
Alternatives and similar repositories for ChatGLM_mutli_gpu_tuning
Users that are interested in ChatGLM_mutli_gpu_tuning are comparing it to the libraries listed below.
- Code to reproduce THUIR's submissions for COLIEE 2023 Task1 and Task2☆28May 12, 2023Updated 3 years ago
- SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search☆23May 24, 2023Updated 3 years ago
- The official repo for our SIGIR'23 Full paper: Structure-aware Pre-trained Language Model for Legal Case Retrieval☆98May 9, 2023Updated 3 years ago
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 3 years ago
- ☆15Jul 25, 2025Updated 10 months ago
- LLM with LuXun (鲁迅) style☆91May 15, 2023Updated 3 years ago
- Large Language Models as Evaluators for Recommendation Explanations (RecSys 2024 Reproducibility)☆20Aug 13, 2025Updated 9 months ago
- A general framework for evaluating the performance of large language models (LLMs) based on a peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- LexiLaw - a Chinese legal large language model☆1,019Mar 12, 2026Updated 3 months ago
- chatglm on multiple GPUs using deepspeed and☆409Jul 8, 2024Updated last year
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- A Large-Scale Chinese Legal Case Retrieval Dataset☆88Dec 29, 2024Updated last year
- An evaluation framework to test AI in a trial-and-error process. It is a simplified Natural Selection test.☆22Mar 11, 2025Updated last year
- chatglm3-6b: fine-tuning / LoRA / inference / single-node multi-GPU / deepspeed / multi-turn dialogue support☆17Nov 30, 2023Updated 2 years ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆163Jul 3, 2023Updated 2 years ago
- Instruction fine-tuning of the BLOOM model☆24Jun 15, 2023Updated 2 years ago
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development☆22Jul 24, 2023Updated 2 years ago
- Instruction fine-tuning of large models with a single codebase☆39Aug 1, 2023Updated 2 years ago
- Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT☆3,723Oct 12, 2023Updated 2 years ago
- This is our implementation of IntEL-Intent-aware Ranking Ensemble for Personalized Recommendation (SIGIR2023)☆24Nov 17, 2023Updated 2 years ago
- Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction"☆12Oct 20, 2023Updated 2 years ago
- Code for MBGE-recognition: Emotion recognition based on multi-view body gestures, accepted at ICIP 2019.☆12Apr 6, 2023Updated 3 years ago
- Wide & Deep CTR model in PyTorch☆26Sep 25, 2019Updated 6 years ago
- ☆43Dec 15, 2023Updated 2 years ago
- ☆26Jul 25, 2025Updated 10 months ago
- Open ChatGLM Eyes to See the World☆13Mar 30, 2023Updated 3 years ago
- A fine-tuning solution based on ChatGLM-6B + LoRA☆3,746Nov 25, 2023Updated 2 years ago
- Fine-tuning the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models on specific downstream tasks, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more☆2,778Dec 12, 2023Updated 2 years ago
- Chinese legal LLaMA (LLaMA for the Chinese legal domain)☆993Aug 28, 2024Updated last year
- A beginner's introductory tutorial on model compression☆22Jul 7, 2024Updated last year
- ☆22Apr 22, 2025Updated last year
- Repo. for RLCF.☆15Apr 1, 2024Updated 2 years ago
- An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks☆2,077Nov 16, 2023Updated 2 years ago
- Parameter-efficient fine-tuning of ChatGLM-6B based on LoRA and P-Tuning v2☆55May 17, 2023Updated 3 years ago
- GLM-SIMPLE-EVALS: The evaluation repository for the GLM-4.5 series of models by Z.ai.☆41Oct 17, 2025Updated 7 months ago
- ☆84Sep 9, 2023Updated 2 years ago
- "Taoli" (桃李): a large model for international Chinese language education☆191Nov 13, 2023Updated 2 years ago
- Chinese NLP solutions (large models, data, models, training, inference)☆3,826Aug 5, 2025Updated 10 months ago
- Code for the paper "RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement"☆35Jan 9, 2024Updated 2 years ago