专注于中文领域大语言模型,落地到某个行业某个领域,成为一个行业大模型、公司级别或行业级别领域大模型。
☆126Mar 6, 2025Updated last year
Alternatives and similar repositories for LLMs
Users that are interested in LLMs are comparing it to the libraries listed below
Sorting:
- Implementation of Chinese ChatGPT☆289Nov 20, 2023Updated 2 years ago
- alpaca中文指令微调数据集☆397Mar 26, 2023Updated 2 years ago
- ChatGLM-6B fine-tuning.☆136Apr 25, 2023Updated 2 years ago
- ☆235May 10, 2024Updated last year
- 受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果,通过GPT获得question和answer来作为训练数据☆18May 12, 2023Updated 2 years ago
- 中文nlp解决方案(大模型、数据、模型、训练、推理)☆3,786Aug 5, 2025Updated 7 months ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- ☆149Apr 16, 2024Updated last year
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,648Oct 24, 2024Updated last year
- The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.☆35Apr 9, 2024Updated last year
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆165Aug 24, 2023Updated 2 years ago
- moss chat finetuning☆51Apr 23, 2024Updated last year
- Using FasterTransformer for accelerating the predict speed of bert and roberta☆14Sep 20, 2019Updated 6 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- 实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案,LLM-Base+LLM-X+Alpaca,初期,LLM-Base为Chatglm6B底座模型,LLM-X是LLAMA增强模型。该方案简易高效,目标是使此类语言模型能够低能耗广泛部署,并最…☆116Jul 19, 2023Updated 2 years ago
- chatglm 6b finetuning and alpaca finetuning☆1,537Mar 9, 2025Updated last year
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆118Feb 19, 2024Updated 2 years ago
- 02. Enabling various applications to be AI-enabled or used by AI.☆31Sep 2, 2024Updated last year
- Deploy ChatGLM on Modelz☆16Mar 20, 2023Updated 3 years ago
- Train a 1B LLM with 1T tokens from scratch by personal☆791Apr 27, 2025Updated 10 months ago
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆654Aug 17, 2024Updated last year
- TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLO…☆979Sep 14, 2024Updated last year
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,194May 3, 2025Updated 10 months ago
- huggingface ChineseBert Tokenizer☆17Apr 16, 2022Updated 3 years ago
- ☆12Feb 28, 2025Updated last year
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆42Jul 17, 2023Updated 2 years ago
- ☆15Oct 9, 2023Updated 2 years ago
- ChatGLM-6B 指令学习|指令数据|Instruct☆653Apr 10, 2023Updated 2 years ago
- MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。☆5,052Updated this week
- deep training task☆30Apr 28, 2023Updated 2 years ago
- Panda项目是于2023 年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。☆1,036Oct 19, 2023Updated 2 years ago
- BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)☆8,286Oct 16, 2024Updated last year
- 基于LLM实现CHIP2021-Task3中文临床术语标准化任务,准确率约70%。☆15Dec 16, 2024Updated last year
- OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXi…☆262Dec 10, 2024Updated last year
- ☆14Feb 24, 2025Updated last year
- smart chinese LLm☆19Jan 31, 2024Updated 2 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆69Aug 16, 2023Updated 2 years ago
- Open ChatGLM Eyes to See the World☆13Mar 30, 2023Updated 2 years ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago