专注于中文领域大语言模型,落地到某个行业某个领域,成为一个行业大模型、公司级别或行业级别领域大模型。
☆126Mar 6, 2025Updated 11 months ago
Alternatives and similar repositories for LLMs
Users that are interested in LLMs are comparing it to the libraries listed below
Sorting:
- Implementation of Chinese ChatGPT☆288Nov 20, 2023Updated 2 years ago
- alpaca中文指令微调数据集☆397Mar 26, 2023Updated 2 years ago
- ChatGLM-6B fine-tuning.☆136Apr 25, 2023Updated 2 years ago
- ☆235May 10, 2024Updated last year
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- 中文nlp解决方案(大模型、数据、模型、训练、推理)☆3,779Aug 5, 2025Updated 6 months ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- ☆148Apr 16, 2024Updated last year
- This project is mainly to explore what effect can be achieved by fine-tuning LLM model (ChatGLM-6B)of about 6B in vertical field (Romance…☆26Apr 6, 2023Updated 2 years ago
- Deploy ChatGLM on Modelz☆16Mar 20, 2023Updated 2 years ago
- 基于LLM实现CHIP2021-Task3中文临床术语标准化任务,准确率约70%。☆15Dec 16, 2024Updated last year
- The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …☆19Sep 18, 2025Updated 5 months ago
- Using FasterTransformer for accelerating the predict speed of bert and roberta☆14Sep 20, 2019Updated 6 years ago
- 实现一种多Lora权值集成切换+Zero-Finetune零微调增强的跨模型技术方案,LLM-Base+LLM-X+Alpaca,初期,LLM-Base为Chatglm6B底座模型,LLM-X是LLAMA增强模型。该方案简易高效,目标是使此类语言模型能够低能耗广泛部署,并最…☆116Jul 19, 2023Updated 2 years ago
- moss chat finetuning☆51Apr 23, 2024Updated last year
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆117Feb 19, 2024Updated 2 years ago
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,635Oct 24, 2024Updated last year
- GOAT(山羊)是中英文大语言模型,基于LlaMa进行SFT。☆12Apr 24, 2023Updated 2 years ago
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.☆35Apr 9, 2024Updated last year
- Train a 1B LLM with 1T tokens from scratch by personal☆789Apr 27, 2025Updated 10 months ago
- 受到self-instruct启发,除了通用LLM还能做垂直领域的小LLM实现定制效果,通过GPT获得question和answer来作为训练数据☆18May 12, 2023Updated 2 years ago
- huggingface ChineseBert Tokenizer☆16Apr 16, 2022Updated 3 years ago
- TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLO…☆981Sep 14, 2024Updated last year
- 中文Mixtral-8x7B(Chinese-Mixtral-8x7B)☆654Aug 17, 2024Updated last year
- chatglm 6b finetuning and alpaca finetuning☆1,536Mar 9, 2025Updated 11 months ago
- 基于nodejs的知乎爬虫,x-zse-96,支持文章,评论,图片下载到本地☆16Nov 8, 2023Updated 2 years ago
- 实现Blip2RWKV+QFormer的多模态图文对话大模型,使用Two-Step Cognitive Psychology Prompt方法,仅3B参数的模型便能够出现类人因果思维链。对标MiniGPT-4,ImageBind等图文对话大语言模型,力求以更小的算力和资源实…☆41Jul 17, 2023Updated 2 years ago
- 怎么训练一个LLM分词器☆153Jul 13, 2023Updated 2 years ago
- MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。☆4,806Feb 10, 2026Updated 2 weeks ago
- OpenLLMWiki: Docs of OpenLLMAI. Survey, reproduction and domain/task adaptation of open source chatgpt alternatives/implementations. PiXi…☆263Dec 10, 2024Updated last year
- ☆22Jul 15, 2024Updated last year
- Humanable Chat Generative-model Fine-tuning | LLM微调☆206Sep 22, 2023Updated 2 years ago
- ChatGLM-6B 指令学习|指令数据|Instruct☆653Apr 10, 2023Updated 2 years ago
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆44Dec 31, 2024Updated last year
- ☆15Oct 9, 2023Updated 2 years ago
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆165Aug 24, 2023Updated 2 years ago
- CMIVQA☆18Jun 3, 2024Updated last year