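CanvaChen / llm-dataset-chinese-poetry
Goal: compile a high-quality classical Chinese poetry dataset for large language models, covering the pre-Qin period through the modern era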
☆99Updated last year
Alternatives and similar repositories for llm-dataset-chinese-poetry
Users who are interested in llm-dataset-chinese-poetry are comparing it to the repositories listed below
- A plan to extend ChatHaruhi into a zero-shot role-playing model☆106Updated last year
- A small Chinese chat model, trained with supervision on a large amount of data using T5 base.☆100Updated last year
- "桃李" (Taoli): a large language model for international Chinese education☆179Updated last year
- The Silk Magic Book records Magic Prompts for some very large LLMs. The Silk Magic Book belongs to the project Luotuo (骆驼), which c…☆56Updated 2 years ago
- deep learning☆149Updated 3 weeks ago
- Text deduplication☆72Updated last year
- CamelBell (驼铃) is a Chinese language tuning project based on LoRA. CamelBell belongs to Project Luotuo (骆驼), an open-sourced Chinese-…☆174Updated last year
- GTS Engine: A powerful NLU training system. GTS-Engine is an out-of-the-box, high-performance natural language understanding engine focused on few-shot tasks; it can automatically produce NLP models from only a small number of samples.☆91Updated 2 years ago
- A tool for manually annotating and ranking response data in the RLHF stage of large model training.☆251Updated last year
- Extract dialogue datasets from novels☆191Updated 11 months ago
- Just for debug☆56Updated last year
- Alpaca Chinese Dataset -- a Chinese instruction fine-tuning dataset☆205Updated 7 months ago
- [EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models☆464Updated 4 months ago
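- Focused on Chinese-domain large language models applied to a specific industry or field, yielding industry-specific, company-level, or industry-level domain models.☆118Updated 2 months ago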
- ☆221Updated last year
- Mimix: A Text Generation Tool and Pretrained Chinese Models☆155Updated 7 months ago
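- chatglm-6b fine-tuning/LoRA/PPO/inference; samples are automatically generated integer/decimal arithmetic (addition, subtraction, multiplication, division); runs on GPU or CPU☆164Updated last year
- Firefly Chinese LLaMA-2 large model; supports incremental pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models☆410Updated last year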
- Imitate OpenAI with Local Models☆87Updated 9 months ago
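- Comprehensive evaluation of Chinese versions of the open-source Llama2 models, based on SuperCLUE's OPEN benchmark | Llama2 Chinese evaluation with SuperCLUE☆126Updated last year
- HanFei-1.0 (韩非), China's first legal large language model trained with full-parameter training☆116Updated last year
- Uses an LLM plus a sensitive-word lexicon to automatically determine whether text involves sensitive words.☆124Updated last year
- 活字 (Huozi) general-purpose large language model☆388Updated 8 months ago
- Customized fine-tuning on top of Chinese open-source large models, to obtain your own dedicated language model.☆47Updated 2 years ago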
- SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding☆225Updated last year
- ☆308Updated 2 years ago
- llama inference for tencentpretrain☆98Updated last year
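- The 夫子•明察 (Fuzi-Mingcha) judicial large language model was jointly developed by Shandong University, Inspur Cloud, and China University of Political Science and Law. It uses ChatGLM as its base model and is trained on massive unsupervised Chinese judicial corpora and supervised judicial fine-tuning data. The model supports statute retrieval, case analysis, syllogistic judgment reasoning, and judicial dialogue, aiming to provide users with comprehensive, highly accurate legal consultation and answers…☆343Updated 7 months ago
- pCLUE: 1,000,000+ multi-task prompt learning dataset☆495Updated 2 years ago
- Train a mini Chinese large language model from scratch that can hold basic conversations; the model size depends on the hardware at hand☆59Updated 9 months ago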