xv44586 / Chinese-instruction-datasetsView external linksLinks
中文 Instruction tuning datasets
☆143Apr 10, 2024Updated last year
Alternatives and similar repositories for Chinese-instruction-datasets
Users that are interested in Chinese-instruction-datasets are comparing it to the libraries listed below
Sorting:
- Large-scale exact string matching tool☆17Mar 7, 2025Updated 11 months ago
- LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具,支持通过简单的界面训练 LoRA 模型用于视频生成。本训练器提供了直观的 GUI 界面,使用户能够轻松设置和启动训练流程,无需编写复杂代码。☆13Jul 18, 2025Updated 6 months ago
- 基于sentence-transformers实现文本转向量的机器人☆46Aug 22, 2022Updated 3 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆211Jan 9, 2025Updated last year
- ChatGLM-6B 指令学习|指令数据|Instruct☆655Apr 10, 2023Updated 2 years ago
- Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。☆1,127Feb 27, 2024Updated last year
- 基于langchain设计的智能体任务,包含规划会话场景资源,构建子任务,任务执行器包含(MCTS)☆32Nov 10, 2025Updated 3 months ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆120Dec 10, 2024Updated last year
- Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。☆1,037Oct 19, 2023Updated 2 years ago
- ☆14Dec 26, 2022Updated 3 years ago
- 使用Qwen1.5-0.5B-Chat模型进行通用信息抽取任务的微调,旨在: 验证生成式方法相较于抽取式NER的效果; 为新手提供简易的模型微调流程,尽量减少代码量; 大模型训练的数据格式处理。☆15Sep 6, 2024Updated last year
- Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、…☆6,635Oct 24, 2024Updated last year
- ☆36Sep 6, 2024Updated last year
- alpaca中文指令微调数据集☆397Mar 26, 2023Updated 2 years ago
- 活字通用大模 型☆391Sep 12, 2024Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆285Aug 20, 2023Updated 2 years ago
- Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集☆3,055Apr 14, 2024Updated last year
- 大语言模型训练和服务调研☆37Aug 4, 2023Updated 2 years ago
- [EMNLP 2022] ReCo: Reliable Causal Chain Reasoning via Structural Causal Recurrent Neural Networks☆17Apr 24, 2024Updated last year
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆1,019Apr 27, 2024Updated last year
- 中文自然语言推理与语义相似度数据集☆368Jan 5, 2022Updated 4 years ago
- Multi-turn alpaca is an extension of stanford alpaca and supports multi-turn dialogue 多轮对话版alpaca☆22May 9, 2023Updated 2 years ago
- An automatic MLLM hallucination detection framework☆19Sep 26, 2023Updated 2 years ago
- 知乎话题树可视化☆15Apr 11, 2019Updated 6 years ago
- 一个非常高效的字符串匹配工具,支持正向/反向最大匹配分词和多模式字符串精确匹配☆16Jul 29, 2023Updated 2 years ago
- Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合☆5,518Dec 14, 2025Updated 2 months ago
- A Chinese medical ChatGPT based on LLaMa, training from large-scale pretrain corpus and multi-turn dialogue dataset.☆384Dec 12, 2023Updated 2 years ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆67Mar 27, 2023Updated 2 years ago
- code and data for "CSCD-NS: a Chinese Spelling Check Dataset for Native Speakers"☆82Aug 18, 2024Updated last year
- PromptCLUE, 全中文任务支持零样本学习模型☆665Jun 16, 2023Updated 2 years ago
- 基于rasa_框架实现指自然语言相关功能:实体识别、文本分类、代消解功能、关系抽取等☆17May 22, 2023Updated 2 years ago
- MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting☆21Jul 11, 2023Updated 2 years ago
- Latex beamer template for RUC☆20Mar 12, 2015Updated 10 years ago
- ☆18Apr 28, 2022Updated 3 years ago
- Rephrasing Language Model for CSC (AAAI 2024)☆44May 14, 2024Updated last year
- 语言模型中文认知能力分析☆235Sep 9, 2023Updated 2 years ago
- ☆129May 27, 2023Updated 2 years ago
- 👋 欢迎来到 ChatGLM 创意世界!你可以使用修订和续写的功能来生成创意内容!☆250Jul 8, 2024Updated last year