FudanNLPLAB / CBook-150K
中文图书语料MD5链接
☆212Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for CBook-150K
- ☆173Updated last year
- 中文 Instruction tuning datasets☆118Updated 7 months ago
- 语言模型中文认知能力分析☆235Updated last year
- ☆125Updated last year
- 文本去重☆67Updated 6 months ago
- 中文大语言模型评测第一期☆106Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆87Updated 8 months ago
- ☆158Updated last year
- pCLUE: 1000000+多任务提示学习数据集☆469Updated 2 years ago
- Chinese large language model base generated through incremental pre-training on Chinese datasets☆234Updated last year
- ☆157Updated last year
- ☆273Updated 6 months ago
- A framework for cleaning Chinese dialog data☆261Updated 3 years ago
- ☆297Updated last year
- alpaca中文指令微调数据集☆391Updated last year
- ☆62Updated last year
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆99Updated last year
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model☆75Updated 2 years ago
- ☆93Updated 8 months ago
- A Chinese Open-Domain Dialogue System☆314Updated last year
- A unified tokenization tool for Images, Chinese and English.☆150Updated last year
- 中文大语言模型评测第二期☆70Updated last year
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆151Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆109Updated 5 months ago
- Summarize all open source Large Languages Models and low-cost replication methods for Chatgpt.☆135Updated last year
- text embedding☆140Updated last year
- Baichuan-13B 指令微调☆89Updated last year
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆163Updated last year
- Implementation of Chinese ChatGPT☆287Updated last year