FudanNLPLAB / CBook-150KLinks
中文图书语料MD5链接
☆217Updated last year
Alternatives and similar repositories for CBook-150K
Users that are interested in CBook-150K are comparing it to the libraries listed below
Sorting:
- ☆172Updated 2 years ago
- 语言模型中文认知能力分析☆236Updated 2 years ago
- 中文大语言模型评测第一期☆110Updated last year
- pCLUE: 1000000+多任务提示学习数据集☆500Updated 3 years ago
- ☆128Updated 2 years ago
- Chinese large language model base generated through incremental pre-training on Chinese datasets☆239Updated 2 years ago
- 文本去重☆76Updated last year
- A framework for cleaning Chinese dialog data☆273Updated 4 years ago
- 中文 Instruction tuning datasets☆137Updated last year
- ☆309Updated 2 years ago
- ☆281Updated last year
- ☆163Updated 2 years ago
- 中文大语言模型评测第二期☆71Updated last year
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆89Updated last year
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆161Updated 2 years ago
- NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model☆76Updated 3 years ago
- Light local website for displaying performances from different chat models.☆87Updated last year
- A Chinese Open-Domain Dialogue System☆324Updated 2 years ago
- alpaca中文指令微调数据集☆395Updated 2 years ago
- ☆460Updated last year
- EVA: Large-scale Pre-trained Chit-Chat Models☆307Updated 2 years ago
- deep learning☆148Updated 5 months ago
- ☆219Updated 2 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated 2 years ago
- 大规模中文语料☆44Updated 5 years ago
- Summarize all open source Large Languages Models and low-cost replication methods for Chatgpt.☆137Updated 2 years ago
- Efficient, Low-Resource, Distributed transformer implementation based on BMTrain☆263Updated last year
- This is the repository of the Ape210K dataset and baseline models.☆196Updated 5 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆122Updated last year
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆119Updated 10 months ago