CyberCommy / baidu-wiki-500w
百度百科 500 万数据集
☆33Updated last year
Alternatives and similar repositories for baidu-wiki-500w:
Users that are interested in baidu-wiki-500w are comparing it to the libraries listed below
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆15Updated last year
- 百度QA100万数据集☆47Updated last year
- 专业领域词库构建/中文新词发现/专业词库发现☆29Updated 5 years ago
- 手动实现Elasticsearch的倒排索引以及BM25算法☆46Updated 6 years ago
- ☆20Updated 3 years ago
- 图书名语料库。含部分电影、游戏名称。☆68Updated 10 months ago
- 知乎大语言模型、ChatGPT、Transformers问答☆34Updated last year
- 基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】☆30Updated 7 months ago
- 医疗语料库。医疗机构名语料库。药品本位码。☆64Updated 10 months ago
- A Python toolkit for file processing, text cleaning and data splitting. 文件处理,文本清洗和数据划分的python工具包。☆31Updated 2 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆20Updated last year
- 文本自动摘要☆92Updated last year
- 京东/淘宝客服对话数据公开,seq2seq生成模型设计对话系统获第二名☆41Updated 2 years ago
- ☆37Updated 5 years ago
- 中文纠错☆92Updated 2 years ago