raynium / pyvitkLinks
python越南语分词器
☆10Updated 6 years ago
Alternatives and similar repositories for pyvitk
Users that are interested in pyvitk are comparing it to the libraries listed below
Sorting:
- distill large scale web page text☆12Updated 2 years ago
- English or Chinses GPT2Dialog model from GPT2-chitchat☆12Updated 5 years ago
- Corpus creator for Chinese Wikipedia☆41Updated 4 years ago
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆13Updated 2 years ago
- 夸夸机器人☆20Updated 3 years ago
- Large-scale exact string matching tool☆17Updated 9 months ago
- 裁判文书数据☆11Updated 5 years ago
- 个人学习用。请star或fork原作者。☆27Updated 10 years ago
- 新词发现,信息熵,左右互信息☆16Updated 7 years ago
- 国家统计局中国省市县乡村5级地址抓取,http://www.stats.gov.cn/tjsj/tjbz/tjyqhdmhcxhfdm/2018/index.html☆12Updated 5 years ago
- ☆32Updated 2 years ago
- baike schema crawler for baidu baike , hudongbaike. 面向百度百科与互动百科的概念分类体系抓取脚本☆38Updated 7 years ago
- Finetune CPM-1☆24Updated 4 years ago
- 中文文本改写☆20Updated 5 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Updated 2 years ago
- 从门户网站爬取新闻的摘要-标题对使用seq2seq根据摘要生成标题☆45Updated 8 years ago
- Source code and checkpoints for legal pre-trained language models.☆15Updated 4 years ago
- 基于百度webqa与dureader数据集训练的Albert Large QA模型☆77Updated 5 years ago
- BMInf demos.☆15Updated 4 years ago
- 基于腾讯TexSmart分词SDK的ES分词插件☆15Updated 5 years ago
- 中文 NLP 语料库数据集☆20Updated 7 years ago
- 基于电商导购机器人,自然语言理解(NLU),文本纠错,歧义词消歧☆12Updated 5 years ago
- Attentional Neural Network that translates text to phones.☆11Updated 7 years ago
- ☆34Updated 4 years ago
- GLM (General Language Model)☆24Updated 3 years ago
- Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]☆12Updated 8 years ago
- Chinese Couplets Dataset without vulgar words. 不包含敏感内容的对联数据集。☆77Updated 6 years ago
- Qimen表示的是奇门遁甲之术,用于抽取各种实体的工具。☆29Updated 5 years ago
- 🔥 专注于中文的「自然语言处理框架」:中文分词;平衡类别;数据集划分...☆12Updated 5 years ago
- 百川Dynamic NTK-ALiBi的代 码实现:无需微调即可推理更长文本☆49Updated 2 years ago