raynium / pyvitk
Vietnamese word segmenter for Python
☆10Updated 5 years ago
Alternatives and similar repositories for pyvitk
Users that are interested in pyvitk are comparing it to the libraries listed below
- Unsupervised tableQA and databaseQA on Chinese finance questions and tabular data☆12Updated 2 years ago
- English or Chinese GPT2Dialog model from GPT2-chitchat☆12Updated 5 years ago
- Compliment chatbot (夸夸机器人)☆20Updated 3 years ago
- distill large scale web page text☆12Updated last year
- Court judgment document data☆11Updated 4 years ago
- Corpus creator for Chinese Wikipedia☆41Updated 4 years ago
- 🔥 A natural language processing framework focused on Chinese: Chinese word segmentation; class balancing; dataset splitting...☆13Updated 4 years ago
- Large-scale exact string matching tool☆17Updated 4 months ago
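- Scraper for the National Bureau of Statistics five-level address data for China (province, city, county, township, village), http://www.stats.gov.cn/tjsj/tjbz/tjyqhdmhcxhfdm/2018/index.html☆12Updated 5 years ago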
- BMInf demos.☆15Updated 3 years ago
- Finetune CPM-1☆24Updated 4 years ago
- auto push daily news with ai☆13Updated this week
- GLM (General Language Model)☆24Updated 3 years ago
- Source code and checkpoints for legal pre-trained language models.☆15Updated 4 years ago
- MOSS 003 WebSearchTool: A simple but reliable implementation☆45Updated 2 years ago
- Chinese NLP corpus datasets☆20Updated 6 years ago
- New word discovery, information entropy, left and right mutual information☆16Updated 6 years ago
- ☆19Updated 2 years ago
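- ES word segmentation plugin based on the Tencent TexSmart word segmentation SDK☆14Updated 4 years ago
- For personal study only. Please star or fork the original author's repository.☆27Updated 10 years ago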
- Perform crosstalk with Qian Yu☆54Updated last year
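- Pinyin-to-Chinese-character conversion for words and sentences, pinyin segmentation, pinyin completion, Chinese input in pygame☆15Updated 5 years ago
- General-purpose sequence labeling library based on TensorFlow & PaddlePaddle (currently includes BiLSTM+CRF, Stacked-BiLSTM+CRF, and IDCNN+CRF, with more algorithms being added) for Chinese word segmentation (Tokenizer / segmentation), part-of-speech tagging…☆84Updated 2 years ago
- Qimen refers to the art of Qimen Dunjia (奇门遁甲); a tool for extracting various kinds of entities.☆29Updated 5 years ago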
- ☆34Updated 3 years ago
- Attentional Neural Network that translates text to phones.☆11Updated 7 years ago
- Albert Large QA model trained on the Baidu webqa and dureader datasets☆75Updated 5 years ago
- Chinese entity extraction☆14Updated 6 years ago
- Train Wikidata with word2vec for word embedding tasks☆123Updated 7 years ago
- ☆7Updated 2 years ago