zhiyulee-RUC / CTools
Python常见工具集合-繁简转换/繁体转换; 词频统计;
☆20Updated 8 years ago
Alternatives and similar repositories for CTools:
Users that are interested in CTools are comparing it to the libraries listed below
- chinese anti semantic word search interface based on dict crawled from online resources, ChineseAntiword, 针对中文词语的反义词查询接口☆59Updated 6 years ago
- 利用深度学习实现中文分词☆61Updated 7 years ago
- Joint Slot Filling and Intent Prediction Use Attention and Slot Gate. NER, Intent classification☆40Updated 5 years ago
- 人民日报1998年1-4月中文标注语料库☆32Updated 6 years ago
- self complemented SpellCorrection based pinyin similairity, edit distance ,基于拼音相似度与编辑距离的查询纠错。☆82Updated 2 years ago
- 新词发现 基于词频、凝聚系数和左右邻接信息熵☆122Updated 5 years ago
- 新词发现算法(NewWordDetection)☆92Updated 4 years ago
- 基于最小熵原理的NLP工具包☆138Updated 3 years ago
- 利用文本分析算法和Python脚本,自动纠正word中的英语单词拼写错误☆47Updated 6 years ago
- 各大中文分词性能评测☆157Updated 6 years ago
- 2018年机器阅读理解技术竞赛总结,国内外1000多支队伍中BLEU-4评 分排名第6, ROUGE-L评分排名第14。(未ensemble,未嵌入训练好的词向量,无dropout)☆30Updated 6 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆137Updated 4 years ago
- 依存关系分析,NLP,自然语言处理☆85Updated 3 years ago
- 中文谐音词/字库(同音词/字)Chinese Homophones☆103Updated 5 years ago
- 常用的中文停用词表☆75Updated 7 years ago
- Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。☆86Updated 7 years ago
- 中文 NLP 语料库数据集☆20Updated 6 years ago
- ☆79Updated 8 years ago
- SmoothNLP领域词汇示例 - 基于复旦公开新闻资讯库☆49Updated 5 years ago
- A Chinese word segment model based on BERT, F1-Score 97%☆92Updated 5 years ago
- 中文单词自动纠错☆121Updated 4 years ago
- 新词发现算法(NewWordDetection)☆62Updated 7 years ago
- 基于mlm方式的带有纠错功能的拼音转汉字bert预训练模型,pinyin correcter,基于pytorch框架实现☆45Updated 4 years ago
- 基于 TensorFlow & PaddlePaddle 的通用序列标注算法库(目前包含 BiLSTM+CRF, Stacked-BiLSTM+CRF 和 IDCNN+CRF,更多算法正在持续添加中)实现中文分词(Tokenizer / segmentation)、词性标注…☆84Updated 2 years ago
- lasertagger-chinese;lasertagger中文学习案例,案例数据,注释,shell运行☆75Updated 2 years ago
- Chinese new word discovery☆42Updated 8 months ago
- SMP2017中文人机对话评测数据☆107Updated 7 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆170Updated 6 years ago
- BiLSTM-ELMo-CNN-CRF for CoNLL 2003Updated 5 years ago
- 使用python实现了一个简单的trie树结构,可增加/查找/删除关键词,用于中文文本的关键词匹配、停用词删除等。☆64Updated 5 years ago