charlesXu86 / char_featurizer
汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型
☆128Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for char_featurizer
- Code for chinese error detection module, using n-gram and bi-lstm☆131Updated 5 years ago
- 基于bert进行中文文本纠错☆225Updated last year
- 李傲龍的博客☆81Updated 3 months ago
- 中文版unilm预训练模型☆82Updated 3 years ago
- 基于BERT的无监督分词和句法分析☆110Updated 4 years ago
- Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark☆128Updated last year
- Self complemented Pinyin2Chinese demo use algorithms including Trie and HMM model , 基于隐马尔科夫模型与Trie树的拼音切分与拼音转中文的简单demo实现。☆84Updated 6 years ago
- NLP NER datasets video/music/book bio☆83Updated 3 years ago