explosion / spacy-pkuseg
The pkuseg toolkit for multi-domain Chinese word segmentation
☆57 Updated 5 months ago
Alternatives and similar repositories for spacy-pkuseg:
Users who are interested in spacy-pkuseg are comparing it to the libraries listed below.
- A convenient Chinese word segmentation tool ☆46 Updated last month
- A Chinese punctuation model that can add punctuation marks to text. ☆135 Updated last month
- Performance benchmarks of major Chinese word segmentation tools ☆155 Updated 6 years ago
- Grapheme-to-Phoneme lexicons for Chinese dialects ☆67 Updated 2 years ago
- PyTorch model for https://github.com/imcaspar/gpt2-ml ☆79 Updated 3 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue). ☆52 Updated 11 months ago
- CINO: Pre-trained Language Models for Chinese Minority Languages ☆228 Updated last year
- A bot that converts text to vectors, built on sentence-transformers ☆46 Updated 2 years ago
- ☆172 Updated 2 years ago
- Improves the accuracy of pypinyin using g2pW ☆83 Updated last year
- Chinese sentence segmentation and punctuation restoration, implemented in PyTorch 1.0. ☆56 Updated 5 years ago
- A large, high-quality corpus of Chinese synonyms ☆42 Updated 3 years ago
- Ultra-fast Mandarin Chinese TTS ☆117 Updated 3 years ago
- ☆125 Updated 3 years ago
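- Python | Efficient use of the statistical language model kenlm: new word discovery, word segmentation, intelligent error correction, and more ☆162 Updated 5 years ago
- 渊 - A project for Classical Chinese ☆97 Updated 2 years ago
- A tool for time expression extraction, parsing, and normalization ☆50 Updated 2 years ago
- A self-implemented Pinyin2Chinese demo that uses a Trie and a hidden Markov model (HMM) for pinyin segmentation and pinyin-to-Chinese conversion ☆86 Updated 6 years ago
- Chinese text error correction ☆92 Updated 2 years ago
- 📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more) ☆700 Updated 2 months ago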
- ☆51 Updated 4 years ago
- Cantonese word segmentation tool ☆46 Updated 6 years ago
- ☆102 Updated 4 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition. ☆255 Updated 5 years ago
- ☆75 Updated last year
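- A sentence-vector generation tool based on pre-trained models ☆134 Updated last year
- Chinese text error correction based on BERT ☆228 Updated last year
- Uses BERT to predict punctuation on IWSLT2012 and The People's Daily 2014 ☆65 Updated 4 years ago
- The best tool for converting Chinese numerals to Arabic digits. Handles many Chinese expressions such as "点二八" (point two eight) and "负百分之四十" (negative forty percent). Essential for NLP and chatbot engineering. ☆357 Updated last year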
- Text Normalization & Inverse Text Normalization ☆534 Updated 3 months ago