System-T / DimSimLinks
☆125Updated 4 years ago
Alternatives and similar repositories for DimSim
Users that are interested in DimSim are comparing it to the libraries listed below
Sorting:
- Use bert to predict punctuation on IWSLT2012 and The People's Daily 2014☆66Updated 5 years ago
- python | 高效使用统计语言模型kenlm:新词发现、分词、智能纠错等☆164Updated 5 years ago
- 基于Pytorch 1.0 实现的中文断句与标点符号恢复。☆58Updated 6 years ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆181Updated 6 years ago
- A Bert-CNN-LSTM model for punctuation restoration☆58Updated 2 years ago
- 拼音转汉字, convert pinyin to 汉字 using deep networks☆22Updated 4 years ago
- SpellGCN☆252Updated 4 years ago
- A python module that convert chinese written string to read string. 一个python包:将中文书面字符串转换为口语字符串。☆122Updated 5 years ago
- kenlm语言模型,并提供python的rest服务☆29Updated 6 years ago
- 中文谐音词/字库(同音词/字)Chinese Homophones☆108Updated 5 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆61Updated 5 years ago
- Chinese text normalization. 中文文本规范化。☆55Updated 4 years ago
- 用于存储NLP常用模型☆145Updated 5 years ago
- ☆39Updated 4 years ago
- soft_mask_bert model for Chinese Spelling Correction in keras☆21Updated 4 years ago
- A Pytorch based LSTM Punctuation Restoration Implementation/A Simple Tutorial for Leaning Pytorch and NLP☆24Updated 4 years ago
- This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"☆295Updated 5 years ago
- TestB榜第10的方案,bleu32.1☆63Updated 5 years ago
- ☆167Updated 3 years ago
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆37Updated 2 months ago
- g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese☆241Updated 6 years ago
- ☆173Updated 2 years ago
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆92Updated 5 years ago
- 中文单词自动纠错☆121Updated 4 years ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆137Updated 5 years ago
- 人民日报1998年1-4月中文标注语料库☆32Updated 6 years ago
- Code for ACL 2020 paper "Rigid Formats Controlled Text Generation":https://www.aclweb.org/anthology/2020.acl-main.68/☆236Updated 4 years ago
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021☆237Updated 2 years ago
- Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition☆18Updated last month
- The code for our ACL2022 findings paper: CRACSpell: A Contextual Typo Robust Approach with Copy Mechanism to Improve Chinese Spelling Cor…☆75Updated 3 years ago