☆127Mar 12, 2021Updated 5 years ago
Alternatives and similar repositories for DimSim
Users that are interested in DimSim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Estimate the phonetic distance between Chinese words and get similar sounding candidate words.☆38Sep 17, 2025Updated 6 months ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆167Apr 29, 2022Updated 3 years ago
- ☆11Oct 24, 2022Updated 3 years ago
- Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition☆18Jun 4, 2025Updated 9 months ago
- mWER loss implementation in tensorflow☆31Sep 7, 2020Updated 5 years ago
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- ☆24Jun 17, 2020Updated 5 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- 基于规则和相似匹配的闲聊机器人☆13Nov 8, 2017Updated 8 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Unsupervised domain adaptation with BERT for Amazon food product reviews sentiment analysis.☆15Oct 6, 2020Updated 5 years ago
- ☆50Dec 26, 2020Updated 5 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- ☆24Mar 13, 2020Updated 6 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。☆6,405Jan 12, 2026Updated 2 months ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆48Jun 27, 2018Updated 7 years ago
- Chinese text normalization for speech processing☆722Mar 18, 2023Updated 3 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Jun 22, 2022Updated 3 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Nov 20, 2014Updated 11 years ago
- ☆13May 9, 2022Updated 3 years ago
- Chinese "spelling" error correction☆263Nov 28, 2017Updated 8 years ago
- Code accompanying Incorporating Chinese Characters of Words for Lexical Sememe Prediction (ACL2018) https://arxiv.org/abs/1806.06349☆24Sep 28, 2018Updated 7 years ago
- 对常用的6700个汉字进行音、形比较,输出音近字、形近字的列表。 # 相近字☆482Mar 28, 2024Updated last year
- Python library for GeneiaTagger☆10May 7, 2015Updated 10 years ago
- A 10000+ hours dataset for Chinese speech recognition☆595Jan 9, 2026Updated 2 months ago
- CommonsenseQA☆10Mar 28, 2020Updated 5 years ago
- Implements a proof-of-concept of a multi-level clustering algorithm designed to enable extremely fast approximate match search in a large…☆12Feb 24, 2013Updated 13 years ago
- Express anger to your professor with just a script.☆12Oct 25, 2021Updated 4 years ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Jun 9, 2023Updated 2 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆56Oct 9, 2020Updated 5 years ago
- ROUGE for multilingual Summarization☆25Oct 11, 2021Updated 4 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago