pengzhendong / pyrnnoise
Python Wrapper for RnNoise v0.2
☆17Updated 2 months ago
Related projects:
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆28Updated 8 months ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆16Updated last month
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆24Updated 6 months ago
- Python Wrapper of Silero VAD☆38Updated 2 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆43Updated 2 months ago
- Speech samples and code of BEdit-TTS☆32Updated 11 months ago
- ConMamba for Automatic Speech Recognition☆38Updated last month
- Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.☆17Updated last year
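- Mandarin-English code-switching speech recognition algorithm based on monolingual corpora, from the Tonghuashun (同花顺) Algorithm Challenge, September-October 2021 bimonthly contest☆14Updated 3 years ago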
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- ☆27Updated 5 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆31Updated last year
- ☆13Updated this week
- ☆22Updated 2 months ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆25Updated 3 weeks ago
- Implementation of CTC alignment-based single step non-autoregressive transformer☆11Updated last year
- ☆35Updated 7 months ago
- ☆18Updated this week
- An ASR decoder and graph-building (make graph) project☆32Updated 2 years ago
- SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆34Updated 2 months ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset☆23Updated 2 weeks ago
- ☆13Updated 2 years ago
- The implementation of g2pL with a new open dataset.☆15Updated last year
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆58Updated last week
- ☆28Updated this week
- Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files☆47Updated 2 months ago
- WeNet online decoding demo☆30Updated 3 years ago