Model for recasing and repunctuating ASR transcripts
β143Apr 10, 2024Updated last year
Alternatives and similar repositories for recasepunc
Users that are interested in recasepunc are comparing it to the libraries listed below
Sorting:
- β17Apr 14, 2023Updated 2 years ago
- π·πΊ Punctuation restoration production-ready model for Russian language π·πΊβ59Jul 9, 2021Updated 4 years ago
- β13Dec 7, 2022Updated 3 years ago
- β57Apr 18, 2023Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output.β55Dec 8, 2022Updated 3 years ago
- Evaluation of STT models for german languageβ15Jan 22, 2022Updated 4 years ago
- β13Oct 27, 2021Updated 4 years ago
- β37Nov 22, 2025Updated 3 months ago
- β56Dec 19, 2022Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decodingβ75Oct 11, 2021Updated 4 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languagesβ227Jul 29, 2024Updated last year
- β14Jun 12, 2015Updated 10 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β21Jun 7, 2025Updated 8 months ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speechβ51Oct 8, 2021Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Feb 15, 2024Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Zβ¦β32Apr 8, 2022Updated 3 years ago
- Open source cross-platform implementation of MRCP protocolβ20Mar 3, 2022Updated 4 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transferβ37Mar 2, 2022Updated 4 years ago
- β20Jul 22, 2022Updated 3 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.β469Jul 13, 2023Updated 2 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic biasβ20Jul 21, 2020Updated 5 years ago
- Making Espnet easier to useβ54Apr 9, 2021Updated 4 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text toβ¦β45May 25, 2021Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.β150Aug 25, 2023Updated 2 years ago
- A GPU language model, based on btree backed tries.β29Mar 6, 2018Updated 8 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.β26Jul 25, 2024Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model.β74Oct 9, 2020Updated 5 years ago
- wake word spotting with kaldiβ19Dec 3, 2020Updated 5 years ago
- β16Jun 13, 2022Updated 3 years ago
- Server framework for Kaldi ASR Toolkitβ98Sep 17, 2023Updated 2 years ago
- BurrMill coreβ22Nov 2, 2021Updated 4 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcriptsβ16Dec 3, 2024Updated last year
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.β¦β11Feb 4, 2020Updated 6 years ago
- Gstreamer plugin for VOSK voice recognition engineβ14Oct 2, 2022Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package)β345May 15, 2024Updated last year
- Tool to make high quality text to speech (tts) corpus from audio + text books.β28Jul 31, 2025Updated 7 months ago
- End-to-end spoken language identification out of the box.β48Dec 13, 2020Updated 5 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ26Aug 5, 2024Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.β33Oct 23, 2025Updated 4 months ago