1-800-BAD-CODE / punctuators
Package for inference for punctuation, true-casing, and sentence boundary detection
☆25Updated 11 months ago
Alternatives and similar repositories for punctuators
Users who are interested in punctuators are comparing it to the libraries listed below.
- Putting flows on top of neural transducers for better TTS☆62Updated last week
- ☆56Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆25Updated 6 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- Simple diarization model☆49Updated last year
- The case study and multilingual performance of ICASSP submission☆24Updated 2 years ago
- ☆80Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆62Updated last month
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆71Updated 9 months ago
- ☆22Updated 3 years ago
- ☆103Updated last week
- ☆56Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆24Updated last year
- ONNX wrapper for ESPnet inference model☆162Updated 7 months ago
- asr2k☆50Updated last year
- ☆36Updated last month
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆15Updated 6 months ago
- Convert English text from written expressions into spoken forms☆25Updated 2 years ago
- Official Code for ParrotTTS☆51Updated 7 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆31Updated 10 months ago
- ☆19Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆29Updated 4 months ago
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆23Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- A fast parallel implementation of RNN Transducer.☆12Updated last month
- ☆26Updated 4 months ago