1-800-BAD-CODE / punctuators
Package for inference for punctuation, true-casing, and sentence boundary detection
☆28Updated last year
Alternatives and similar repositories for punctuators
Users who are interested in punctuators are comparing it to the libraries listed below
- Putting flows on top of neural transducers for better TTS☆64Updated last week
- ONNX wrapper for ESPnet inference model☆168Updated 5 months ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- ☆55Updated 3 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆104Updated last year
- ☆28Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Updated 7 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- PyTorch model for contextless phoneme prediction from speech audio☆30Updated 3 months ago
- Official code for Wav2Seq☆97Updated 3 years ago
- Fine-Tune Whisper with Transformers and PEFT☆58Updated 2 years ago
- ☆45Updated 3 years ago
- ☆64Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆186Updated this week
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 11 months ago
- multilingual speech aligner☆76Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Updated 3 months ago
- asr2k☆52Updated last year
- ☆58Updated last year
- Finetuning VITS Efficiently☆33Updated 2 years ago
- ☆15Updated 2 months ago
- Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together☆46Updated 6 months ago
- The case study and multilingual performance of ICASSP submission☆24Updated 3 years ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆77Updated last month
- Timething is a library for aligning text transcripts with their audio recordings.☆128Updated last year
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Updated last year