1-800-BAD-CODE / punctuators
Package for inference for punctuation, true-casing, and sentence boundary detection
☆24Updated 7 months ago
Alternatives and similar repositories for punctuators:
Users that are interested in punctuators are comparing it to the libraries listed below
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- ☆68Updated last year
- one script for xls-r/xlsr/whisper fine-tuning☆40Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆50Updated 3 months ago
- ☆63Updated last month
- Easy-to-Use Speech MOS predictors☆251Updated last year
- ☆28Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆29Updated 5 months ago
- Convert English text from written expressions into spoken forms☆22Updated 2 years ago
- ☆52Updated 6 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated 10 months ago
- ☆56Updated 2 years ago
- asr2k☆48Updated 7 months ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated last week
- ☆79Updated 7 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 2 years ago
- Official Code for ParrotTTS☆46Updated 3 months ago
- ☆18Updated 4 years ago
- Collection of scripts from mHuBERT-147.☆23Updated 2 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆144Updated this week
- ☆33Updated 3 years ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆66Updated 3 weeks ago
- ☆20Updated 5 months ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆19Updated 4 months ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆15Updated 10 months ago
- ☆84Updated 3 years ago
- ☆34Updated last week
- 56 language, 1 model Multilingual ASR☆24Updated 3 years ago