1-800-BAD-CODE / punctuators
Inference package for punctuation, true-casing, and sentence boundary detection
☆24 Updated 8 months ago
Alternatives and similar repositories for punctuators:
Users who are interested in punctuators are comparing it to the libraries listed below:
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆107 Updated 2 years ago
- One script for xls-r/xlsr/whisper fine-tuning ☆40 Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆43 Updated 3 years ago
- ☆80 Updated 8 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆93 Updated 4 months ago
- Simple Diarization model ☆47 Updated last year
- ☆56 Updated 2 years ago
- Explores different ways to mix speech models (wav2vec2, hubert) and NLP models (BART, T5, GPT) together ☆44 Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from the Ferraz et al. 2024 IEEE ICASSP paper. ☆20 Updated 11 months ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆21 Updated 5 months ago
- ☆71 Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis ☆29 Updated 6 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- Collection of scripts from mHuBERT-147. ☆24 Updated 3 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- Prosodic Speech Segmentation with Transformers ☆25 Updated 11 months ago
- asr2k ☆49 Updated 8 months ago
- Convert English text from written expressions into spoken forms ☆24 Updated 2 years ago
- ☆34 Updated 3 years ago
- Forced alignment decoder for Whisper. ☆14 Updated 11 months ago
- Putting flows on top of neural transducers for better TTS ☆62 Updated this week
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs ☆50 Updated 4 months ago
- ☆18 Updated 4 years ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight. ☆67 Updated 2 months ago
- ☆56 Updated 2 years ago
- The case study and multilingual performance of the ICASSP submission ☆20 Updated 2 years ago
- ☆19 Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆147 Updated last month
- Official code for Wav2Seq ☆96 Updated 2 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr… ☆16 Updated 11 months ago