1-800-BAD-CODE / punctuators
Package for inference for punctuation, true-casing, and sentence boundary detection
☆23Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for punctuators
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆84Updated last month
- one script for xls-r/xlsr/whisper fine-tuning☆39Updated last year
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆65Updated last week
- Putting flows on top of neural transducers for better TTS☆63Updated 3 weeks ago
- ☆28Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆100Updated last year
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆22Updated 3 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆9Updated 9 months ago
- ☆66Updated last year
- Simple Diarization model☆42Updated 11 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- ☆54Updated this week
- ☆33Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆24Updated 3 years ago
- ☆56Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆19Updated 2 months ago
- Finetuning VITS Efficiently☆32Updated last year
- Convert English text from written expressions into spoken forms☆21Updated 2 years ago
- ☆77Updated 6 months ago
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆63Updated 8 months ago
- Fine-Tune Whisper with Transformers and PEFT☆38Updated last year
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆29Updated 3 months ago
- Collection of scripts from mHuBERT-147.☆22Updated this week
- Official Code for ParrotTTS☆43Updated last month
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆47Updated last month
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆61Updated last month
- ☆22Updated 3 years ago
- ☆40Updated 2 years ago