1-800-BAD-CODE / punctuatorsLinks
Package for inference for punctuation, true-casing, and sentence boundary detection
☆25Updated last year
Alternatives and similar repositories for punctuators
Users that are interested in punctuators are comparing it to the libraries listed below
Sorting:
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆100Updated 9 months ago
- ☆56Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 7 months ago
- The case study and multilingfual performance of ICASSP submission☆24Updated 2 years ago
- Convert English text from written expressions into spoken forms☆25Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆104Updated 5 months ago
- Onnx wrapper for espnet infrernce model☆165Updated 9 months ago
- ☆20Updated 4 years ago
- Simple diarization model☆50Updated last month
- Voice activity engine benchmark framework☆17Updated 2 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆170Updated last month
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- ☆29Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆57Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- ☆57Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆17Updated 7 months ago
- Chinese and English Bilinguish G2P☆21Updated 2 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆71Updated 2 weeks ago
- Monotonic Alignment Search☆96Updated last month
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Updated 11 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- asr2k☆51Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆27Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year