1-800-BAD-CODE / punctuatorsLinks

Package for inference for punctuation, true-casing, and sentence boundary detection

☆25

Alternatives and similar repositories for punctuators

Users that are interested in punctuators are comparing it to the libraries listed below

Sorting:

shivammehta25 / OverFlow
Putting flows on top of neural transducers for better TTS
☆62Updated 3 weeks ago
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆116Updated 2 years ago
voidful / asr-trainer
one script for xls-r/xlsr/whisper fine-tuning
☆42Updated 2 years ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆100Updated 9 months ago
MiniXC / LightningFastSpeech2
☆56Updated 2 years ago
feldberlin / timething
Timething is a library for aligning text transcripts with their audio recordings.
☆122Updated 7 months ago
Yaoming95 / UniPunc
The case study and multilingfual performance of ICASSP submission
☆24Updated 2 years ago
tomaarsen / TTSTextNormalization
Convert English text from written expressions into spoken forms
☆25Updated 3 years ago
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆104Updated 5 months ago
espnet / espnet_onnx
Onnx wrapper for espnet infrernce model
☆165Updated 9 months ago
laboroai / TEDxJP-10K
☆20Updated 4 years ago
JaesungHuh / SimpleDiarization
Simple diarization model
☆50Updated last month
Picovoice / voice-activity-benchmark
Voice activity engine benchmark framework
☆17Updated 2 months ago
roedoejet / g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆170Updated last month
ryanrudes / YTTTS
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆51Updated 4 years ago
p0p4k / vits3_pytorch
☆29Updated last year
fengredrum / finetune-whisper-lora
Fine-Tune Whisper with Transformers and PEFT
☆57Updated last year
oliverguhr / fullstop-deep-punctuation-prediction
A model that predicts the punctuation of English, Italian, French and German texts.
☆80Updated 2 years ago
xincanfeng / vitsGPT
☆57Updated last year
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45Updated 4 years ago
lukerbs / forcealign
ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…
☆17Updated 7 months ago
skysbird / g2p-zh-en
Chinese and English Bilinguish G2P
☆21Updated 2 years ago
k2-fsa / text_search
Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup
☆71Updated 2 weeks ago
resemble-ai / monotonic_align
Monotonic Alignment Search
☆96Updated last month
openaudiolab / LLaST
LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
☆25Updated 11 months ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
xinjli / asr2k
asr2k
☆51Updated last year
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆27Updated last year
Takaaki-Saeki / zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆63Updated 2 years ago
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆25Updated last year