A Python package for deep multilingual punctuation prediction.
☆157Aug 21, 2024Updated last year
Alternatives and similar repositories for deepmultilingualpunctuation
Users who are interested in deepmultilingualpunctuation compare it to the libraries listed below.
- A model that predicts the punctuation of English, Italian, French and German texts.☆84Feb 22, 2023Updated 3 years ago
- The case study and multilingual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆119Apr 5, 2023Updated 2 years ago
- Punctuation Restoration using Transformer Models for High- and Low-Resource Languages☆228Jul 29, 2024Updated last year
- ☆37Nov 18, 2025Updated 4 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Nov 15, 2025Updated 4 months ago
- ☆13Dec 7, 2022Updated 3 years ago
- Calculates the word error rate of two strings and writes the result as beautified HTML.☆19Mar 19, 2020Updated 6 years ago
- Thai smart home corpus with "Gowajee" hotword☆18Jul 30, 2023Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆108Updated this week
- ☆32Jul 27, 2022Updated 3 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆15Aug 4, 2025Updated 7 months ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 4 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 9 months ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Mar 28, 2021Updated 4 years ago
- IPA Phonetic dataset lexicon☆18Mar 14, 2026Updated last week
- ☆14Aug 19, 2024Updated last year
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Jul 5, 2023Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆925Jun 3, 2025Updated 9 months ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 11 months ago
- ☆45Dec 15, 2022Updated 3 years ago
- Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training☆70Feb 7, 2026Updated last month
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Feb 4, 2023Updated 3 years ago
- This is the M-AILABS Speech Dataset☆107Jan 8, 2026Updated 2 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Apr 6, 2025Updated 11 months ago
- Context-Sensitive Neural Spelling Checker☆20Sep 25, 2024Updated last year
- ☆32Dec 4, 2022Updated 3 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Nov 12, 2024Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago
- Avalinguo Audio Dataset: Dataset for Speaker Fluency Level Classification☆13Aug 13, 2018Updated 7 years ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆25Apr 16, 2023Updated 2 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago
- a Frontier Japanese Speech Generation net☆62May 15, 2025Updated 10 months ago
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,…☆14May 7, 2024Updated last year
- ☆24Jan 14, 2021Updated 5 years ago