Picovoice/tts-latency-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Picovoice/tts-latency-benchmark)

Picovoice / tts-latency-benchmark

Text-to-Speech Latency Benchmark

☆22

Alternatives and similar repositories for tts-latency-benchmark

Users that are interested in tts-latency-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fyvo / WMT-Biomed-Test
View on GitHub
☆13Aug 23, 2024Updated last year
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated last year
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆15Mar 13, 2024Updated 2 years ago
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 7 months ago
DaiYvhang / AISHELL-5
View on GitHub
In-car multi-channel speech transcription system of AISHELL-5.
☆42Jun 9, 2025Updated 9 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
microsoft / Distill-MOS
View on GitHub
Distillation of Self-Supervised Representation-Based Speech Quality Assessment
☆44May 15, 2025Updated 10 months ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
FunAudioLLM / CV3-Eval
View on GitHub
☆177Aug 25, 2025Updated 7 months ago
kyutai-labs / tts_longeval
View on GitHub
☆30Jan 22, 2026Updated 2 months ago
ogunlao / glowtts_stdp
View on GitHub
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆19Jun 5, 2023Updated 2 years ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆33Feb 2, 2025Updated last year
MuSAELab / AUDDT
View on GitHub
A toolkit for benchmarking on a wide variety of audio deepfake datasets.
☆29Oct 9, 2025Updated 5 months ago
xiquan-li / MeanAudio
View on GitHub
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
☆130Sep 2, 2025Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
facebookresearch / llama-hd-dataset
View on GitHub
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆22Jan 22, 2024Updated 2 years ago
lourson1091 / audiobertscore
View on GitHub
☆15Nov 10, 2025Updated 4 months ago
Stylish-TTS / stylish-tts
View on GitHub
High quality text-to-speech based on StyleTTS 2.
☆75Feb 25, 2026Updated last month
Chengyuann / AutoStyle-TTS
View on GitHub
Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…
☆24Feb 1, 2026Updated last month
codebyzeb / g2p-plus
View on GitHub
Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories
☆20Apr 10, 2025Updated 11 months ago
fakerybakery / OpenF5-TTS
View on GitHub
(WIP) A retrain of F5-TTS on permissively-licensed data
☆13Apr 6, 2025Updated 11 months ago
mt-upc / ZeroSwot
View on GitHub
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆26Dec 12, 2024Updated last year
X-LANCE / LSCodec-Inference
View on GitHub
Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"
☆35Oct 23, 2025Updated 5 months ago
ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
slp-rl / WhiStress
View on GitHub
The official repo of "WhiStress: Enriching Transcriptions with Sentence Stress Detection" (Interspeech 2025)
☆37Jul 24, 2025Updated 8 months ago
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
LudovicTuncay / Audio-JEPA
View on GitHub
Audio-JEPA is an adaptation of the Joint-Embedding Predictive Architecture (JEPA) for self-supervised audio representation learning. Buil…
☆44Mar 19, 2026Updated last week
k2-fsa / multi_quantization
View on GitHub
☆46Nov 2, 2023Updated 2 years ago
poleval / 2021-punctuation-restoration
View on GitHub
PolEval 2021 Task 1
☆15Jun 28, 2022Updated 3 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated 11 months ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
asappresearch / simple-tts
View on GitHub
Contains the code associated with the ICLR submission for our text-to-speech diffusion model
☆57Oct 31, 2023Updated 2 years ago
soham97 / mellow
View on GitHub
small audio language model for reasoning
☆85Dec 4, 2025Updated 3 months ago
xuchennlp / S2T
View on GitHub
The project for speech translation
☆12Sep 28, 2023Updated 2 years ago
google-deepmind / librispeech-long
View on GitHub
LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …
☆93Dec 28, 2024Updated last year
llm-lab-org / CLASP
View on GitHub
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13Jun 27, 2025Updated 9 months ago
Audio-WestlakeU / Rec-RIR
View on GitHub
Official PyTorch implementation of 'Rec-RIR: Monaural Blind Room Impulse Response Identification via DNN-based Reverberant Speech Reconst…
☆32Dec 25, 2025Updated 3 months ago
Audio-Foundation-Models / ConversationTTS
View on GitHub
☆100Jan 19, 2026Updated 2 months ago