VoiceHub: A Unified Inference Interface for TTS Models
☆67Feb 16, 2026Updated 2 weeks ago
Alternatives and similar repositories for VoiceHub
Users that are interested in VoiceHub are comparing it to the libraries listed below
Sorting:
- Bu Course LLM(Large Language Model) Fine Tune işlemlerini Türkçe klavuz olarak☆11Mar 29, 2025Updated 11 months ago
- ☆12Dec 24, 2024Updated last year
- Torchreid-Pip: Packaged version of Torchreid☆14Oct 16, 2022Updated 3 years ago
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated last year
- Interactive web-based digital logic circuit designer & simulator with AI-powered features.☆21Feb 25, 2026Updated last week
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆78Apr 7, 2025Updated 10 months ago
- StrongSort-Pip: Packaged version of StrongSort☆10Sep 3, 2022Updated 3 years ago
- ☆16Jun 26, 2023Updated 2 years ago
- LossHub: Loss Functions Library for Image Classification and Detection☆14Oct 9, 2022Updated 3 years ago
- Mason-Alberta Phonetic Segmenter☆15Feb 24, 2026Updated last week
- Bu repo SAHI uygulamasını mantığını öğreniyoruz.☆12Mar 11, 2022Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Nov 15, 2025Updated 3 months ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- ☆16Apr 25, 2025Updated 10 months ago
- Fast profanity word, curse word, swear word, bad word filtering tool for English, Spanish, Chinese, Turkish and more.☆49Dec 27, 2025Updated 2 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 2 years ago
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆18May 23, 2024Updated last year
- ☆18Jul 13, 2024Updated last year
- This repository contains the training codes of the fine-tuned SpeechT5 on a Turkish dataset.☆21Sep 4, 2024Updated last year
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Explore AI Capabilities for Your .NET Projects with OpenAI's API: Unlock the power of AI in your applications☆26Sep 23, 2025Updated 5 months ago
- The repository of Typhoon2-Audio, Thai audio-language model that supports speech-in and speech-out☆34Feb 13, 2026Updated 2 weeks ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- it's a train acoustics model code lib☆27May 20, 2020Updated 5 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆34Aug 27, 2023Updated 2 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆57Aug 9, 2025Updated 6 months ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆38Feb 24, 2025Updated last year
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- ☆36Sep 6, 2025Updated 5 months ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 9 months ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigat…☆56Jan 18, 2026Updated last month
- Teknofest 2023 Türkçe Doğal Dil İşleme yarışması için gerçekleştirilen bu çalışma, Shap Analizi yöntemi kullanılarak modelin tahminlerini…☆28Mar 31, 2023Updated 2 years ago