Vyvo-Labs/SpeechPlus

SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀

☆21

Alternatives and similar repositories for SpeechPlus

Users who are interested in SpeechPlus are comparing it to the libraries listed below.

EZ-VC / EZ-VC
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆41 · Sep 9, 2025 · Updated 10 months ago
llm-jp / llama-mimi
Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…
☆31 · Sep 20, 2025 · Updated 9 months ago
sushant-t / tts-trainer
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆30 · May 27, 2023 · Updated 3 years ago
cnaigithub / SpeechDewarping
Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023
☆27 · Apr 27, 2023 · Updated 3 years ago
rhasspy / glow-speak
Neural text to speech system that uses eSpeak as a text/phoneme front-end
☆16 · Oct 20, 2021 · Updated 4 years ago
shengcanxu / canoSpeech
text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago
dafyddg / RFA
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…
☆17 · Apr 27, 2023 · Updated 3 years ago
msalhab96 / MultiSpeech
PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"
☆21 · Jun 23, 2022 · Updated 4 years ago
sil-ai / tts-singlish
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11 · Jan 11, 2020 · Updated 6 years ago
huangruizhe / audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆11 · Sep 30, 2024 · Updated last year
jdh-algo / JoyTTS
☆41 · Jul 15, 2025 · Updated 11 months ago
PanagiotisP / svs-multiband
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", accepted at EUSIPCO 2022
☆15 · Jun 18, 2022 · Updated 4 years ago
zelaki / DisfluentFA
A Weakly Supervised Forced Alignment for disfluent speech
☆15 · Nov 12, 2023 · Updated 2 years ago
Adibian / Persian-MultiSpeaker-Tacotron2
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.
☆13 · Oct 2, 2025 · Updated 9 months ago
jplhughes / emotion_detection_cpc
Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).
☆43 · Feb 16, 2022 · Updated 4 years ago
jhuang448 / MultilingualALT
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"
☆15 · Jun 28, 2024 · Updated 2 years ago
llm-lab-org / CLASP
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13 · Jun 27, 2025 · Updated last year
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆21 · Feb 10, 2025 · Updated last year
KdaiP / conformer-RoPE
Conformer block with Rotary Position Embedding, modified from lucidrains' implementation
☆19 · Sep 13, 2024 · Updated last year
duerig / StyleTTS2
StyleTTS 2 Optimized Training Fork
☆32 · Feb 2, 2025 · Updated last year
lucasnewman / nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
☆190 · Aug 3, 2025 · Updated 11 months ago
tihu-nlp / tihudict
Tihu dictionary for Persian language
☆13 · Sep 8, 2019 · Updated 6 years ago
huaidanquede / Dense-TSNet
Official code for Dense-TSNet
☆12 · Sep 17, 2024 · Updated last year
de-mh / g2p_fa
A Grapheme to Phoneme model using LSTM implemented in PyTorch
☆14 · Jul 6, 2022 · Updated 4 years ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by Lightning
☆18 · Oct 20, 2024 · Updated last year
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆16 · Mar 13, 2024 · Updated 2 years ago
auspicious3000 / ProsodyLM
ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models
☆46 · Nov 18, 2025 · Updated 7 months ago
tuanh123789 / Spark-TTS-finetune
Fine-tune the LLM part of the Spark-TTS model
☆124 · Mar 25, 2025 · Updated last year
Stylish-TTS / stylish-tts
High quality text-to-speech based on StyleTTS 2.
☆77 · Apr 6, 2026 · Updated 3 months ago
xingchensong / CosyVoice-ttsfrd
☆25 · Jun 19, 2025 · Updated last year
nineninesix-ai / KaniTTS-Finetune-pipeline
☆27 · Nov 3, 2025 · Updated 8 months ago
adelacvg / detail_tts
All generative models in one for a better TTS model
☆74 · Sep 8, 2024 · Updated last year
mush42 / mantoq
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16 · Mar 15, 2025 · Updated last year
winddori2002 / DEX-TTS
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
☆108 · Jan 17, 2025 · Updated last year
dubverse-ai / MahaTTS
☆275 · Jun 8, 2024 · Updated 2 years ago
that-cod / awesome-grok-imagine-prompts
A curated collection of prompts for Grok Imagine by xAI
☆32 · Jun 6, 2026 · Updated last month
Vyvo-Labs / VyvoTTS
VyvoTTS: LLM-Based Text-to-Speech Training Framework
☆257 · Apr 8, 2026 · Updated 3 months ago
salesforce / speech-datasets
Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…
☆15 · Jun 25, 2026 · Updated last week
rishikksh20 / MiniMax-TTS-pytorch
An attempt to replicate the architecture of MiniMaxTTS described in its technical report
☆47 · Sep 2, 2025 · Updated 10 months ago