nivibilla/local-llasa-tts

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nivibilla/local-llasa-tts)

nivibilla / local-llasa-tts

Examples of using the llasa-tts models locally

☆178

Alternatives and similar repositories for local-llasa-tts

Users that are interested in local-llasa-tts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhenye234 / LLaSA_training
View on GitHub
LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆660Jan 21, 2026Updated 6 months ago
croquelois / forgeChroma
View on GitHub
Add Chroma architecture to forge
☆40Jun 24, 2025Updated last year
Zuellni / LLaSA-WebUI
View on GitHub
LLaSA WebUI using ExLlamaV2 and FastAPI.
☆28Mar 30, 2025Updated last year
zhenye234 / LLaSA_inference
View on GitHub
☆43Feb 8, 2025Updated last year
seastar105 / pflow-encodec
View on GitHub
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
☆77Jul 13, 2026Updated last week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
Andong-Li-speech / BridgeVoC
View on GitHub
This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".
☆67Nov 5, 2025Updated 8 months ago
Aria-K-Alethia / laughter-synthesis
View on GitHub
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77Jul 16, 2023Updated 3 years ago
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
naver-ai / RapFlow-TTS
View on GitHub
☆56Jul 16, 2025Updated last year
ORI-Muchim / AudioSR-Upsampling
View on GitHub
AudioSR-Upsampling (any -> 48kHz)
☆42Feb 13, 2024Updated 2 years ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
redmist328 / APNet2
View on GitHub
Source code of APNet2, a vocoder
☆60Nov 23, 2023Updated 2 years ago
ramyma / a8r8
View on GitHub
A8R8 (Alternate Reality), an opinionated interface for Stable Diffusion image generation, and more.
☆121Oct 19, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
erew123 / alltalk_tts
View on GitHub
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…
☆2,416Jan 9, 2026Updated 6 months ago
X-LANCE / VoiceFlow-TTS
View on GitHub
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
☆376Sep 3, 2024Updated last year
isaiahbjork / orpheus-tts-local
View on GitHub
Run Orpheus 3B Locally With LM Studio
☆545Mar 20, 2025Updated last year
X-E-Speech / X-E-Speech-code
View on GitHub
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion
☆112Apr 1, 2024Updated 2 years ago
ryota-komatsu / speech_resynth
View on GitHub
Speech Resynthesis and Language Modeling
☆27Jun 11, 2025Updated last year
zhenye234 / X-Codec-2.0
View on GitHub
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆360Jun 25, 2026Updated 3 weeks ago
NVIDIA / audio-intelligence
View on GitHub
Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…
☆136Mar 3, 2026Updated 4 months ago
DrewThomasson / doc2interview
View on GitHub
This is an interface that will offline convert anything pdf document you give it into an interview between two people discussing it.
☆17Dec 8, 2024Updated last year
ETH-DISCO / discoder
View on GitHub
Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025
☆42Feb 24, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yl4579 / DMOSpeech2
View on GitHub
☆302Jul 22, 2025Updated last year
X-LANCE / LSCodec-Inference
View on GitHub
Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"
☆36Oct 23, 2025Updated 8 months ago
Ereboas / MagiCodec
View on GitHub
A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.
☆124Jun 4, 2025Updated last year
zengchang233 / xiaoicesing2
View on GitHub
The source code for the paper XiaoiceSing2 (interspeech2023)
☆49Jan 15, 2024Updated 2 years ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
kanttouchthis / ComfyUI-SDNQ
View on GitHub
SDNQ support for ComfyUI
☆20Jan 6, 2026Updated 6 months ago
Zyin055 / Keep-this-prompt-for-later
View on GitHub
Extension for Automatic1111
☆22Jun 2, 2024Updated 2 years ago
yangdongchao / RSTnet
View on GitHub
Real-time Speech-Text Foundation Model Toolkit (wip)
☆256Mar 26, 2025Updated last year
NeuralVox / StyleTTS2
View on GitHub
☆98Apr 27, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
IDEA-Emdoor-Lab / DistilCodec
View on GitHub
A Neural Audio Codec (NAC) for Universal Audio
☆46May 30, 2025Updated last year
cocktailpeanut / ideoprompt
View on GitHub
☆21Jun 10, 2026Updated last month
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago
fakerybakery / OpenF5-TTS
View on GitHub
(WIP) A retrain of F5-TTS on permissively-licensed data
☆14Apr 6, 2025Updated last year
sn0w12 / ComfyUI-Sn0w-Scripts
View on GitHub
Collection of lora management and misc nodes for ComfyUI.
☆18Jun 27, 2026Updated 3 weeks ago
keonlee9420 / evaluate-zero-shot-tts
View on GitHub
Evaluation Protocol for Large-Scale Zero-Shot TTS Literature
☆97Mar 12, 2025Updated last year
e-c-k-e-r / vall-e
View on GitHub
An unofficial PyTorch implementation of VALL-E
☆88Aug 3, 2025Updated 11 months ago