OpenT2S / LlamaVoiceLinks

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

☆233

Alternatives and similar repositories for LlamaVoice

Users that are interested in LlamaVoice are comparing it to the libraries listed below

Sorting:

zhenye234 / X-Codec-2.0
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆279Updated last week
yangdongchao / RSTnet
Real-time Speech-Text Foundation Model Toolkit (wip)
☆237Updated 2 months ago
hlt-mt / mosel
Collection of Open Source Speech Data
☆159Updated 7 months ago
wenet-e2e / wesr
We Speech Transcript based on LLM, in 300 lines of code.
☆164Updated this week
theodorblackbird / lina-speech
Official implementation of the TTS model Lina-Speech
☆165Updated 5 months ago
huggingface / dataspeech
☆365Updated 9 months ago
alibabasglab / MossFormer2
This is the audio sample repository for speech separation model "MossFormer2".
☆133Updated 6 months ago
MatthewCYM / VoiceBench
VoiceBench: Benchmarking LLM-Based Voice Assistants
☆222Updated last week
PolyAI-LDN / pheme
☆258Updated last year
Lollipop / Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
☆35Updated 9 months ago
KdaiP / StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
☆411Updated 9 months ago
zhenye234 / LLaSA_inference
☆40Updated 4 months ago
ScottishFold007 / TTSAudioNormalizer
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…
☆99Updated 6 months ago
kyutai-labs / moshi-finetune
☆238Updated 2 months ago
e-c-k-e-r / vall-e
An unofficial PyTorch implementation of VALL-E
☆87Updated 3 weeks ago
zhenye234 / CoMoSpeech
ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
☆208Updated last year
baichuan-inc / Baichuan-Audio
Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction
☆197Updated 3 months ago
0nutation / USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
☆145Updated last year
mush42 / optispeech
A lightweight end-to-end text-to-speech model
☆114Updated 3 months ago
adelacvg / ttts
Train the next generation of TTS systems.
☆165Updated 9 months ago
taresh18 / TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆84Updated last month
WangHelin1997 / SSR-Speech
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis
☆135Updated 5 months ago
hrnoh24 / stream-vc
An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)
☆124Updated 10 months ago
emo-box / EmoBox
[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark
☆250Updated 2 months ago
xingchensong / S3Tokenizer
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
☆335Updated this week
ex3ndr / supervoice-voicebox
VoiceBox neural network implementation
☆109Updated 10 months ago
skirdey / voicerestore
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
☆173Updated 2 months ago
jzq2000 / MoonCast
☆240Updated 2 months ago
yzGuu830 / efficient-speech-codec
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
☆119Updated 3 months ago
Aria-K-Alethia / BigCodec
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
☆170Updated 9 months ago