LqNoob / Neural-Codec-and-Speech-Language-Models
Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models
☆195 · Updated this week
Alternatives and similar repositories for Neural-Codec-and-Speech-Language-Models
Users that are interested in Neural-Codec-and-Speech-Language-Models are comparing it to the libraries listed below
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3 ☆220 · Updated last year
- UTokyo-SaruLab MOS Prediction System ☆257 · Updated last month
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec" ☆200 · Updated last year
- Audio-FLAN ☆160 · Updated last month
- Training code for FAcodec presented in NaturalSpeech3 ☆228 · Updated last year
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey". ☆185 · Updated this week
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions. ☆173 · Updated 7 months ago
- Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets' ☆143 · Updated 7 months ago
- ☆103 · Updated last month
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model ☆261 · Updated last month
- Audio Codec Speech processing Universal PERformance Benchmark ☆275 · Updated 4 months ago
- Reference-aware automatic speech evaluation toolkit ☆168 · Updated 11 months ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice. ☆202 · Updated 3 months ago
- A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.