MatthewCYM / VoiceBench
VoiceBench: Benchmarking LLM-Based Voice Assistants
☆101Updated this week
Alternatives and similar repositories for VoiceBench:
Users that are interested in VoiceBench are comparing it to the libraries listed below
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆140Updated last year
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆166Updated 6 months ago
- The open source code for LLM-Codec☆123Updated 5 months ago
- AudioBench: A Universal Benchmark for Audio Large Language Models☆116Updated last week
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆49Updated 7 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆126Updated 3 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆73Updated 7 months ago
- Official release of StyleTalk dataset.☆60Updated 6 months ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆64Updated 2 months ago
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models☆95Updated last week
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆82Updated 3 weeks ago
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆49Updated 3 weeks ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆84Updated 2 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".☆112Updated 2 weeks ago
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆81Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆139Updated 10 months ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆126Updated 7 months ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆33Updated last year
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆139Updated 9 months ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆29Updated last year
- ☆12Updated 10 months ago
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations☆44Updated 2 weeks ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆42Updated 7 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆52Updated 2 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆70Updated 4 months ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆61Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆43Updated 2 months ago
- ☆63Updated 4 months ago
- ConMamba for Automatic Speech Recognition☆54Updated 5 months ago