amphionspace / SD-EvalLinks

[NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

☆56

Alternatives and similar repositories for SD-Eval

Users that are interested in SD-Eval are comparing it to the libraries listed below

Sorting:

yangdongchao / LLM-Codec
The open source code for LLM-Codec
☆145Updated last year
walker-hyf / NCSSD
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆62Updated last year
jishengpeng / WavReward
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators
☆54Updated 8 months ago
SparkAudio / VoxBox
A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.
☆105Updated 9 months ago
thuhcsi / VoxInstruct
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling
☆96Updated last year
y-ren16 / TiCodec
☆78Updated 6 months ago
KexinHUANG19 / InstructTTSEval
☆36Updated 7 months ago
ajd12342 / paraspeechcaps
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
☆153Updated 10 months ago
0nutation / USLM
Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)
☆152Updated 2 years ago
vivian556123 / NeurIPS2024-CoVoMix
Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
☆62Updated last year
lmxue / Audio-FLAN
Audio-FLAN
☆160Updated 4 months ago
youngsheen / GPST
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆67Updated last year
Ruiqi-Yan / URO-Bench
Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models
☆50Updated 5 months ago
ictnlp / SLED-TTS
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆110Updated 8 months ago
Shy-98 / MELLE
Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"
☆41Updated 7 months ago
Hannieliao / Emilia-NV
Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"
☆83Updated 4 months ago
X-LANCE / UniCATS-CTX-txt2vec
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
☆64Updated last year
MrSupW / ContextASR-Bench
A Massive Contextual Speech Recognition Benchmark.
☆99Updated 6 months ago
yangdongchao / ALMTokenizer
The demo page for ALMTokenizer
☆58Updated 9 months ago
jzmzhong / Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
☆51Updated last year
RicherMans / Dasheng
Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"
☆79Updated 3 months ago
DanielLin94144 / Full-Duplex-Bench
A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models
☆124Updated 4 months ago
Jiang-Yidi / UniCodec
[ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s…
☆154Updated 8 months ago
yanghaha0908 / FastHuBERT
Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning
☆96Updated last year
X-LANCE / UniCATS-CTX-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
☆130Updated last year
mct10 / RepCodec
Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization
☆191Updated last year
NKU-HLT / DIFFA
The official implementation of the DIFFA series for dLLM-based large audio language model
☆59Updated last week
nonverbalspeech38k / nonverspeech38k
The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…
☆63Updated last month
alibaba / vstyle
☆30Updated 4 months ago
ZhikangNiu / Semantic-VAE
Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"
☆107Updated last month