lijin0120 / CELSDSLinks
A Chinese Expressive Long-dialogue Speech Dataset with Scripts
☆20Updated last year
Alternatives and similar repositories for CELSDS
Users that are interested in CELSDS are comparing it to the libraries listed below
Sorting:
- ☆27Updated last year
- The repoduction codes for Qwen-Audio Fine-tuning☆52Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21Updated 5 months ago
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations☆62Updated 10 months ago
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆63Updated 5 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Updated last year
- Data Pipeline, Models, and Benchmark for Omni-Captioner.☆90Updated last month
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆38Updated 5 months ago
- ☆21Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆39Updated 8 months ago
- This is a repository for fine-tuning Qwen2-Audio, currently supporting Distributed Data Parallel (DDP) and DeepSpeed.☆44Updated 3 months ago
- Official repository for the WenetSpeech-Chuan dataset.☆73Updated last month
- STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation☆59Updated last week
- ☆16Updated last year
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆45Updated last year
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆14Updated last year
- Trainging, inference, and testing of the SAC speech codec model.☆83Updated 3 weeks ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆25Updated last year
- This repository collects papers related to Speech Tokenizer.☆17Updated last year
- Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"☆79Updated 2 months ago
- ☆31Updated 5 months ago
- Visual Speech Recongnition☆19Updated 11 months ago
- ☆59Updated last month
- Official release of StyleTalk dataset.☆70Updated last year
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆58Updated last year
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27Updated 5 months ago
- ☆24Updated 2 months ago
- The open-source code of UniAudio2.0☆73Updated 2 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Updated last year
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Updated 2 years ago