ictnlp / ComSpeech
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
☆24Updated 6 months ago
Alternatives and similar repositories for ComSpeech:
Users that are interested in ComSpeech are comparing it to the libraries listed below
- A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.☆63Updated 2 months ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆23Updated 5 months ago
- Official release of StyleTalk dataset.☆60Updated 6 months ago
- ☆34Updated 9 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆60Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆42Updated 2 months ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆33Updated last year
- ☆18Updated 8 months ago
- VoiceBench: Benchmarking LLM-Based Voice Assistants☆90Updated this week
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆41Updated 7 months ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆27Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆51Updated 2 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆32Updated 9 months ago
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations☆43Updated this week
- ☆20Updated 5 months ago
- ☆28Updated 11 months ago
- [Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…☆65Updated 9 months ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆57Updated 2 months ago
- ☆35Updated 4 months ago
- All generative model in one for better TTS model☆66Updated 4 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆48Updated 6 months ago
- Official Code for ParrotTTS☆46Updated 3 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- ☆28Updated last year
- ☆24Updated 6 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Updated 2 months ago
- Just another FastSpeech 2 but cleaner code :)☆25Updated 6 months ago
- ☆25Updated last year