ANLGBOY / ICLR-2024-OpenReview-Ratings
☆30Updated last year
Alternatives and similar repositories for ICLR-2024-OpenReview-Ratings:
Users that are interested in ICLR-2024-OpenReview-Ratings are comparing it to the libraries listed below
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆47Updated 3 months ago
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)☆79Updated 2 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆73Updated 4 months ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆42Updated 8 months ago
- Official release of StyleTalk dataset.☆61Updated 7 months ago
- The open source code for LLM-Codec☆126Updated 6 months ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆88Updated last month
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆63Updated 3 months ago
- ☆139Updated 5 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆53Updated 3 months ago
- Pytorch implementation for “V2C: Visual Voice Cloning”☆30Updated 2 years ago
- ☆34Updated 10 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆50Updated 3 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated 5 months ago
- The open source code for SimpleSpeech series☆127Updated 4 months ago
- ☆59Updated 3 months ago
- ☆15Updated 10 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆49Updated last year
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆168Updated 7 months ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆60Updated 10 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆90Updated 8 months ago
- ☆50Updated last year
- ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis☆125Updated 5 months ago
- ☆43Updated 3 weeks ago
- ☆64Updated last year
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆139Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆53Updated 8 months ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆63Updated 3 months ago