ANLGBOY / ICLR-2024-OpenReview-RatingsLinks
☆31Updated last year
Alternatives and similar repositories for ICLR-2024-OpenReview-Ratings
Users that are interested in ICLR-2024-OpenReview-Ratings are comparing it to the libraries listed below
Sorting:
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆58Updated 7 months ago
- ☆17Updated last year
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)☆88Updated 6 months ago
- The open source code for LLM-Codec☆135Updated 10 months ago
- ☆84Updated 3 weeks ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆59Updated 7 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆81Updated 3 months ago
- Official release of StyleTalk dataset.☆67Updated 11 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆92Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆70Updated 10 months ago
- Pytorch implementation for “V2C: Visual Voice Cloning”☆32Updated 2 years ago
- [ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis☆18Updated 2 months ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆49Updated last year
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆46Updated last year
- small audio language model for reasoning☆64Updated 2 months ago
- [ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"☆84Updated 7 months ago
- ☆55Updated 2 years ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆82Updated 7 months ago
- ☆39Updated 9 months ago
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆21Updated last year
- Official Implementation of "Prefix tuning for Automated Audio Captioning(ICASSP 2023)"☆30Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆52Updated 2 years ago
- ☆61Updated 7 months ago
- Source code for DM-Codec.☆45Updated 3 weeks ago
- ☆37Updated 2 months ago
- Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆45Updated 3 weeks ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆55Updated 8 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆24Updated 9 months ago
- The demo page for ALMTokenizer☆51Updated 2 months ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆57Updated last year