Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))
☆50Aug 6, 2024Updated last year
Alternatives and similar repositories for PerceptiveAgent
Users that are interested in PerceptiveAgent are comparing it to the libraries listed below
Sorting:
- text to speech☆10Mar 19, 2024Updated last year
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆37Jul 23, 2025Updated 7 months ago
- ☆25Mar 12, 2022Updated 3 years ago
- [BMVC'24] G3FA: Geometry-guided GAN for Face Animation☆20Mar 14, 2025Updated 11 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆84Oct 11, 2024Updated last year
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 5 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- ☆26Jun 5, 2024Updated last year
- ☆11Sep 1, 2024Updated last year
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆11May 9, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆10Apr 17, 2024Updated last year
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆58Jun 7, 2024Updated last year
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- ☆14Aug 1, 2025Updated 7 months ago
- ☆49May 3, 2020Updated 5 years ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year