Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24))
☆50Aug 6, 2024Updated last year
Alternatives and similar repositories for PerceptiveAgent
Users that are interested in PerceptiveAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆85Oct 11, 2024Updated last year
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆37Jul 23, 2025Updated 8 months ago
- ☆25Mar 12, 2022Updated 4 years ago
- ☆14Aug 1, 2025Updated 7 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 3 years ago
- [BMVC'24] G3FA: Geometry-guided GAN for Face Animation☆20Mar 14, 2025Updated last year
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆60Jun 7, 2024Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37May 25, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Dec 31, 2023Updated 2 years ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆76Aug 30, 2021Updated 4 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow☆12Sep 12, 2016Updated 9 years ago
- The official implementation of the DIFFA series for dLLM-based large audio language model☆68Mar 12, 2026Updated 2 weeks ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆64Dec 26, 2025Updated 3 months ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- ☆22Nov 25, 2025Updated 4 months ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 6 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- MSP-Podcast Challenge Baseline Code☆31Jun 12, 2024Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data☆18Feb 28, 2025Updated last year
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- ☆44Aug 17, 2024Updated last year
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Apr 8, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- [ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec☆275Nov 22, 2024Updated last year
- ☆26Jun 5, 2024Updated last year
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆73Aug 3, 2021Updated 4 years ago