madhavmk / QA_VoiceBot_Desktop_Application
end-to-end voicebot that answers open domain questions.
☆10Updated 3 years ago
Alternatives and similar repositories for QA_VoiceBot_Desktop_Application:
Users that are interested in QA_VoiceBot_Desktop_Application are comparing it to the libraries listed below
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 6 years ago
- WaveNet Vocoder Samples☆23Updated 5 years ago
- Implementation of MelNet in PyTorch to generate high-fidelity audio samples☆23Updated 4 years ago
- The History of Speech Recognition to the Year 2030☆11Updated 3 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Updated 4 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 6 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- ParallelWaveGAN adaptation for Mozilla TTS☆15Updated 4 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆43Updated 3 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Source code for INTERSPEECH2020☆11Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 6 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆17Updated 5 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆51Updated 4 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 3 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago