Robofied / Voicenet
Comprehensive Python library for speech and voice.
☆32Updated 2 years ago
Alternatives and similar repositories for Voicenet:
Users that are interested in Voicenet are comparing it to the libraries listed below
- The History of Speech Recognition to the Year 2030☆13Updated 3 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- ☆27Updated 6 years ago
- Collection of models and extensions for deployment in PyTorch☆24Updated 2 years ago
- PyTorch implementation of the Feed-Forward Attention Mechanism.☆18Updated 6 years ago
- ☆33Updated 6 years ago
- ASR project with pytorch-lightning☆20Updated last month
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- ☆29Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 3 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 5 years ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Updated 4 months ago
- bumble bee transformer☆14Updated 4 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- Voice Conversion using Tacotron.☆11Updated 2 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 4 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Data processing tools for preparing speech and labels for training TTS voices☆27Updated 4 years ago
- ☆10Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- ☆12Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆44Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago