vincentqb / audio-tutorial
Experiments and tutorials with and for torchaudio
☆13Updated 3 years ago
Alternatives and similar repositories for audio-tutorial:
Users that are interested in audio-tutorial are comparing it to the libraries listed below
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆33Updated 7 years ago
- This sample includes simeple CNN classifier for music and audio-folder dataloader just like ImageFolder in torchvision.☆11Updated 6 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 6 years ago
- WaveNet implementation using tf.estimator☆21Updated last year
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 4 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- A Text2Speech Engine built in Pytorch.☆11Updated 6 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- ☆21Updated 6 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆24Updated 4 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆43Updated 3 years ago
- ☆27Updated 5 years ago
- Anonymous ICLR Submission☆14Updated 5 years ago
- ☆16Updated 5 years ago
- The History of Speech Recognition to the Year 2030☆12Updated 3 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- ☆22Updated 3 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 3 years ago
- ESPnet-TTS Audio Sample HP☆21Updated 5 years ago
- Real-time melgan based on cpu !!!☆13Updated 5 years ago
- A library of speech gadgets.☆13Updated 2 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Updated 6 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 4 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Updated 6 years ago
- ☆31Updated 6 years ago
- WaveNet Vocoder Samples☆23Updated 5 years ago
- ☆10Updated 10 months ago