vincentqb / audio-tutorial
Experiments and tutorials with and for torchaudio
☆13Updated 4 years ago
Alternatives and similar repositories for audio-tutorial
Users that are interested in audio-tutorial are comparing it to the libraries listed below
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 3 years ago
- ☆21Updated 7 years ago
- ☆56Updated 7 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 7 years ago
- Real-time melgan based on cpu !!!☆13Updated 5 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Tensorflow Implementation of WaveGlow☆37Updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Updated 4 years ago
- ☆27Updated 6 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆34Updated 8 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago
- pytorch implementation of lyre.ai's char2wav model☆32Updated 8 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Updated 5 years ago
- Network specification and demo☆35Updated 8 years ago
- A deep learning solution to the Query By Singing/Humming (QBSH) problem in Music Information Retrieval (MIR).☆15Updated 8 years ago
- Convolutional neural networks for sound classification☆20Updated 7 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 6 years ago
- This sample includes a simple CNN classifier for music and an audio-folder dataloader, just like ImageFolder in torchvision.☆11Updated 6 years ago
- A pytorch implementation of FFTNet.☆37Updated 6 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆27Updated 5 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- ☆32Updated 3 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Updated 4 years ago