phrasenmaeher / audio-transformation-visualization
A streamlit application that lets you explore the effect of different audio augmentation techniques
☆27Updated 2 years ago
Alternatives and similar repositories for audio-transformation-visualization:
Users that are interested in audio-transformation-visualization are comparing it to the libraries listed below
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37Updated 11 months ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- A pakage for crawling audio from Youtube☆41Updated last year
- This is Pytorch Implementation of Google's Non-attentive Tacotron.☆58Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆44Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- ☆56Updated 2 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Updated last year
- SpeechYOLO Interspeech 2019☆43Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆199Updated 2 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆13Updated 5 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Updated 4 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆134Updated 2 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆50Updated 9 months ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆73Updated last week
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆90Updated 3 years ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆85Updated 2 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 3 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Updated 3 years ago