phrasenmaeher / audio-transformation-visualization
A Streamlit application that lets you explore the effects of different audio augmentation techniques
☆27Updated 2 years ago
Related projects:
- Voice activity engine benchmark framework☆12Updated last year
- A package for crawling audio from YouTube☆41Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆45Updated last year
- CTC decoder implemented in pure Python. Also supports language model decoding using KenLM.☆36Updated 4 months ago
- ☆56Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- A PyTorch implementation of Google's Non-attentive Tacotron.☆57Updated last year
- TensorFlow 2 Speech Recognition Code (Transformer)☆25Updated 4 years ago
- Final training script from the Hugging Face Whisper fine-tuning event, for getting the best results from a fine-tuned model.☆12Updated last year
- Compute useful transcription metrics (CER, WER, SER, ...)☆26Updated 9 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆15Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 2 years ago
- Transformer implementation specialized in speech recognition tasks using PyTorch.☆63Updated 2 years ago
- GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification☆12Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- SpeechYOLO Interspeech 2019☆42Updated 2 years ago
- neural network based speaker embedder☆25Updated last year
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models"☆44Updated 2 years ago
- asr2k☆48Updated 3 months ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆33Updated 2 years ago
- Refactored version of https://github.com/ming024/FastSpeech2☆13Updated 2 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆25Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 3 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec…☆43Updated 11 months ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 3 years ago
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.☆34Updated 2 years ago
- PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 3 years ago
- Awesome list of TTS papers with audio samples☆59Updated 4 years ago