haloboy777 / wav-to-pcmLinks
This contains python scripts for converting wav files to pcm data for further processing.
☆11Updated 8 years ago
Alternatives and similar repositories for wav-to-pcm
Users that are interested in wav-to-pcm are comparing it to the libraries listed below
Sorting:
- Example python scripts to evaluate various ASR methods☆12Updated 3 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 6 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆29Updated 2 years ago
- LogMMSE speech enhancement/noise reduction☆30Updated 5 years ago
- Wrapper of well-known transcribers that transform text into phoneme codes☆15Updated 4 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 2 years ago
- Two-stage GANs that generate fingerstyle guitarist images from audio.☆59Updated 7 years ago
- Sequence Modelling with CTC☆50Updated 2 years ago
- A deep learning solution to the Query By Singing/Humming (QBSH) problem in Music Information Retrieval (MIR).☆15Updated 8 years ago
- Spleeter implementation in pytorch☆39Updated 3 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019.☆51Updated 2 years ago
- Identify sounds in short audio clips☆156Updated last month
- Audio Classification - Multilayer Neural Networks using TensorFlow☆28Updated 8 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- A implementation voice morphing using relgan with tensorflow☆25Updated 2 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 4 years ago
- Removes silence segments from wav audio files☆29Updated 5 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆181Updated 3 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- SpeechYOLO Interspeech 2019☆46Updated 3 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- Generate embedding vectors from audio files☆59Updated last month
- A Pytorch Implementation of MelNet☆26Updated 5 years ago
- Finally, some decent sample sentences☆23Updated last year
- Python C extension for the eSpeak speech synthesizer☆12Updated 4 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Real-time Audio time-scale and pitch modification in Python☆59Updated 6 years ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2.☆50Updated 6 years ago