haloboy777 / wav-to-pcm
Python scripts for converting WAV files to PCM data for further processing.
☆12Updated 8 years ago
Alternatives and similar repositories for wav-to-pcm
Users who are interested in wav-to-pcm are comparing it to the repositories listed below.
- Flask + Tornado based NVIDIA Tacotron2 + WaveGlow TTS web app☆29Updated 2 years ago
- A deep learning solution to the Query By Singing/Humming (QBSH) problem in Music Information Retrieval (MIR).☆15Updated 8 years ago
- Example python scripts to evaluate various ASR methods☆12Updated 3 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- Two-stage GANs that generate fingerstyle guitarist images from audio.☆59Updated 6 years ago
- An extensible speech synthesis system built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery ☆26Updated 6 years ago
- A Python library that can apply Darth Vader, echo, radio, robotic, and ghost effects to audio samples.☆57Updated last year
- Detects which song in a database an audio segment belongs to, and returns Nil if the song does not exist in the database.☆21Updated 4 years ago
- This repository contains the code of Spleeter from Deezer, migrated to TF2.0☆27Updated 4 years ago
- Spleeter implementation in PyTorch☆39Updated 3 years ago
- Creates video from TTS output and viseme images.☆12Updated 3 years ago
- 2019_ML_Course Singing Voice Conversion Using Cycle-GAN:VC2☆16Updated 4 years ago
- Modified version of OpenVINO noise_suppression_demo. This version can handle real-time audio stream from microphone and output to headpho…☆15Updated 3 years ago
- Finally, some decent sample sentences☆23Updated last year
- Python library for audio augmentation☆84Updated 2 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆85Updated 4 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆30Updated last year
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- This tool can be used to convert MP3 files to processable WAV files, split WAV files into chunks, and generate spectrograms.☆31Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- Removing background noise in a sound file☆63Updated 6 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Text detection and recognition in natural videos☆27Updated 7 years ago
- Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…☆92Updated 7 years ago
- Audio Classification - Multilayer Neural Networks using TensorFlow☆28Updated 8 years ago