haloboy777 / wav-to-pcm
This repository contains Python scripts for converting WAV files to PCM data for further processing.
☆12Updated 8 years ago
Alternatives and similar repositories for wav-to-pcm
Users who are interested in wav-to-pcm are comparing it to the libraries listed below.
- Creates video from TTS output and viseme images.☆15Updated 3 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Updated 10 years ago
- Flask + Tornado based NVIDIA Tacotron2 + WaveGlow TTS web app☆29Updated 2 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- ☆26Updated 6 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 6 years ago
- Spleeter implementation in pytorch☆39Updated 3 years ago
- Detects which song in a database an audio segment belongs to, and returns Nil if it does not exist in the database.☆22Updated 4 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Audio Classification using Image Classification☆48Updated 5 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆73Updated 6 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- A deep learning solution to the Query By Singing/Humming (QBSH) problem in Music Information Retrieval (MIR).☆15Updated 8 years ago
- An implementation of voice morphing using RelGAN with TensorFlow☆25Updated 2 years ago
- Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Updated 5 years ago
- Takes in a .wav file and outputs a pitch-shifted .wav file of the same length☆46Updated 2 years ago
- Two-stage GANs that generate fingerstyle guitarist images from audio.☆59Updated 7 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- Extract frequency, power, width and dissonance of formants from wav files☆27Updated 3 years ago
- LogMMSE speech enhancement/noise reduction☆30Updated 5 years ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2.☆50Updated 7 years ago
- Using an LSTM and 4d convolutional network for lip reading☆12Updated 7 years ago
- Python implementation of the "Shazam" algorithm☆53Updated 6 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 4 years ago
- ObamaNet fork☆12Updated 6 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON☆18Updated 4 years ago
- Example python scripts to evaluate various ASR methods☆12Updated 3 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 7 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 6 years ago