Anwarvic / CNN-for-Raw-Waveforms
This is my PyTorch implementation of the "Very Deep Convolutional Neural Networks For Raw Waveforms" research paper published in 2016.
☆14Updated 3 years ago
Alternatives and similar repositories for CNN-for-Raw-Waveforms:
Users that are interested in CNN-for-Raw-Waveforms are comparing it to the libraries listed below
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆11Updated 5 years ago
- Constrained Permutation Invariant Training, Speech Separation☆47Updated 4 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆54Updated last year
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆92Updated 4 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆39Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- (tensorflow) Wiener Filter based Speech Enhancement(LSTM/BLSTM, GRU/BGRU, Transformer)☆15Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Few-Shot Keyword Spotting☆63Updated 3 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs☆32Updated 6 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 3 years ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- Clustering-based methods for overlapping diarization☆78Updated last year
- Blind Source Separation and Dereverberation☆19Updated 4 years ago
- A program to generate microphone wind noise audio. Ideal for generating example data for designing noise removal algorithms.☆17Updated 6 years ago
- Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augment…☆42Updated 2 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Updated 5 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models☆21Updated 7 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆40Updated 3 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated 2 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆10Updated 2 years ago