LuisKay / Spec_ResNetLinks
Spectrogram is selected as preprocessing feature of audio clips and a feature representation method based on deep residual network (Spec-ResNet) is proposed to detect audio steganography.
☆25Updated 4 years ago
Alternatives and similar repositories for Spec_ResNet
Users that are interested in Spec_ResNet are comparing it to the libraries listed below
Sorting:
- Adversarial attack and defense strategies for deep speaker recognition systems☆41Updated 4 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow☆64Updated 4 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Updated 7 years ago
- implement Wave-U-Net by pytorch☆56Updated 6 years ago
- Adversarial Unsupervised Domain Adaptation for Acoustic Scene Classification☆35Updated 6 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆19Updated 2 years ago
- Sound classification using neural networks☆12Updated 7 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆26Updated 11 months ago
- Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.☆70Updated last year
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆220Updated 2 years ago
- An audio steganalysis method based on CNN in the time domain.☆10Updated 4 years ago
- MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awar…☆139Updated 4 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…☆21Updated 6 years ago
- PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Spea…☆29Updated 5 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆53Updated 8 years ago
- The source code of "Audio steganalysis with Improved CNN"☆14Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆74Updated 2 years ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Updated 4 years ago
- This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".☆43Updated 2 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆30Updated last year
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Updated 2 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 6 years ago
- This is the code&dataset for our paper [Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment. AAAI 2018]☆57Updated 7 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆109Updated last year