LuisKay / Spec_ResNetLinks
Spectrogram is selected as preprocessing feature of audio clips and a feature representation method based on deep residual network (Spec-ResNet) is proposed to detect audio steganography.
☆26Updated 5 years ago
Alternatives and similar repositories for Spec_ResNet
Users that are interested in Spec_ResNet are comparing it to the libraries listed below
Sorting:
- Audio data augmentation examples☆34Updated 7 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Updated last year
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated 2 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆20Updated 3 years ago
- Adversarial attack and defense strategies for deep speaker recognition systems☆42Updated 4 years ago
- PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…☆22Updated 6 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆31Updated last year
- Download and create a tfreader for the audioset dataset☆16Updated 5 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Updated last year
- SELD-TCN: Sound Event Detection & Localization via Temporal Convolutional Network | Python w/ Tensorflow☆66Updated 5 years ago
- The source code of "Audio steganalysis with Improved CNN"☆15Updated 6 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆222Updated 2 years ago
- PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Spea…☆32Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆76Updated 3 years ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 6 years ago
- Targeted Adversarial Examples for Black Box Audio Systems☆71Updated 5 years ago
- implement Wave-U-Net by pytorch☆57Updated 7 years ago
- An audio steganalysis method based on CNN in the time domain.☆12Updated 4 years ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Updated 3 years ago
- Calculate MFCC/Fbank feature for wav files☆14Updated 8 years ago
- speaker recognition using keras☆36Updated 3 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Updated 7 years ago
- A pytorch implementation of MFCC.☆33Updated 3 years ago
- ICASSP 2021 accepted paper☆20Updated 4 years ago
- This is the implementation of the paper "Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification".☆42Updated 2 years ago