awsaf49 / audio_classification_models
TensorFlow Audio Classification Models
☆12Updated last year
Alternatives and similar repositories for audio_classification_models:
Users who are interested in audio_classification_models are comparing it to the libraries listed below.
- This repository includes the code to reproduce our paper "Raw Differentiable Architecture Search for Speech Deepfake and Spoofing Detecti…☆11Updated last year
- Deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.…☆11Updated last year
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Updated 3 years ago
- Implementation of "SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection" paper☆33Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper☆17Updated 2 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 6 months ago
- Implementation of Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection paper☆57Updated last year
- FastAudio is a learnable audio frontend designed by team Magnum for the ASVspoof 2021 challenge☆45Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆80Updated last year
- This repository includes the code to reproduce our paper Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing…☆17Updated 2 years ago
- The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 2 months ago
- Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"☆53Updated 3 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 2 years ago
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆17Updated 2 years ago
- Transformer implementation specialized in speech recognition tasks using PyTorch.☆66Updated 3 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆74Updated 4 years ago
- Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer☆30Updated 3 years ago
- This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifact…☆109Updated 5 months ago
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras.☆24Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- SUTD 50.039 Deep Learning Course Project (2022 Spring)☆73Updated last year
- The description of the FMFCC-A (audio track of FMFCC) dataset and challenge results.☆24Updated 2 years ago
- [InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"☆12Updated 11 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆74Updated 3 years ago