pooya-mohammadi / audio-classification-pytorch
In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number of classes and the input dataset.
☆38Updated last month
Related projects ⓘ
Alternatives and complementary repositories for audio-classification-pytorch
- ☆19Updated 3 months ago
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆44Updated last year
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆19Updated last year
- Time-domain synthetic speech detection net (TSSDNet), having the classic ResNet and Inception Net style structures (Res-TSSDNet and Inc-T…☆65Updated 3 years ago
- Pytorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"☆26Updated last year
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆24Updated last month
- Official repository of NeXt-TDNN for speaker verification☆56Updated last month
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆64Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆111Updated last year
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆31Updated 2 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆23Updated 4 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆85Updated last year
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆109Updated last year
- Framework for training and evaluating self-supervised learning methods for speaker verification.☆19Updated last week
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆17Updated 2 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆11Updated 2 years ago
- Implementation of the paper "Improved DeepFake Detection Using Whisper Features"☆89Updated 6 months ago
- Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"☆53Updated 2 years ago
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆66Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- ☆46Updated 11 months ago
- ☆27Updated 2 years ago
- ☆24Updated 9 months ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆43Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles☆138Updated 5 months ago
- SafeEar: Content Privacy-Preserving Audio Deepfake Detection (Accepted by CCS 2024)☆44Updated 3 weeks ago
- This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.…☆107Updated last year
- SASV2 baseline, a track on ASVspoof5 phase2 challenge☆22Updated 3 months ago