awsaf49 / audio_classification_models
Tensorflow Audio Classification Models
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for audio_classification_models
- This repository includes the code to reproduce our paper "Raw Differentiable Architecture Search for Speech Deepfake and Spoofing Detecti…☆11Updated last year
- This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifact…☆94Updated 2 months ago
- Implementation of Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection paper☆54Updated last year
- Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"☆111Updated 2 months ago
- Implementation of "SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection" paper☆32Updated last year
- Implementation of the paper "Improved DeepFake Detection Using Whisper Features"☆91Updated 6 months ago
- A comapartive analysis of voice spoofing detection systems, based on a paper available at https://arxiv.org/abs/2210.00417.☆13Updated 2 years ago
- ☆80Updated 2 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆71Updated 4 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆47Updated last year
- Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"☆53Updated 2 years ago
- Official PyTorch implementation of "AASIST: Audio Anti-Spoofing using Integrated Spectro-Temporal Graph Attention Networks"☆176Updated last year
- Time-domain synthetic speech detection net (TSSDNet), having the classic ResNet and Inception Net style structures (Res-TSSDNet and Inc-T…☆65Updated 3 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆32Updated 2 years ago
- Pytorch implementation of INTEGRATED PARAMETER-EFFICIENT TUNING FOR GENERAL-PURPOSE AUDIO MODELS☆10Updated last year
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆45Updated last year
- speaker_diarization done on toy dataset and tested on timit dataset☆8Updated 2 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆11Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆64Updated 2 years ago
- Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家☆43Updated 6 months ago
- ☆59Updated 2 months ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Updated 2 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆17Updated 2 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- [INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark☆147Updated 5 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆123Updated 2 years ago
- The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.☆24Updated 2 years ago
- [InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"☆12Updated 8 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 3 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year