sithu31296 / audio-tagging
Easy to use Audio Tagging in PyTorch
☆22 Updated 4 years ago
Alternatives and similar repositories for audio-tagging
Users who are interested in audio-tagging are comparing it to the libraries listed below:
- PyTorch implementation of subband decomposition ☆92 Updated 3 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆55 Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 2 years ago
- ☆66 Updated 2 years ago
- Conformer-based Metric GAN for speech enhancement ☆26 Updated last year
- This code is to run the WARP-Q speech quality metric. ☆35 Updated last year
- Algorithm for blind estimation of reverberation time ☆34 Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆45 Updated last year
- ☆73 Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging ☆49 Updated last year
- ☆54 Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆76 Updated 6 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv… ☆48 Updated 8 months ago
- Fully Quantized Neural Networks For Speech Enhancement ☆63 Updated last year
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications ☆46 Updated 3 years ago
- ☆27 Updated 2 years ago
- Machine and Deep Learning models for speech dereverberation ☆118 Updated 3 years ago
- This is the official implementation of reverberant speech to room impulse response estimator ☆39 Updated last year
- Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT" ☆15 Updated 2 years ago
- ☆14 Updated 3 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram. ☆24 Updated 4 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings ☆35 Updated last month
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement ☆158 Updated 3 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement" ☆52 Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods ☆152 Updated 3 years ago
- ☆89 Updated last year
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain. ☆93 Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction ☆41 Updated last year
- ☆66 Updated 4 years ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic… ☆52 Updated 3 months ago