sithu31296 / audio-tagging
Easy to use Audio Tagging in PyTorch
☆20 · Updated 3 years ago
Alternatives and similar repositories for audio-tagging:
Users who are interested in audio-tagging are comparing it to the libraries listed below:
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation ☆83 · Updated 10 months ago
- ☆64 · Updated last year
- PyTorch implementation of subband decomposition ☆91 · Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution ☆51 · Updated 2 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆53 · Updated last year
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS. ☆21 · Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods ☆144 · Updated 2 years ago
- ☆13 · Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement ☆22 · Updated last year
- ☆70 · Updated last month
- This package aims at simplifying the download of the AudioSet dataset. ☆45 · Updated last year
- Single channel speech source separation by diffusion process (ICASSP 2023) ☆96 · Updated 10 months ago
- ☆62 · Updated 4 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆71 · Updated 3 weeks ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆46 · Updated 3 months ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 · Updated 2 years ago
- Fully Quantized Neural Networks For Speech Enhancement ☆60 · Updated 11 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation… ☆62 · Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams ☆101 · Updated 2 months ago
- Conformer-based Metric GAN for speech enhancement ☆26 · Updated 8 months ago
- Streaming Audiotransformers for online Audio tagging ☆43 · Updated 7 months ago
- Clustering-based methods for overlapping diarization ☆74 · Updated last year
- BAE-Net: A Low Complexity and High Fidelity Bandwidth-Adaptive Neural Network for Speech Super-Resolution ☆65 · Updated 4 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement ☆154 · Updated 2 years ago
- ☆69 · Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆79 · Updated 4 months ago
- How to use our public wav2vec2 age and gender model ☆34 · Updated last year
- ☆29 · Updated 6 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv… ☆42 · Updated 9 months ago