sithu31296 / audio-tagging
Easy to use Audio Tagging in PyTorch
☆20Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for audio-tagging
- ☆64Updated last year
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆50Updated 2 years ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆16Updated 4 months ago
- This code is to run the WARP-Q speech quality metric.☆34Updated last month
- Evaluation and Benchmarking of Speech Super-resolution Methods☆141Updated 2 years ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv …☆42Updated 8 months ago
- Streaming Audiotransformers for online Audio tagging☆41Updated 5 months ago
- Query-conditioned target sound extraction model☆17Updated 3 weeks ago
- ☆13Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆57Updated 3 months ago
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆60Updated 9 months ago
- A simple package for Guided source separation (GSS)☆107Updated 6 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.☆22Updated last year
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆153Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆68Updated last year
- Conformer-based Metric GAN for speech enhancement☆26Updated 6 months ago
- ☆68Updated 2 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆52Updated last year
- ☆59Updated 2 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆72Updated 2 months ago
- ☆27Updated 4 months ago
- ☆19Updated last year
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for …☆65Updated 2 years ago
- ☆48Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated last year
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆81Updated 8 months ago