sithu31296 / audio-tagging
Easy to use Audio Tagging in PyTorch
☆20Updated 3 years ago
Alternatives and similar repositories for audio-tagging:
Users that are interested in audio-tagging are comparing it to the libraries listed below
- Streaming Audiotransformers for online Audio tagging☆43Updated 9 months ago
- ☆64Updated last year
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆53Updated last year
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆27Updated 8 months ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 2 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- ☆24Updated 5 months ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆52Updated 2 years ago
- ☆30Updated 8 months ago
- ☆21Updated last year
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆51Updated 7 months ago
- Paderborn Sound Event Detection☆73Updated last year
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- ☆48Updated 2 years ago
- Speech Dereverberation using Fully Convolutional Networks☆71Updated 4 years ago
- PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification☆28Updated 4 years ago
- ☆78Updated 9 months ago
- We implemented the DEMUCS model for speech enhancement in the time-frequency domain, and additionally implemented HD-DEMUCS.☆28Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods☆149Updated 2 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆41Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- Query-conditioned target sound extraction model☆20Updated this week
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆73Updated 2 months ago
- ☆54Updated 9 months ago
- ☆13Updated last year
- ☆13Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆132Updated 2 years ago