sithu31296 / audio-tagging
Easy to use Audio Tagging in PyTorch
☆20Updated 3 years ago
Alternatives and similar repositories for audio-tagging:
Users that are interested in audio-tagging are comparing it to the libraries listed below
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆52Updated 2 years ago
- ☆13Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆51Updated 2 years ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- This code is to run the WARP-Q speech quality metric.☆34Updated 4 months ago
- ☆64Updated last year
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆62Updated 3 years ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆52Updated 3 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation☆84Updated 11 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆155Updated 2 years ago
- HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.☆13Updated last month
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆63Updated last year
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆72Updated last month
- Streaming Audiotransformers for online Audio tagging☆43Updated 8 months ago
- [Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement☆36Updated 2 months ago
- ☆22Updated last year
- ☆12Updated 2 years ago
- ☆20Updated last year
- ☆25Updated last year
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆47Updated 4 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- ☆56Updated 4 years ago
- Translating Synthetic RIRs to Real RIRs☆41Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 6 months ago
- ☆48Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆37Updated 3 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆39Updated 6 months ago
- ☆13Updated last year