sithu31296 / audio-tagging
Easy to use Audio Tagging in PyTorch
☆22 · Updated 4 years ago
Alternatives and similar repositories for audio-tagging
Users who are interested in audio-tagging are comparing it to the libraries listed below.
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT ☆54 · Updated 2 years ago
- Pytorch implementation of subband decomposition ☆92 · Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging ☆47 · Updated last year
- Evaluation and Benchmarking of Speech Super-resolution Methods ☆152 · Updated 3 years ago
- ☆65 · Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆75 · Updated 3 months ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 · Updated 2 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation ☆42 · Updated 2 years ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic… ☆48 · Updated last week
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings ☆34 · Updated 11 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆42 · Updated 3 years ago
- ☆54 · Updated 2 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement ☆155 · Updated 3 years ago
- A simple package for Guided source separation (GSS) ☆128 · Updated last year
- ☆13 · Updated 2 years ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv… ☆46 · Updated 5 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION ☆69 · Updated last year
- Discriminative Training of VBx Diarization ☆26 · Updated 11 months ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. ☆14 · Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆55 · Updated 6 months ago
- An official implementation of the ICASSP 2024 paper: Dual-Path TFC-TDF UNet for Music Source Separation ☆92 · Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams ☆112 · Updated 6 months ago
- Conformer-based Metric GAN for speech enhancement ☆26 · Updated last year
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain. ☆93 · Updated 2 years ago
- ☆60 · Updated 4 years ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement ☆23 · Updated 2 years ago
- Clustering-based methods for overlapping diarization ☆81 · Updated last year
- This is the official implementation of reverberant speech to room impulse response estimator ☆36 · Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction ☆39 · Updated 11 months ago
- This code is to run the WARP-Q speech quality metric. ☆35 · Updated 10 months ago