topel / audioset-convnext-infLinks
Adapting a ConvNeXt model to audio classification on AudioSet
☆25Updated 5 months ago
Alternatives and similar repositories for audioset-convnext-inf
Users that are interested in audioset-convnext-inf are comparing it to the libraries listed below
Sorting:
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- experiments about AudioSet☆44Updated 2 years ago
- SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING☆40Updated 2 years ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆20Updated 7 months ago
- Streaming Audiotransformers for online Audio tagging☆46Updated last year
- ☆30Updated 2 years ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Updated 11 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆55Updated 5 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆102Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 2 years ago
- ☆83Updated 2 months ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆88Updated 3 years ago
- Official repository of NeXt-TDNN for speaker verification☆75Updated 9 months ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆46Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆27Updated last year
- Unofficial implementation of FSD50k baselines for Sound Event Recognition☆26Updated last year
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆42Updated 2 years ago
- ☆54Updated 2 years ago
- ☆34Updated last year
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 3 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆20Updated 3 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆48Updated 2 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆37Updated 2 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated last week
- ☆24Updated 9 months ago