Audio-WestlakeU / audiossl
A library built for easier audio self-supervised training and downstream task evaluation
☆121Updated 9 months ago
Alternatives and similar repositories for audiossl
Users who are interested in audiossl are comparing it to the libraries listed below.
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆103Updated 10 months ago
- EVAR ~ Evaluation package for Audio Representations☆58Updated this week
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆133Updated 2 weeks ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- ☆33Updated last month
- Inference code for PaSST, using the HEAR API.☆33Updated last year
- ☆52Updated last month
- PyTorch implementation of subband decomposition☆92Updated 2 years ago
- ☆53Updated last month
- ☆65Updated 9 months ago
- Domestic environment sound event detection task☆145Updated last year
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆68Updated 2 months ago
- Official data preparation scripts for the URGENT 2024 Challenge☆80Updated last month
- A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline☆155Updated 6 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆64Updated 11 months ago
- Audio Captioning datasets for PyTorch.☆119Updated 3 weeks ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in PyTorch.☆81Updated 2 years ago
- ☆121Updated 3 years ago
- ☆193Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆40Updated last month
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆142Updated 2 years ago
- A fast implementation of bss_eval metrics for blind source separation☆138Updated 3 years ago
- The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"☆121Updated 2 years ago
- Baseline method for sound event localization task of DCASE 2022 challenge☆55Updated 3 years ago
- PyTorch implementation of the LEAF audio frontend☆73Updated 2 years ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆109Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET)☆86Updated 3 weeks ago
- ☆84Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆98Updated 9 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆59Updated 8 months ago