MorenoLaQuatra / audioset-download
This package aims at simplifying the download of the AudioSet dataset.
☆48Updated last year
Alternatives and similar repositories for audioset-download:
Users that are interested in audioset-download are comparing it to the libraries listed below
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆38Updated 10 months ago
- ☆44Updated last month
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆126Updated 4 months ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆58Updated 9 months ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆39Updated 3 weeks ago
- Official data preparation scripts for the URGENT 2024 Challenge☆76Updated 3 months ago
- EVAR ~ Evaluation package for Audio Representations☆51Updated 5 months ago
- ☆55Updated 2 years ago
- ☆48Updated 4 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆116Updated 7 months ago
- US-based professors who work on audio. For students who would like to apply for RA, PhD, postdoc in audio research.☆25Updated 3 weeks ago
- Query-conditioned target sound extraction model☆21Updated last month
- Generation scripts for EARS-WHAM and EARS-Reverb☆31Updated 7 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆65Updated 5 months ago
- ☆30Updated 5 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- Source code for Consistent ensemble distillation for audio tagging☆30Updated 9 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆54Updated 3 months ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- ☆24Updated 6 months ago
- ☆35Updated 9 months ago
- ☆28Updated 11 months ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆26Updated last month
- ☆75Updated 6 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆67Updated 8 months ago
- experiments about AudioSet☆44Updated last year
- Official code of ElasticAST (Interspeech 2024 paper)☆30Updated 8 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆60Updated 2 years ago
- It includes papers on speech&audio field. Now update: ICLR2023-2025, ICML2023-2024, NeurIPS2023-2024, ACMMM2024, AAAI2024, ACL2024, EMNLP…☆49Updated this week