DagsHub / audio-datasets
open-source audio datasets
☆149Updated last year
Alternatives and similar repositories for audio-datasets:
Users that are interested in audio-datasets are comparing it to the libraries listed below
- A collection of useful audio datasets and transforms for PyTorch.☆139Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆133Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆199Updated 2 years ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆88Updated 3 weeks ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆63Updated 2 years ago
- PyTorch wrappers for using your model in audacity!☆174Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 2 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆143Updated last year
- Pytorch implementation of deep audio embedding calculation☆105Updated last year
- A DDSP-based neural voice synthesiser.☆116Updated 5 months ago
- Python library for downloading, loading & working with sound datasets☆332Updated 6 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆97Updated 8 months ago
- ☆65Updated 7 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆123Updated 4 months ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models☆155Updated last year
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆75Updated 4 years ago
- Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家☆44Updated 11 months ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- Spot the conversation: speaker diarisation in the wild☆137Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆81Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Pitch Estimating Neural Networks (PENN)☆249Updated 3 weeks ago
- A simple library for Fréchet Audio Distance (FAD) calculation☆202Updated last week
- ☆92Updated 2 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated last year
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆281Updated 5 months ago
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆221Updated 9 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆50Updated 2 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆90Updated 3 years ago