DagsHub / audio-datasets
open-source audio datasets
☆147Updated last year
Alternatives and similar repositories for audio-datasets:
Users that are interested in audio-datasets are comparing it to the libraries listed below
- A collection of useful audio datasets and transforms for PyTorch.☆137Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆195Updated 2 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆132Updated 2 years ago
- ☆63Updated 6 months ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆61Updated 2 years ago
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆131Updated 3 months ago
- ☆66Updated 3 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- Automated Reproducible Acoustical Analysis☆149Updated 7 months ago
- ☆91Updated 2 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆315Updated 5 months ago
- Pitch Estimating Neural Networks (PENN)☆243Updated 7 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆113Updated 3 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆112Updated 6 months ago
- Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)☆111Updated 3 months ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆242Updated last year
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆39Updated 3 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆269Updated 3 months ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆401Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆48Updated 3 years ago
- Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家☆43Updated 10 months ago
- A unified dataset of multilingual emotional human utterances☆24Updated 3 years ago
- A simple library for Fréchet Audio Distance (FAD) calculation☆184Updated last week
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆32Updated last year
- 😎 Awesome lists about Speech Emotion Recognition☆83Updated 2 months ago
- see README☆336Updated 7 months ago
- Spot the conversation: speaker diarisation in the wild☆135Updated 2 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆23Updated 2 weeks ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago