audeering / audbLinks
Manage audio and video datasets
☆31Updated 2 weeks ago
Alternatives and similar repositories for audb
Users that are interested in audb are comparing it to the libraries listed below
Sorting:
- ☆32Updated 3 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated 10 months ago
- [Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation☆22Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆97Updated 11 months ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 9 months ago
- ☆34Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated 3 months ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆44Updated last year
- Frechet Audio Distance evaluation in PyTorch☆36Updated 2 years ago
- ☆86Updated 9 months ago
- ☆13Updated last year
- Machine learning speaker characteristics☆36Updated this week
- ☆30Updated 2 years ago
- Unofficial implementation of wavenext vocoder☆48Updated 10 months ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆123Updated 10 months ago
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 3 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆50Updated last year
- Viterbi decoding in PyTorch☆34Updated 2 months ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆117Updated 2 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆33Updated 9 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆37Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆36Updated 2 months ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆75Updated last month
- Reproducible experimental protocols for multimedia (audio, video, text) database☆104Updated 5 months ago