audeering / audb
Manage audio and video datasets
☆28Updated 2 weeks ago
Alternatives and similar repositories for audb
Users that are interested in audb are comparing it to the libraries listed below
Sorting:
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- This code is to run the WARP-Q speech quality metric.☆35Updated 6 months ago
- ☆43Updated 11 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆47Updated last month
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 8 months ago
- easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox☆49Updated 5 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated last month
- ☆29Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆35Updated 8 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆36Updated last year
- Unofficial implementation of wavenext vocoder☆45Updated 8 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆96Updated 9 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 10 months ago
- Machine learning speaker characteristics☆33Updated last week
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆32Updated 7 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- A library built for easier audio self-supervised training, downstream tasks evaluation☆117Updated 8 months ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆48Updated 11 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆67Updated 6 months ago
- Viterbi decoding in PyTorch☆32Updated last month
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- ☆24Updated 2 years ago
- ☆32Updated last year
- Paderbox: A collection of utilities for audio / speech processing☆38Updated last week
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆12Updated 9 months ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- EVAR ~ Evaluation package for Audio Representations☆54Updated last week
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆51Updated 11 months ago
- ☆54Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆75Updated last year