audeering / audb
Manage audio and video datasets
☆28Updated 2 weeks ago
Alternatives and similar repositories for audb:
Users that are interested in audb are comparing it to the libraries listed below
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆43Updated 10 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆30Updated 6 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated last month
- This code is to run the WARP-Q speech quality metric.☆35Updated 6 months ago
- Easy to use Audio Tagging in PyTorch☆21Updated 3 years ago
- ☆32Updated 3 years ago
- Unofficial implementation of wavenext vocoder☆44Updated 7 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network☆73Updated 3 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆95Updated 8 months ago
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated 8 months ago
- ☆31Updated last year
- Viterbi decoding in PyTorch☆30Updated 2 weeks ago
- Streaming Audiotransformers for online Audio tagging☆44Updated 10 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆45Updated last month
- Speech Human Evaluation Estimation Toolkit (SHEET)☆63Updated 5 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆12Updated 8 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 9 months ago
- ☆22Updated 3 years ago
- AudioSR-Upsampling (any -> 48kHz)☆40Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆27Updated 9 months ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 2 months ago
- ☆13Updated last year
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆34Updated 7 months ago
- ☆28Updated last year
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Machine learning speaker characteristics☆33Updated last week
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated 2 years ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆57Updated 8 months ago