audeering/audb

Manage audio and video datasets

☆36

Alternatives and similar repositories for audb

Users that are interested in audb are comparing it to the libraries listed below.

audeering / audformat
Format to store media files and annotations
☆12 · May 12, 2026 · Updated 2 months ago
audeering / audiofile
Handling audio files in Python
☆39 · Updated this week
felixbur / nkululeko
Machine learning speaker characteristics
☆46 · Jul 9, 2026 · Updated 2 weeks ago
fxnn / vornamen
German prenames as CSV data
☆13 · Mar 6, 2018 · Updated 8 years ago
autrainer / autrainer
A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.
☆24 · May 12, 2026 · Updated 2 months ago
hsnr-gamera / gamera-4
Gamera 4 for Python 3
☆14 · May 16, 2025 · Updated last year
felixbur / Speechalyzer
label and annotate large number of speech data files
☆12 · May 5, 2021 · Updated 5 years ago
ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13 · Mar 25, 2024 · Updated 2 years ago
bagustris / s3prl-ser
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15 · Feb 28, 2026 · Updated 4 months ago
trigeorgis / multimodal_emotion_recognition
☆13 · Feb 8, 2017 · Updated 9 years ago
openXBOW / openXBOW
openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit
☆85 · Feb 17, 2021 · Updated 5 years ago
introlab / uimvdr
☆13 · Oct 11, 2024 · Updated last year
felixbur / Emofilt
Emofilt is a program to simulate emotional arousal with speech synthesis based on the free-for-non-commercial-use MBROLA synthesis engine…
☆14 · Mar 17, 2022 · Updated 4 years ago
gibbona1 / neal
NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.
☆20 · Jul 12, 2026 · Updated last week
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33 · Aug 19, 2021 · Updated 4 years ago
MorrisXu-Driving / Speech-Augmentation-and-Endpoint-Detection
This repository is developed in MATLAB. Speech Augmentation is based on Adaptive Filtering while Endpoint Detection is based on Voice Act…
☆10 · Dec 7, 2020 · Updated 5 years ago
jeremyxu177 / active-noise-control
☆15 · Nov 3, 2020 · Updated 5 years ago
katkost / MazurkaBL
Score-aligned loudness, beat, and expressive markings data for 2000 Chopin Mazurka recordings
☆14 · Jul 6, 2023 · Updated 3 years ago
p1an-lin-jung / wv_tts
☆19 · Mar 22, 2024 · Updated 2 years ago
bloomen / featureimpact
A Python package for estimating the impact of features on ML models
☆14 · May 18, 2023 · Updated 3 years ago
ZET-Speech / ZET-Speech-Demo
ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)
☆10 · Mar 9, 2024 · Updated 2 years ago
rabitt / motif
melodic object transcription framework
☆26 · Nov 15, 2017 · Updated 8 years ago
audeering / opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
☆840 · Jan 26, 2026 · Updated 5 months ago
mt-upc / ZeroSwot
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25 · Dec 12, 2024 · Updated last year
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
Takaaki-Saeki / ssl_speech_restoration_v2
☆17 · Dec 18, 2023 · Updated 2 years ago
pilarOG / prosodic-analysis
Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality
☆23 · Jun 17, 2019 · Updated 7 years ago
Qin-Yi / Active-Noise-Control-System
Feedforward Active Noise Control System with FxLMS (Jun, 2017)
☆17 · Oct 30, 2018 · Updated 7 years ago
auDeep / auDeep
☆158 · Jan 24, 2021 · Updated 5 years ago
wistia / seamless-aac-split-and-stitch-demo
Split and stitch AAC without the wait!
☆29 · Apr 17, 2024 · Updated 2 years ago
SEILSdataset / SEILSdataset
The SEILS Dataset
☆18 · Oct 24, 2021 · Updated 4 years ago
zjumml / DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆10 · Mar 8, 2022 · Updated 4 years ago
NKU-HLT / AudioEditor
☆47 · Apr 2, 2025 · Updated last year
francislata / unicats
An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".
☆26 · Nov 4, 2023 · Updated 2 years ago
keonlee9420 / evaluate-zero-shot-tts
Evaluation Protocol for Large-Scale Zero-Shot TTS Literature
☆97 · Mar 12, 2025 · Updated last year
oatsu-gh / utau_renderer_with_diff_svc
Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model
☆10 · Aug 24, 2025 · Updated 11 months ago
vtuber-plan / vcvits
Non Parallel Voice Conversion based on VITS
☆24 · Mar 31, 2023 · Updated 3 years ago
Neclow / SERAB
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28 · Dec 16, 2022 · Updated 3 years ago
kleberandrade / evolve-kart-unity
Example of application of genetic algorithm for evolution kart navigation.
☆11 · Nov 21, 2019 · Updated 6 years ago