A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.
☆23 · May 12, 2026 · Updated last month
Alternatives and similar repositories for autrainer
Users that are interested in autrainer are comparing it to the libraries listed below.
- Format to store media files and annotations ☆12 · May 12, 2026 · Updated last month
- label and annotate large number of speech data files ☆12 · May 5, 2021 · Updated 5 years ago
- Manage audio and video datasets ☆36 · Jun 8, 2026 · Updated last week
- Applet for explaining z transform ☆14 · Apr 26, 2022 · Updated 4 years ago
- Emofilt is a program to simulate emotional arousal with speech synthesis based on the free-for-non-commercial-use MBROLA synthesis engine… ☆14 · Mar 17, 2022 · Updated 4 years ago
- Handling audio files in Python ☆39 · May 20, 2026 · Updated 3 weeks ago
- DDSP: Differentiable Digital Signal Processing forked on 18.6.2020, with personal adaptions ☆11 · Oct 4, 2020 · Updated 5 years ago
- Machine learning speaker characteristics ☆46 · Updated this week
- German prenames as CSV data ☆13 · Mar 6, 2018 · Updated 8 years ago
- Evaluation of a number of loudness meter implementations ☆13 · Aug 28, 2021 · Updated 4 years ago
- Source code for "Modulation Extraction for LFO-driven Audio Effects". ☆32 · Mar 25, 2026 · Updated 2 months ago
- ☆31 · Jun 30, 2023 · Updated 2 years ago
- ☆56 · Oct 8, 2022 · Updated 3 years ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" ☆21 · Sep 7, 2025 · Updated 9 months ago
- ☆15 · Sep 24, 2022 · Updated 3 years ago
- Ravetable synthesis - Latent signal processing ☆38 · Sep 25, 2025 · Updated 8 months ago
- KEMAR HRTFs with low frequency correction ☆13 · Mar 20, 2017 · Updated 9 years ago
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper. ☆39 · Jul 8, 2024 · Updated last year
- Dynamic Binaural Synthesis on Android Mobile Devices ☆14 · Mar 21, 2016 · Updated 10 years ago
- Jax implementation of a flexible audio loudness meter in Python with implementation of ITU-R BS.1770-4 loudness algorithm ☆13 · Jan 29, 2025 · Updated last year
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances… ☆32 · Apr 2, 2025 · Updated last year
- A GPU accelerated and torch based audio DSP library ☆134 · Updated this week
- ☆17 · Jun 11, 2025 · Updated last year
- Fast, differentiable, and GPU-accelerated simulation framework for modelling the dynamics of strings, membranes, and plates using modal m… ☆14 · May 13, 2026 · Updated last month
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆72 · Dec 18, 2021 · Updated 4 years ago
- HpTF compensation filters for binaural synthesis with Matlab script for creation of filters out of measurement data ☆16 · Mar 21, 2017 · Updated 9 years ago
- BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data ☆12 · May 25, 2022 · Updated 4 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 · Feb 28, 2026 · Updated 3 months ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases". ☆16 · Aug 14, 2020 · Updated 5 years ago
- ☆11 · Jan 22, 2017 · Updated 9 years ago
- A screaming vocal samples dataset. ☆13 · Apr 14, 2023 · Updated 3 years ago
- A Rust binding for Wwise ☆17 · Jan 4, 2023 · Updated 3 years ago
- ☆29 · Oct 17, 2024 · Updated last year
- Auto-KWS 2021 Challenge 1st place solution. ☆11 · Jul 20, 2021 · Updated 4 years ago
- Python wrapper for Espeak and Mbrola, for simple local TTS ☆30 · Jan 1, 2025 · Updated last year
- Embedded Tajweed annotation for the Qur'an ☆11 · Nov 30, 2025 · Updated 6 months ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets. ☆31 · May 22, 2026 · Updated 3 weeks ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 · Oct 19, 2022 · Updated 3 years ago
- A framework for overviewing the performance of F0 estimators ☆19 · Sep 10, 2016 · Updated 9 years ago