A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
★388 · Dec 8, 2022 · Updated 3 years ago
Alternatives and similar repositories for voicebook
Users that are interested in voicebook are comparing it to the libraries listed below.
- Nala is an agile open-source voice assistant framework (20+ actions). ★36 · Aug 8, 2023 · Updated 2 years ago
- A comprehensive list of open-source datasets for voice and sound computing (95+ datasets). ★2,201 · Jun 6, 2024 · Updated 2 years ago
- Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes). ★30 · Jun 17, 2024 · Updated 2 years ago
- Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ★91 · Jun 17, 2024 · Updated 2 years ago
- An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python … ★153 · Apr 2, 2025 · Updated last year
- The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances… ★32 · Apr 2, 2025 · Updated last year
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/. ★35 · Jul 8, 2024 · Updated last year
- An all-purpose eye tracking web application and API for Alzheimer's disease research (3 tasks, <3 mins). 1st place in the 2021 CNT hac… ★13 · Jun 17, 2021 · Updated 5 years ago
- Voice Activity Detection (VAD) using deep learning. ★204 · Oct 14, 2019 · Updated 6 years ago
- Crowdsourced Audio Quality Evaluation Toolkit ★55 · Dec 7, 2022 · Updated 3 years ago
- Filtering and Noise Adding Tool ★29 · May 27, 2022 · Updated 4 years ago
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ★1,398 · Jun 6, 2024 · Updated 2 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based … ★16 · Sep 5, 2017 · Updated 8 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ★14 · Nov 16, 2020 · Updated 5 years ago
- An open-source speech separation and enhancement library ★214 · May 13, 2020 · Updated 6 years ago
- Quick library to extract pause lengths from audio files. ★33 · Jun 5, 2019 · Updated 7 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones ★70 · Apr 30, 2019 · Updated 7 years ago
- Interspeech 2019 tutorial materials ★49 · Sep 26, 2019 · Updated 6 years ago
- An audio/acoustic activity detection and audio segmentation tool ★851 · May 14, 2026 · Updated last month
- Surrey CVSSP DCASE 2018 Task 2 system ★20 · Dec 26, 2022 · Updated 3 years ago
- Python library for handling audio datasets. ★139 · Jul 6, 2023 · Updated 2 years ago
- This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ★106 · Aug 1, 2023 · Updated 2 years ago
- A Python toolbox for speech features extraction ★165 · Feb 8, 2023 · Updated 3 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ★17 · Feb 25, 2019 · Updated 7 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorch ★88 · Jul 25, 2024 · Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ★16 · Mar 26, 2022 · Updated 4 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments ★112 · Mar 19, 2024 · Updated 2 years ago
- Sound augmentation using Large-scale audio dataset (Audioset) ★45 · Jun 29, 2021 · Updated 4 years ago
- A collection of basic python modules for spoken natural language processing ★55 · Dec 1, 2019 · Updated 6 years ago
- Open tools and data for cloudless automatic speech recognition ★447 · Mar 30, 2021 · Updated 5 years ago
- A library for speech data augmentation in time-domain ★689 · Aug 30, 2021 · Updated 4 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels" ★99 · Jul 11, 2019 · Updated 6 years ago
- Benchmark popular audio i/o packages ★152 · Dec 19, 2023 · Updated 2 years ago
- Implementation of Multi speaker TTS ★51 · Jan 2, 2021 · Updated 5 years ago
- ★229 · Feb 9, 2020 · Updated 6 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset. ★869 · Jun 9, 2021 · Updated 5 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture. ★21 · Dec 8, 2022 · Updated 3 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition. ★1,862 · Jun 27, 2022 · Updated 3 years ago
- Python library for audio augmentation ★85 · Jul 6, 2023 · Updated 2 years ago