π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
β388Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for voicebook
Users that are interested in voicebook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β36Aug 8, 2023Updated 2 years ago
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β2,155Jun 6, 2024Updated last year
- π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).β30Jun 17, 2024Updated last year
- βοΈβοΈ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).β91Jun 17, 2024Updated last year
- π€ An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python β¦β155Apr 2, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- π₯ π€ The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterancesβ¦β32Apr 2, 2025Updated last year
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.β35Jul 8, 2024Updated last year
- π An all-purpose eye tracking web application and API for Alzheimer's disease research (3 tasks, <3 mins). 1st place in the 2021 CNT hacβ¦β13Jun 17, 2021Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning.β204Oct 14, 2019Updated 6 years ago
- Crowdsourced Audio Quality Evaluation Toolkitβ55Dec 7, 2022Updated 3 years ago
- Filtering and Noise Adding Toolβ29May 27, 2022Updated 3 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,391Jun 6, 2024Updated last year
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based β¦β16Sep 5, 2017Updated 8 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separationβ14Nov 16, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An open-source speech separation and enhancement libraryβ214May 13, 2020Updated 5 years ago
- π€ quick library to extract pause lengths from audio files.β33Jun 5, 2019Updated 6 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphonesβ70Apr 30, 2019Updated 6 years ago
- Interspeech 2019 tutorial materialsβ49Sep 26, 2019Updated 6 years ago
- An audio/acoustic activity detection and audio segmentation toolβ844Apr 9, 2026Updated last week
- Surrey CVSSP DCASE 2018 Task 2 systemβ20Dec 26, 2022Updated 3 years ago
- Python library for handling audio datasets.β138Jul 6, 2023Updated 2 years ago
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β106Aug 1, 2023Updated 2 years ago
- A Python toolbox for speech features extractionβ165Feb 8, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository is for wake-word detection in speech using recurrent neural networksβ17Feb 25, 2019Updated 7 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorchβ88Jul 25, 2024Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ15Mar 26, 2022Updated 4 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environmentsβ111Mar 19, 2024Updated 2 years ago
- A collection of basic python modules for spoken natural language processingβ55Dec 1, 2019Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)β45Jun 29, 2021Updated 4 years ago
- Open tools and data for cloudless automatic speech recognitionβ446Mar 30, 2021Updated 5 years ago
- A library for speech data augmentation in time-domainβ685Aug 30, 2021Updated 4 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"β99Jul 11, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Benchmark popular audio i/o packagesβ151Dec 19, 2023Updated 2 years ago
- Implementation of Multi speaker TTSβ51Jan 2, 2021Updated 5 years ago
- β231Feb 9, 2020Updated 6 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.β869Jun 9, 2021Updated 4 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.β21Dec 8, 2022Updated 3 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.β1,864Jun 27, 2022Updated 3 years ago
- 2019 κ΅μ΄κ²½μ§λν νκ΅μ΄ μ쑴ꡬ문 λΆμ λμ(λ¬Έμ²΄λΆ μ₯κ΄μ)β15Oct 26, 2022Updated 3 years ago