A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
★388 · Dec 8, 2022 · Updated 3 years ago
Alternatives and similar repositories for voicebook
Users that are interested in voicebook are comparing it to the libraries listed below.
- Nala is an agile open-source voice assistant framework (20+ actions). ★36 · Aug 8, 2023 · Updated 2 years ago
- A comprehensive list of open-source datasets for voice and sound computing (95+ datasets). ★2,141 · Jun 6, 2024 · Updated last year
- Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes). ★31 · Jun 17, 2024 · Updated last year
- Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ★90 · Jun 17, 2024 · Updated last year
- An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python … ★153 · Apr 2, 2025 · Updated 11 months ago
- The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances… ★32 · Apr 2, 2025 · Updated 11 months ago
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/. ★35 · Jul 8, 2024 · Updated last year
- An all-purpose eye tracking web application and API for Alzheimer's disease research (3 tasks, <3 mins). 1st place in the 2021 CNT hac… ★13 · Jun 17, 2021 · Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning. ★204 · Oct 14, 2019 · Updated 6 years ago
- Crowdsourced Audio Quality Evaluation Toolkit ★55 · Dec 7, 2022 · Updated 3 years ago
- Filtering and Noise Adding Tool ★29 · May 27, 2022 · Updated 3 years ago
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ★1,390 · Jun 6, 2024 · Updated last year
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based … ★16 · Sep 5, 2017 · Updated 8 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ★14 · Nov 16, 2020 · Updated 5 years ago
- An open-source speech separation and enhancement library ★214 · May 13, 2020 · Updated 5 years ago
- Quick library to extract pause lengths from audio files. ★32 · Jun 5, 2019 · Updated 6 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones ★70 · Apr 30, 2019 · Updated 6 years ago
- Interspeech 2019 tutorial materials ★49 · Sep 26, 2019 · Updated 6 years ago
- An audio/acoustic activity detection and audio segmentation tool ★843 · Updated this week
- Surrey CVSSP DCASE 2018 Task 2 system ★20 · Dec 26, 2022 · Updated 3 years ago
- Python library for handling audio datasets. ★138 · Jul 6, 2023 · Updated 2 years ago
- This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ★105 · Aug 1, 2023 · Updated 2 years ago
- A Python toolbox for speech features extraction ★165 · Feb 8, 2023 · Updated 3 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ★17 · Feb 25, 2019 · Updated 7 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorch ★88 · Jul 25, 2024 · Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge ★15 · Mar 26, 2022 · Updated 4 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments ★111 · Mar 19, 2024 · Updated 2 years ago
- A collection of basic python modules for spoken natural language processing ★55 · Dec 1, 2019 · Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset) ★45 · Jun 29, 2021 · Updated 4 years ago
- Open tools and data for cloudless automatic speech recognition ★446 · Mar 30, 2021 · Updated 5 years ago
- A library for speech data augmentation in time-domain ★684 · Aug 30, 2021 · Updated 4 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels" ★99 · Jul 11, 2019 · Updated 6 years ago
- Benchmark popular audio i/o packages ★151 · Dec 19, 2023 · Updated 2 years ago
- Implementation of Multi speaker TTS ★51 · Jan 2, 2021 · Updated 5 years ago
- ★231 · Feb 9, 2020 · Updated 6 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset. ★869 · Jun 9, 2021 · Updated 4 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture. ★21 · Dec 8, 2022 · Updated 3 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition. ★1,865 · Jun 27, 2022 · Updated 3 years ago
- Python library for audio augmentation ★85 · Jul 6, 2023 · Updated 2 years ago