jim-schwoebel / voicebookView external linksLinks
π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
β387Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for voicebook
Users that are interested in voicebook are comparing it to the libraries listed below
Sorting:
- π A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).β2,131Jun 6, 2024Updated last year
- π¦ Nala is an agile open-source voice assistant framework (20+ actions).β36Aug 8, 2023Updated 2 years ago
- π Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).β31Jun 17, 2024Updated last year
- π€ An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python β¦β151Apr 2, 2025Updated 10 months ago
- βοΈβοΈ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).β90Jun 17, 2024Updated last year
- π An all-purpose eye tracking web application and API for Alzheimer's disease research (3 tasks, <3 mins). 1st place in the 2021 CNT hacβ¦β13Jun 17, 2021Updated 4 years ago
- Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.β35Jul 8, 2024Updated last year
- Crowdsourced Audio Quality Evaluation Toolkitβ55Dec 7, 2022Updated 3 years ago
- Voice Activity Detection (VAD) using deep learning.β204Oct 14, 2019Updated 6 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,384Jun 6, 2024Updated last year
- π₯ π€ The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterancesβ¦β32Apr 2, 2025Updated 10 months ago
- Python library for handling audio datasets.β138Jul 6, 2023Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β15May 19, 2020Updated 5 years ago
- Interspeech 2019 tutorial materialsβ49Sep 26, 2019Updated 6 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separationβ14Nov 16, 2020Updated 5 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphonesβ70Apr 30, 2019Updated 6 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorchβ88Jul 25, 2024Updated last year
- Filtering and Noise Adding Toolβ29May 27, 2022Updated 3 years ago
- A Python toolbox for speech features extractionβ165Feb 8, 2023Updated 3 years ago
- β19May 9, 2019Updated 6 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"β33Apr 11, 2022Updated 3 years ago
- An open-source speech separation and enhancement libraryβ214May 13, 2020Updated 5 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.β22Dec 8, 2022Updated 3 years ago
- Audio Keyword Searchβ12May 5, 2019Updated 6 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based β¦β16Sep 5, 2017Updated 8 years ago
- An audio/acoustic activity detection and audio segmentation toolβ834Dec 11, 2024Updated last year
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.β367Oct 12, 2021Updated 4 years ago
- A library for speech data augmentation in time-domainβ682Aug 30, 2021Updated 4 years ago
- Collection of models and extensions for deployment in PyTorchβ24Nov 20, 2022Updated 3 years ago
- Surrey CVSSP DCASE 2018 Task 2 systemβ20Dec 26, 2022Updated 3 years ago
- Python library for downloading, loading & working with sound datasetsβ350Sep 23, 2025Updated 4 months ago
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. β¦β332Aug 31, 2021Updated 4 years ago
- Open tools and data for cloudless automatic speech recognitionβ446Mar 30, 2021Updated 4 years ago
- Upsampling Artifacts in Neural Audio Synthesis β https://arxiv.org/abs/2010.14356β82Feb 9, 2021Updated 5 years ago
- Python library for audio augmentationβ85Jul 6, 2023Updated 2 years ago
- Python code for handling the Clotho dataset.β85Nov 24, 2020Updated 5 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.β1,867Jun 27, 2022Updated 3 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environmentsβ111Mar 19, 2024Updated last year
- Big Impulse Response Datasetβ156Oct 19, 2022Updated 3 years ago