jim-schwoebel/voicebook

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jim-schwoebel/voicebook)

jim-schwoebel / voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

☆389

Alternatives and similar repositories for voicebook

Users that are interested in voicebook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jim-schwoebel / nala
View on GitHub
🦁 Nala is an agile open-source voice assistant framework (20+ actions).
☆36Aug 8, 2023Updated 2 years ago
jim-schwoebel / voice_datasets
View on GitHub
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
☆2,212Jun 6, 2024Updated 2 years ago
jim-schwoebel / audioset_models
View on GitHub
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
☆31Jun 17, 2024Updated 2 years ago
jim-schwoebel / voice_gender_detection
View on GitHub
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
☆91Jun 17, 2024Updated 2 years ago
jim-schwoebel / allie
View on GitHub
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python …
☆153Apr 2, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jim-schwoebel / voiceome
View on GitHub
🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…
☆32Apr 2, 2025Updated last year
JuanFMontesinos / torch_mir_eval
View on GitHub
Backpropagable pytorch implementation of https://craffel.github.io/mir_eval/.
☆35Jul 8, 2024Updated 2 years ago
nostalgia-cnt / vibe
View on GitHub
👀 An all-purpose eye tracking web application and API for Alzheimer's disease research (3 tasks, <3 mins). 1st place in the 2021 CNT hac…
☆14Jun 17, 2021Updated 5 years ago
nicklashansen / voice-activity-detection
View on GitHub
Voice Activity Detection (VAD) using deep learning.
☆204Oct 14, 2019Updated 6 years ago
interactiveaudiolab / CAQE
View on GitHub
Crowdsourced Audio Quality Evaluation Toolkit
☆55Dec 7, 2022Updated 3 years ago
i3thuan5 / FaNT
View on GitHub
Filtering and Noise Adding Tool
☆29May 27, 2022Updated 4 years ago
coqui-ai / open-speech-corpora
View on GitHub
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,398Jun 6, 2024Updated 2 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
etzinis / biased_separation
View on GitHub
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14Nov 16, 2020Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
speechLabBcCuny / onssen
View on GitHub
An open-source speech separation and enhancement library
☆214May 13, 2020Updated 6 years ago
SIP-Lab / CNN-VAD
View on GitHub
A Convolutional Neural Network based Voice Activity Detector for Smartphones
☆70Apr 30, 2019Updated 7 years ago
kan-bayashi / INTERSPEECH19_TUTORIAL
View on GitHub
Interspeech 2019 tutorial materials
☆49Sep 26, 2019Updated 6 years ago
amsehili / auditok
View on GitHub
An voice activity detection and audio segmentation tool
☆854Updated this week
jim-schwoebel / download_audioset
View on GitHub
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
☆106Aug 1, 2023Updated 2 years ago
tqbl / dcase2018_task2
View on GitHub
Surrey CVSSP DCASE 2018 Task 2 system
☆20Dec 26, 2022Updated 3 years ago
ynop / audiomate
View on GitHub
Python library for handling audio datasets.
☆139Jul 6, 2023Updated 3 years ago
bootphon / shennong
View on GitHub
A Python toolbox for speech features extraction
☆166Feb 8, 2023Updated 3 years ago
dr-pato / audio_visual_speech_enhancement
View on GitHub
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112Mar 19, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sarangzambare / hey-siri
View on GitHub
This repository is for wake-word detection in speech using recurrent neural networks
☆18Feb 25, 2019Updated 7 years ago
AppleHolic / pytorch_sound
View on GitHub
Sound Related Deep Learning Tasks boosting repository with pytorch
☆88Jul 25, 2024Updated 2 years ago
gooofy / py-nltools
View on GitHub
A collection of basic python modules for spoken natural language processing
☆55Dec 1, 2019Updated 6 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
AppleHolic / audioset_augmentor
View on GitHub
Sound augmentation using Large-scale audio dataset (Audioset)
☆45Jun 29, 2021Updated 5 years ago
edufonseca / icassp19
View on GitHub
Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"
☆99Jul 11, 2019Updated 7 years ago
faroit / python_audio_loading_benchmark
View on GitHub
Benchmark popular audio i/o packages
☆152Dec 19, 2023Updated 2 years ago
CoEDL / kaldi_helpers
View on GitHub
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15May 19, 2020Updated 6 years ago
tommy-fox / streaming-source-separation
View on GitHub
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
☆21Dec 8, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
qiuqiangkong / audioset_classification
View on GitHub
☆229Feb 9, 2020Updated 6 years ago
gooofy / zamia-speech
View on GitHub
Open tools and data for cloudless automatic speech recognition
☆449Mar 30, 2021Updated 5 years ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
CODEJIN / multi_speaker_tts
View on GitHub
Implementation of Multi speaker TTS
☆50Jan 2, 2021Updated 5 years ago
jtkim-kaist / VAD
View on GitHub
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869Jun 9, 2021Updated 5 years ago
taylorlu / AudioKWS
View on GitHub
Audio Keyword Search
☆12May 5, 2019Updated 7 years ago
jim-schwoebel / sound_event_detection
View on GitHub
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆47Feb 20, 2022Updated 4 years ago