jim-schwoebel / voicebookLinks
π£οΈ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
β381Updated 2 years ago
Alternatives and similar repositories for voicebook
Users that are interested in voicebook are comparing it to the libraries listed below
Sorting:
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ209Updated 3 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β535Updated 3 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Pythonβ178Updated 3 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networksβ444Updated 4 years ago
- feature extraction from speech signalsβ374Updated this week
- Utterance-level Aggregation For Speaker Recognition In The Wildβ368Updated 2 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.β252Updated 2 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech procβ¦β366Updated this week
- Voice Activity Detection based on Deep Learning & TensorFlowβ364Updated 2 years ago
- A library for speech data augmentation in time-domainβ661Updated 3 years ago
- Voice Activity Detector in Pythonβ475Updated 4 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorchβ211Updated 4 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detectionβ379Updated 2 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1β111Updated 6 years ago
- Python library for handling audio datasets.β138Updated last year
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. β¦β318Updated 3 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languagesβ471Updated 5 years ago
- Tools for Speech Enhancement integrated with Kaldiβ413Updated last year
- Problem Agnostic Speech Encoderβ441Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.xβ72Updated 2 years ago
- Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.β310Updated 3 years ago
- π§ Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)β224Updated 4 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systemsβ210Updated 3 months ago
- Deep neural network based speech enhancement toolkitβ216Updated 5 years ago
- Voice Emotion Detector that detects emotion from audio speech using one dimensional CNNs (convolutional neural networks) using keras and β¦β108Updated 7 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ230Updated 2 years ago
- Speaker diarization python system based on binary key speaker modellingβ61Updated 3 years ago
- Speech Enhancement Generative Adversarial Network in PyTorchβ394Updated last year
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team atβ¦β416Updated 2 months ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglowβ128Updated 4 years ago