danijel3 / audio_gui
Simple audio recorder that sends WAV from browser to server in Python (Flask).
☆31 Updated 3 years ago
Alternatives and similar repositories for audio_gui
Users who are interested in audio_gui are comparing it to the libraries listed below.
- Speaker diarization Python system based on binary key speaker modelling ☆60 Updated 3 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ☆90 Updated last year
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆246 Updated 6 years ago
- Python library for handling audio datasets. ☆138 Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆107 Updated 2 years ago
- Python toolkit for speech processing ☆72 Updated 2 weeks ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 Updated 3 years ago
- PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆232 Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆67 Updated 4 years ago
- ☆76 Updated 4 years ago
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts). ☆385 Updated 3 years ago
- Deep Neural Network for Speaker Count Estimation ☆156 Updated 5 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆232 Updated 4 years ago
- Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed. ☆148 Updated 3 years ago
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset ☆173 Updated 6 years ago
- Paper: https://arxiv.org/abs/1702.02285 ☆64 Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆64 Updated 4 years ago
- Advanced data structures for handling temporal segments with attached labels. ☆122 Updated 3 months ago
- Python library for audio augmentation ☆84 Updated 2 years ago
- VCTK multi-speaker tacotron for ICASSP 2020 ☆266 Updated 3 years ago
- A Python toolbox for speech features extraction ☆165 Updated 2 years ago
- PyTorch implementation of DeepMind's WaveRNN model ☆122 Updated 6 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ☆17 Updated 6 years ago
- Python module for syllabifying English ARPABET transcriptions ☆71 Updated 6 years ago
- A simple audio feature extraction library ☆81 Updated 6 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models. ☆87 Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 Updated 2 years ago
- End-to-end spoken language identification out of the box. ☆48 Updated 5 years ago
- Forced Alignments for Common Voice ☆31 Updated 5 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆141 Updated last year