danijel3 / audio_gui
Simple audio recorder that sends WAV from browser to server in Python (Flask).
☆31 Updated 3 years ago
Alternatives and similar repositories for audio_gui
Users who are interested in audio_gui are comparing it to the libraries listed below.
- Speaker diarization Python system based on binary key speaker modelling ☆60 Updated 4 years ago
- Python library for handling audio datasets. ☆139 Updated 2 years ago
- A simple audio feature extraction library ☆81 Updated 6 years ago
- Python toolkit for speech processing ☆72 Updated 3 weeks ago
- Advanced data structures for handling temporal segments with attached labels. ☆124 Updated 4 months ago
- Deep Neural Network for Speaker Count Estimation ☆157 Updated 5 years ago
- Articulatory features estimation using Listen Attend and Spell architecture. ☆33 Updated 5 years ago
- Machine learning experiment to perform gender classification from raw audio. ☆23 Updated 7 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ☆90 Updated last year
- PyTorch implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆232 Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆106 Updated 2 years ago
- Python module for syllabifying English ARPABET transcriptions ☆72 Updated 6 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 Updated 3 years ago
- Audio feature extraction and classification ☆227 Updated 2 years ago
- Python library for audio augmentation ☆85 Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆246 Updated 6 years ago
- ☆76 Updated 4 years ago
- A Python toolbox for speech features extraction ☆165 Updated 3 years ago
- A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting … ☆345 Updated 3 weeks ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96 Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆265 Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆68 Updated 4 years ago
- Flask-based web framework for visualisation and explorative listening of audio. ☆54 Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19) ☆148 Updated 2 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo… ☆70 Updated 8 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆232 Updated 4 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ☆17 Updated 6 years ago
- Identifying people from small audio fragments ☆171 Updated 5 years ago
- How to create your own model for vosk ☆75 Updated 4 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN) ☆62 Updated 5 years ago