danijel3 / audio_guiLinks
Simple audio recorder that sends WAV from browser to server in Python (Flask).
☆31Updated 2 years ago
Alternatives and similar repositories for audio_gui
Users that are interested in audio_gui are comparing it to the libraries listed below
Sorting:
- Python toolkit for speech processing☆68Updated last week
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- ☆80Updated last year
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Machine learning experiment to perform gender classification from raw audio.☆23Updated 6 years ago
- Python library for handling audio datasets.☆138Updated last year
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆243Updated 5 years ago
- ☆76Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆14Updated 6 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 6 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆230Updated 2 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 4 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆83Updated last year
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 4 years ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality☆22Updated 5 years ago
- Python library for audio augmentation☆84Updated last year
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆268Updated last year
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆45Updated 4 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments☆43Updated 4 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 9 months ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 10 months ago