danijel3 / audio_gui
Simple audio recorder that sends WAV from browser to server in Python (Flask).
☆31Updated 2 years ago
Alternatives and similar repositories for audio_gui
Users that are interested in audio_gui are comparing it to the libraries listed below
Sorting:
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- Python toolkit for speech processing☆68Updated last week
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆230Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated last year
- Python library for handling audio datasets.☆137Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- ☆76Updated 3 years ago
- Flask-based web framework for visualisation and explorative listening of audio.☆53Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated last month
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- A simple audio feature extraction library☆80Updated 5 years ago
- Advanced data structures for handling temporal segments with attached labels.☆113Updated 3 months ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Updated 4 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year
- An online speech recognition extension toolkit of Kaldi☆56Updated 3 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 9 months ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- ☆67Updated 5 months ago
- LogMMSE speech enhancement/noise reduction☆88Updated 5 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- ☆36Updated 2 weeks ago
- Python bindings for SoX, aiming to replicate a subset of the command line sox utility.☆54Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago