danijel3 / audio_gui
Simple audio recorder that sends WAV from browser to server in Python (Flask).
☆31Updated 2 years ago
Alternatives and similar repositories for audio_gui:
Users that are interested in audio_gui are comparing it to the libraries listed below
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Python toolkit for speech processing☆68Updated 3 weeks ago
- This project is about performing Speaker diarization for Hindi Language.☆49Updated 4 years ago
- ☆75Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 3 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆65Updated 4 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.☆41Updated 3 years ago
- Creation of a multi user audio first annotation tool - GSoC 2021☆29Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆242Updated 5 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 7 months ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆107Updated last year
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated 3 weeks ago
- Python library for audio augmentation☆84Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Python library for handling audio datasets.☆137Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- LogMMSE speech enhancement/noise reduction☆88Updated 5 years ago
- ☆79Updated 11 months ago
- A simple audio feature extraction library☆79Updated 5 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated 9 months ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆13Updated 5 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- A Python toolbox for speech features extraction☆161Updated 2 years ago