danijel3 / audio_gui
Simple audio recorder that sends WAV from browser to server in Python (Flask).
☆31 Updated 3 years ago
Alternatives and similar repositories for audio_gui
Users who are interested in audio_gui are comparing it to the libraries listed below.
- Speaker diarization Python system based on binary key speaker modelling ☆60 Updated 4 years ago
- Python library for handling audio datasets. ☆139 Updated 2 years ago
- PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆232 Updated 3 years ago
- A simple audio feature extraction library ☆81 Updated 6 years ago
- Articulatory features estimation using Listen Attend and Spell architecture. ☆33 Updated 5 years ago
- Python toolkit for speech processing ☆72 Updated 3 weeks ago
- Advanced data structures for handling temporal segments with attached labels. ☆124 Updated 4 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆106 Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆68 Updated 4 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech> ☆19 Updated 4 years ago
- Festvox voice building tools ☆108 Updated 6 months ago
- A lightweight library to compute Diarization Error Rate (DER). ☆62 Updated 3 weeks ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances… ☆32 Updated 10 months ago
- Machine learning experiment to perform gender classification from raw audio. ☆23 Updated 7 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 Updated 2 years ago
- Python library for audio augmentation ☆85 Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆26 Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech ☆51 Updated 4 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆246 Updated 6 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ☆90 Updated last year
- Python module for syllabifying English ARPABET transcriptions ☆72 Updated 6 years ago
- Interface for Controllable Expressive Talking Machine ☆40 Updated 4 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 Updated last year
- Flask-based web framework for visualisation and explorative listening of audio. ☆54 Updated 2 years ago
- ☆76 Updated 4 years ago
- PyTorch implementation of DeepMind's WaveRNN model ☆123 Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆64 Updated 5 years ago
- Multistream CNN for Robust Acoustic Modeling ☆40 Updated 4 years ago
- A collection of Audio and Speech pre-trained models. ☆193 Updated 5 years ago