danijel3 / audio_gui
Simple audio recorder that sends WAV from browser to server in Python (Flask).
β31Updated 2 years ago
Alternatives and similar repositories for audio_gui:
Users that are interested in audio_gui are comparing it to the libraries listed below
- β74Updated 3 years ago
- Python library for audio augmentationβ83Updated last year
- π₯ π€ The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterancesβ¦β28Updated 3 years ago
- Speaker diarization python system based on binary key speaker modellingβ61Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speechβ51Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ72Updated 2 years ago
- BurrMill coreβ21Updated 3 years ago
- Python toolkit for speech processingβ68Updated 3 weeks ago
- SERAB: a multi-lingual benchmark for speech emotion recognitionβ28Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdfβ64Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β100Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognitionβ27Updated 4 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.β86Updated 2 years ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice qualityβ21Updated 5 years ago
- A simple audio feature extraction libraryβ79Updated 5 years ago
- Removing background noise in a sound fileβ62Updated 5 years ago
- End-to-end spoken language identification out of the box.β48Updated 4 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.β32Updated 4 years ago
- πΈTTS recipes for different datasetsβ85Updated 2 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Interface for Controllable Expressive Talking Machineβ38Updated last year
- End-to-End Speech Recognition using Neural Networks.β35Updated 5 months ago
- An online speech recognition extension toolkit of Kaldiβ56Updated 3 years ago
- β79Updated 8 months ago
- β34Updated 4 months ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should statβ¦β65Updated 4 years ago
- Real-Time High-Fidelity Speech Synthesis without GPUβ74Updated 6 months ago
- This project is about performing Speaker diarization for Hindi Language.β48Updated 3 years ago
- A lightweight library to compute Diarization Error Rate (DER).β59Updated last year
- Deep learning for Text to Speechβ26Updated 3 years ago