danijel3 / audio_gui
Simple audio recorder that sends WAV from browser to server in Python (Flask).
β32Updated 2 years ago
Related projects β
Alternatives and complementary repositories for audio_gui
- π₯ π€ The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterancesβ¦β28Updated 3 years ago
- Advanced data structures for handling temporal segments with attached labels.β99Updated 5 months ago
- Python toolkit for speech processingβ67Updated this week
- Speaker diarization python system based on binary key speaker modellingβ61Updated 2 years ago
- End-to-End Speech Recognition using Neural Networks.β35Updated 2 months ago
- Python library for audio augmentationβ83Updated last year
- [deprecated] Pretrained models for pyannote-audio 1.xβ71Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) togetherβ42Updated last year
- Phoneme prediction from speech mel-spectrograms using RNN.β13Updated 5 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ224Updated 2 years ago
- β74Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β100Updated last year
- It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-raterβ11Updated 5 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approachβ66Updated 7 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β81Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdfβ64Updated 3 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.β32Updated 4 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.β137Updated 4 years ago
- An online speech recognition extension toolkit of Kaldiβ57Updated 3 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clustβ¦β43Updated 4 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REPβ¦β32Updated 9 months ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)β26Updated 10 years ago
- Python library for handling audio datasets.β131Updated last year
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice qualityβ20Updated 5 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speechβ51Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ106Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognitionβ27Updated 4 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.β85Updated 2 years ago
- Flask-based web framework for visualisation and explorative listening of audio.β51Updated last year