danijel3 / audio_guiLinks
Simple audio recorder that sends WAV from browser to server in Python (Flask).
β31Updated 2 years ago
Alternatives and similar repositories for audio_gui
Users that are interested in audio_gui are comparing it to the libraries listed below
Sorting:
- π₯ π€ The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterancesβ¦β29Updated 2 months ago
- Python toolkit for speech processingβ69Updated last week
- β76Updated 3 years ago
- Creation of a multi user audio first annotation tool - GSoC 2021β29Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice qualityβ22Updated 6 years ago
- Speaker diarization python system based on binary key speaker modellingβ60Updated 3 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Updated 10 years ago
- β56Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognitionβ28Updated 2 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Speaker diarization and speech to textβ14Updated 4 years ago
- Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.β85Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdfβ65Updated 4 years ago
- A lightweight library to compute Diarization Error Rate (DER).β59Updated last year
- Python library for handling audio datasets.β138Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.β54Updated 2 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" presentβ¦β25Updated 2 years ago
- Feature extractor for DL speech processing.β65Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ72Updated 2 years ago
- Python bindings for SoX, aiming to replicate a subset of the command line sox utility.β54Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speechβ51Updated 3 years ago
- Flask-based web framework for visualisation and explorative listening of audio.β53Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervβ¦β141Updated 3 weeks ago
- Manage audio and video datasetsβ31Updated this week
- β80Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognitionβ27Updated 4 years ago
- End-to-end spoken language identification out of the box.β48Updated 4 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Textβ242Updated 5 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decodingβ75Updated 3 years ago