danijel3 / audio_guiLinks
Simple audio recorder that sends WAV from browser to server in Python (Flask).
☆31Updated 3 years ago
Alternatives and similar repositories for audio_gui
Users that are interested in audio_gui are comparing it to the libraries listed below
Sorting:
- Python library for handling audio datasets.☆137Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- Python toolkit for speech processing☆69Updated 2 weeks ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 11 months ago
- SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha…☆75Updated 6 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆69Updated 7 years ago
- Deep Neural Network for Speaker Count Estimation☆153Updated 4 years ago
- A simple audio feature extraction library☆80Updated 6 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆67Updated 5 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆63Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆87Updated last year
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 6 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆229Updated 3 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆29Updated 4 months ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 5 years ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.☆148Updated 3 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- Advanced data structures for handling temporal segments with attached labels.☆114Updated 6 months ago
- A Python toolbox for speech features extraction☆164Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆176Updated 8 months ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆129Updated 4 years ago
- LogMMSE speech enhancement/noise reduction☆88Updated 5 years ago