danijel3 / audio_guiLinks
Simple audio recorder that sends WAV from browser to server in Python (Flask).
☆31Updated 3 years ago
Alternatives and similar repositories for audio_gui
Users that are interested in audio_gui are comparing it to the libraries listed below
Sorting:
- Python library for handling audio datasets.☆138Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Updated 3 years ago
- Python toolkit for speech processing☆72Updated this week
- Advanced data structures for handling temporal segments with attached labels.☆122Updated 2 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated 2 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆122Updated 6 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Updated 6 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆232Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 5 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Festvox voice building tools☆106Updated 3 months ago
- Forced Alignments for Common Voice☆31Updated 5 years ago
- Python library for audio augmentation☆84Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- ☆76Updated 4 years ago
- ☆37Updated this week
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 6 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆130Updated 4 years ago
- Python module for syllabifying English ARPABET transcriptions☆69Updated 6 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 5 years ago
- A simple audio feature extraction library☆81Updated 6 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated 2 years ago
- Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible…☆43Updated last year
- python wrapper for rnnoise library☆49Updated 2 years ago
- ☆56Updated 2 years ago
- A Python toolbox for speech features extraction☆165Updated 2 years ago