midas-research / audino
Open source audio annotation tool for humans
☆1,089Updated 2 months ago
Alternatives and similar repositories for audino:
Users that are interested in audino are comparing it to the libraries listed below
- Novoic's audio feature extraction library☆436Updated 3 years ago
- An On-Premises, Streaming Speech Recognition System☆683Updated 3 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆943Updated 7 months ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 3 years ago
- Tutorial covering Open Source tools for Source Separation.☆369Updated 10 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,323Updated 10 months ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆860Updated last year
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆797Updated 3 months ago
- ☆674Updated 6 months ago
- We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new …☆1,284Updated last year
- Gecko - A Tool for Effective Annotation of Human Conversations☆280Updated 2 years ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,194Updated 8 months ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,726Updated 6 months ago
- A library for speech data augmentation in time-domain☆656Updated 3 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆471Updated 5 years ago
- Large, modern dataset for speech recognition☆671Updated last year
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆129Updated 4 years ago
- A JavaScript interface for annotating and labeling audio files.☆454Updated 5 years ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆757Updated 4 years ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…☆1,757Updated 2 years ago
- Tools for handling speech data in machine learning projects.☆1,007Updated 2 weeks ago
- speech to text benchmark framework☆642Updated 2 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆531Updated 2 years ago
- The PyTorch-based audio source separation toolkit for researchers☆2,360Updated 3 months ago
- ⏩ Generating speech in a single forward pass without any attention!☆579Updated 8 months ago
- List of speech synthesis papers.☆1,036Updated last year
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆518Updated 9 months ago
- Efficient neural speech synthesis☆1,167Updated 7 months ago
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)☆716Updated 2 months ago