midas-research / audino
Open source audio annotation tool for humans
☆1,092Updated 3 months ago
Alternatives and similar repositories for audino
Users that are interested in audino are comparing it to the libraries listed below
Sorting:
- An On-Premises, Streaming Speech Recognition System☆683Updated 3 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆943Updated 8 months ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆293Updated 3 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,324Updated 11 months ago
- Novoic's audio feature extraction library☆436Updated 3 years ago
- speech to text benchmark framework☆646Updated 3 months ago
- ☆674Updated 7 months ago
- Tutorial covering Open Source tools for Source Separation.☆370Updated 11 months ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Updated 3 years ago
- CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender …☆800Updated 4 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆531Updated 2 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆365Updated 5 months ago
- A library for speech data augmentation in time-domain☆659Updated 3 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆282Updated 2 years ago
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)☆723Updated 2 months ago
- A collection of links and notes on forced alignment tools☆907Updated 3 years ago
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).☆380Updated 2 years ago
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆442Updated 4 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…☆861Updated last year
- We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new …☆1,289Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,740Updated 6 months ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆484Updated 3 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr…☆898Updated last year
- A JavaScript interface for annotating and labeling audio files.☆456Updated 5 years ago
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,137Updated 9 months ago
- List of speech synthesis papers.☆1,039Updated last year
- Tools for handling speech data in machine learning projects.☆1,018Updated last week
- feature extraction from speech signals☆373Updated last week
- Large, modern dataset for speech recognition☆674Updated last year
- g2p: English Grapheme To Phoneme Conversion☆849Updated 2 years ago