ritazh / EchoML
π A web app to play, visualize, and annotate your audio files for machine learning
β120Updated 5 years ago
Alternatives and similar repositories for EchoML:
Users that are interested in EchoML are comparing it to the libraries listed below
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ64Updated 4 years ago
- Speaker diarization scripts, based on AaltoASRβ190Updated 6 years ago
- Speaker diarization via transfer learningβ27Updated 6 years ago
- [deprecated] Pretrained models for pyannote-audio 1.xβ72Updated 2 years ago
- Adapting your own Language Model for Kaldiβ63Updated 6 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learningβ101Updated 2 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systemsβ205Updated 2 months ago
- Machine Learning Sound Classifierβ135Updated 5 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 5 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017β46Updated 7 years ago
- A Collection of Speech Corpus for ASR and TTSβ113Updated 7 years ago
- An open-source speech separation and enhancement libraryβ211Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ100Updated 2 months ago
- Speech-to-text based on wav2letter built for transfer learningβ97Updated 2 years ago
- Python library for audio augmentationβ83Updated last year
- Tool for creation, manipulation and maintenance of voice corporaβ81Updated 11 months ago
- maracas is a library for corrupting audio files with additive and convolutive noise.β72Updated 7 years ago
- Trims .wav audio files to the loudest section of a given lengthβ96Updated 7 years ago
- Python library for handling audio datasets.β137Updated last year
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should statβ¦β65Updated 4 years ago
- An in-browser app for labeling audio clips at random, using Docker and Flask.β53Updated 7 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable woβ¦β69Updated 7 years ago
- Server framework for Kaldi ASR Toolkitβ97Updated last year
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learnβ143Updated 2 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone dataβ96Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streamsβ68Updated 6 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β129Updated 4 years ago
- A collection of python scripts for extracting and analyzing acoustics from audio files.β97Updated last year
- β38Updated 4 years ago