hipstas / audio-tagging-toolkit
A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archive of Public Broadcasting.
☆17 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for audio-tagging-toolkit
- A tutorial diphone synthesizer in Python ☆23 · Updated 5 years ago
- Source code to accompany my paper "Poetic sound similarity vectors using phonetic features" ☆167 · Updated 7 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition… ☆97 · Updated 2 years ago
- ISMIR 2016 Late-Breaking/Demo Papers ☆25 · Updated 8 years ago
- A simple toolkit for speaker segmentation and identification ☆30 · Updated 11 years ago
- A Dockerized Jupyter notebook environment with pre-installed audio machine learning tools ☆12 · Updated 5 years ago
- ☆16 · Updated 5 years ago
- Experiment in automatic insertion of timed transcript corrections ☆21 · Updated 7 years ago
- Repo for NYPL's 2016 event, Open Audio Weekend ☆14 · Updated 8 years ago
- Extract polyphonic musical motives from audio recordings ☆20 · Updated 5 years ago
- Singing voice analysis and detection tools ☆21 · Updated 9 years ago
- ☆8 · Updated 6 years ago
- Code for "Extracting Ground Truth Information from MIDI Files: A MIDIfesto" ☆18 · Updated 8 years ago
- A Docker image for the Kaldi speech recognition tool, plus training data from Pop Up Archive ☆20 · Updated 5 years ago
- Automatic Music Performance Analysis and Comparison Toolkit ☆44 · Updated 3 years ago
- Database of annotated field recording samples that can be used for training audio labelling algorithms ☆10 · Updated 5 years ago
- Repository for subjective and objective evaluation of source separation algorithms ☆12 · Updated 6 years ago
- Api.ai English Speech Recognition (ASR) Model for Kaldi ☆36 · Updated 3 years ago
- Calculates and compares perceptual sound texture statistics ☆16 · Updated 3 years ago
- DEPRECATED: A webapp for collecting speech samples for voice recognition testing and training ☆20 · Updated 5 years ago
- Barista is an open-source framework for concurrent speech processing ☆36 · Updated 10 years ago
- The RadioTalk dataset of talk radio transcripts ☆56 · Updated 3 years ago
- EESEN-based offline transcriber VM using models trained on TEDLIUM and Cantab Research ☆49 · Updated 5 years ago
- List of Reproducible Audio Research Papers ☆72 · Updated 6 years ago
- Humphrey, E. J. "An Exploration of Deep Learning in Music Informatics." (2015) New York University. ☆14 · Updated 8 years ago
- Minimal module for computing audio spectrograms ☆15 · Updated 5 years ago
- ☆65 · Updated 10 years ago
- A Praat plug-in for performing interactive phonetic forced alignment ☆26 · Updated 6 years ago