dsgou / annotator
Video and audio annotator
☆28Updated 7 years ago
Alternatives and similar repositories for annotator:
Users that are interested in annotator are comparing it to the libraries listed below
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- Audio Classification using Image Classification☆48Updated 5 years ago
- The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- mobile part of the open SSI framework☆12Updated 6 years ago
- 🔉 A web app to play, visualize, and annotate your audio files for machine learning☆120Updated 5 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- RNN to classify accents by using YouTube British and American training samples☆12Updated 5 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams☆68Updated 6 years ago
- Speaker diarization via transfer learning☆27Updated 6 years ago
- Identify sounds in short audio clips☆154Updated last year
- Compute useful transcription metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- Python library for audio augmentation☆84Updated last year
- Collaborative audio annotation tool☆17Updated 2 years ago
- Code to detect scenes and transitions in videos and compose a video to visualize the data.☆28Updated 6 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Updated last year
- A simple video annotation tool made with Python + OpenCV for detection in YoloV2 format☆16Updated 4 years ago
- Convolutional neural networks for sound classification☆20Updated 7 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆29Updated 10 months ago
- Spectral audio feature extraction using time-frequency reassignment☆42Updated 6 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆27Updated 3 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 3 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow.☆40Updated 6 years ago
- LogMMSE speech enhancement/noise reduction☆30Updated 4 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Implementation in Keras of Effnet (https://arxiv.org/abs/1801.06434)☆21Updated 6 years ago
- Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information☆27Updated 7 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 6 years ago