blacklight / micmon
A Python library and set of scripts to create labelled audio datasets from raw audio files and use them to train sound detection models.
☆42Updated 3 years ago
Alternatives and similar repositories for micmon:
Users that are interested in micmon are comparing it to the libraries listed below
- Experiments and tutorials with and for torchaudio☆13Updated 3 years ago
- ☆16Updated 8 years ago
- ☆9Updated 2 years ago
- YOLT (You Only Look Twice) - a tool that attempts to improve the accuracy of YOLOv4 in images☆21Updated 4 years ago
- ☆16Updated 2 years ago
- Collection of models and extensions for deployment in PyTorch☆24Updated 2 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 9 years ago
- A very basic demonstration connecting speech recognition and text-to-speech☆19Updated 4 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 6 years ago
- How to run GPU accelerated Signal Processing in TensorFlow☆23Updated 6 years ago
- ☆40Updated 5 years ago
- ☆43Updated 4 years ago
- A tool for generic tracking-based CV annotation☆18Updated 4 years ago
- Code for blog posts from OpenCV.AI☆15Updated last year
- German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support☆25Updated 3 years ago
- Sample implementation of 3D object detection with Intel OpenVINO☆15Updated 4 years ago
- Tensorflow camera calibrator☆14Updated 4 years ago
- Text detection and recognition in natural videos☆27Updated 7 years ago
- ☆13Updated 4 years ago
- Datasets for hackernews posts☆16Updated 3 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- ☆19Updated 4 years ago
- Matplotlib Image labeller for classifying images☆10Updated last month
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- A public repository of work for the Speech Verification component of the undergrad squad for Doubtfire.☆13Updated 3 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- ☆15Updated 4 years ago
- The History of Speech Recognition to the Year 2030☆12Updated 3 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow☆21Updated 6 years ago
- C++ Implementation of the Information Bottleneck System☆23Updated 6 years ago