SangwonSUH / realtime_YAMNETLinks
Simple real-time Sound Event Detector based on YAMNet and pyaudio.
β23Updated 5 years ago
Alternatives and similar repositories for realtime_YAMNET
Users that are interested in realtime_YAMNET are comparing it to the libraries listed below
Sorting:
- π΅ A repository for manually annotating files to create labeled acoustic datasets for machine learning.β45Updated 3 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.β42Updated 2 years ago
- Audio classification with VGGish as feature extractor in TensorFlowβ130Updated 3 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, β¦β76Updated 4 years ago
- Voice Activity Detection based on Deep Learning & TensorFlowβ368Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdfβ66Updated 4 years ago
- General purpose sound recognition demoβ158Updated last year
- β94Updated 2 years ago
- Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.β71Updated last year
- Classify daily life events using audio data.β53Updated 5 years ago
- Pytorch implementation of deep audio embedding calculationβ104Updated 2 years ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions forβ¦β169Updated last year
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keyworβ¦β101Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.β51Updated 4 years ago
- π This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).β102Updated 2 years ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and β¦β20Updated 2 years ago
- Python library for audio augmentationβ84Updated 2 years ago
- Speaker identification using voice MFCCs and GMMβ54Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning.β198Updated 5 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spottingβ23Updated 3 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBOβ64Updated 2 years ago
- A two-stage polyphonic sound event detection and localization method for both SED and DOA.β115Updated 2 years ago
- Kaldi based speaker verificationβ47Updated 7 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to idβ¦β67Updated 4 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogramβ254Updated last year
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REPβ¦β33Updated last year
- β236Updated last year
- Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancementβ257Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ107Updated 6 months ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.β91Updated 5 months ago