A deep learning framework for Speech-Music discrimination of continuous audio streams
☆68Aug 3, 2018Updated 7 years ago
Alternatives and similar repositories for CNNs-Speech-Music-Discrimination
Users that are interested in CNNs-Speech-Music-Discrimination are comparing it to the libraries listed below
Sorting:
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Speech/Music discrimination using SampleCNN☆18May 30, 2025Updated 9 months ago
- a music segmentation algorithm that I proposed and implemented as my undergraduate project. The basic function is: a song is loaded to th…☆16Apr 19, 2013Updated 12 years ago
- Training and using classifiers for textual documents☆15Sep 16, 2016Updated 9 years ago
- Api.ai English Speech Recognition (ASR) Model for Kaldi☆35Dec 27, 2020Updated 5 years ago
- Music structure segmentation with convnets☆13Mar 11, 2016Updated 9 years ago
- Various scripts that facilitate the preparation of Automatic Speech Recognition related resources☆17Apr 16, 2020Updated 5 years ago
- Audio Analysis by Conceptor☆30Aug 20, 2015Updated 10 years ago
- Deep Learning Tutorial in Python with Keras library☆21Feb 21, 2017Updated 9 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- A simple toolkit for speaker segmentation and identification☆31Jun 15, 2013Updated 12 years ago
- Generating drum loops using the Wave-U-Net conditioned on intuitive parameters.☆24Nov 19, 2020Updated 5 years ago
- This is the supplemental repository for ISMIR 2019 paper GENERATING STRUCTURED DRUM PATTERN USING VARIATIONAL AUTOENCODER AND SELF-SIMILA…☆23Oct 28, 2019Updated 6 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Audio source separation using CASA approaches in Python.☆11Apr 2, 2015Updated 10 years ago
- A Speech Analytics Python Tool for Speaking Assessment☆14Dec 8, 2022Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- DNN-for-speech-enhancement☆176Feb 23, 2023Updated 3 years ago
- singing voice analysis and detection tools☆21Jun 10, 2015Updated 10 years ago
- EAQUAL stands for Evaluation Of Audio Quality. It's an objective measurement technique used to measure the quality of encoded/decoded aud…☆25Dec 21, 2017Updated 8 years ago
- This repository contains the annotations and download scripts for the audio files of the GiantSteps Key data set. This data set was publi…☆23Mar 19, 2025Updated 11 months ago
- NNSVS向けの教師データのラベル作成支援ツールです。☆10Apr 5, 2023Updated 2 years ago
- Machine Learning model for creating karaoke music (stripping out vocals)☆16Aug 10, 2017Updated 8 years ago
- Python framework for Speech and Music Detection using Keras.☆109Mar 24, 2023Updated 2 years ago
- Convolutional REpresenations for Music Analysis☆12Jul 5, 2016Updated 9 years ago
- Transcription of drum sequences☆11Jul 6, 2015Updated 10 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html☆28Feb 3, 2026Updated last month
- Voice Activity Detection system (Matlab-based implementation)☆108May 9, 2017Updated 8 years ago
- Free noise reduction of speech signals☆12Jul 26, 2016Updated 9 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- Text normalization scripts from IRISA lab☆14Jun 1, 2018Updated 7 years ago