Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a classifier on this dataset for distinguishing voiced from non-voiced sections, a task called voice activity detection, VAD for short. This, of course, requires a ground truth in terms of VAD annotations.
☆18May 3, 2015Updated 10 years ago
Alternatives and similar repositories for AudioMLProject1
Users that are interested in AudioMLProject1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- simple-minded audio classifier in python (using MFCC and GMM)☆84Mar 15, 2023Updated 3 years ago
- voice active detection (python ver/simple and easy-to-use)☆12May 1, 2017Updated 8 years ago
- python script for voice activity detection.☆36Aug 16, 2024Updated last year
- Mel-Generalized Cepstrum analysis☆19Jul 21, 2017Updated 8 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆52Apr 7, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)☆80Dec 13, 2015Updated 10 years ago
- Recurrent neural network for audio noise reduction☆12Aug 18, 2022Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- Universal Deep neural network based speech enhancement demo and tools, well pre-trained DNN model☆67Feb 23, 2023Updated 3 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆70Apr 30, 2019Updated 6 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆36Aug 8, 2023Updated 2 years ago
- Cochlear.ai submission for dcase2018 task2☆15Sep 14, 2018Updated 7 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation☆31Sep 13, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…☆11Mar 23, 2018Updated 8 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- MNSS (Music Noise Segmentation on a Spectrogram) is a deep-neural network based preprocessing technique that pre-filters unnecessary nois…☆11Dec 14, 2015Updated 10 years ago
- Program for audio-to-audio and audio-to-midi alignment☆18Aug 14, 2009Updated 16 years ago
- a music segmentation algorithm that I proposed and implemented as my undergraduate project. The basic function is: a song is loaded to th…☆16Apr 19, 2013Updated 13 years ago
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- Tools for speech processing, keyword spotting☆16Mar 11, 2020Updated 6 years ago
- noise robust voice activity detection with noise tracker for ios/android☆18Dec 28, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the paper 'Weighting Finite State Transductions with Neural Context', Pushpendre Rastogi, Ryan Cotterell, Jason Eisner☆29May 11, 2019Updated 6 years ago
- Implementing speaker recognition using Python (GMM-UBM)☆29Apr 20, 2018Updated 8 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆20Dec 26, 2022Updated 3 years ago
- speaker recognition using keras☆36Nov 29, 2022Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- Octave port of the Fast Image Source Model by Eric A. Lehmann. Used for room acoustic modeling and impulse response simulation.☆12Aug 2, 2017Updated 8 years ago
- ☆106Mar 12, 2021Updated 5 years ago
- ☆26Dec 3, 2018Updated 7 years ago
- Feedforward Sequential Memory Networks☆17Aug 2, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.☆49Jun 12, 2017Updated 8 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Script to simulate room impulse responses☆16Sep 29, 2016Updated 9 years ago
- Analytic signal-based source information analysis for YANGstraight and real-time interactive tools☆34Aug 20, 2019Updated 6 years ago
- ☆12Sep 2, 2016Updated 9 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- My personal solutions to some textbook problems☆11Feb 12, 2020Updated 6 years ago