The Main Aim of this project is to segment and cluster an audio sample based on speaker when number of speakers are not known before hand. Main challenge in the process of speaker recognition is separting audio based on speaker.It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns …
☆25Jan 25, 2022Updated 4 years ago
Alternatives and similar repositories for Automatic-speech-sequence-segmentation
Users that are interested in Automatic-speech-sequence-segmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Text-Dependent Speaker Recognition System with Machine Learning Techniques☆10Dec 31, 2017Updated 8 years ago
- Real-time speech enhancement based on spectral subtraction☆16Feb 18, 2018Updated 8 years ago
- A MATLAB implementation of “Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering” [TASLP 2019]☆11Oct 23, 2023Updated 2 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning☆12Apr 26, 2021Updated 4 years ago
- dual-mic noise reduction based on coherence function☆53Dec 10, 2019Updated 6 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- Free noise reduction of speech signals☆12Jul 26, 2016Updated 9 years ago
- An implementation of frequency-invariant beamformer☆15Sep 3, 2021Updated 4 years ago
- Beamforming based binaural speech enhancement as a real time JUCE plugin☆28Apr 29, 2018Updated 7 years ago
- ☆11May 18, 2013Updated 12 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/☆15Oct 15, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11May 4, 2020Updated 5 years ago
- Examples of tests using selenium with cucumber jvm☆16Sep 30, 2014Updated 11 years ago
- To Implement the Generalized Side Lobe Canceller with Fixed Beamformer,parallel blocking matrix and adaptive interference canceller achie…☆29Oct 15, 2019Updated 6 years ago
- Real-Time Independent Vector Analysis☆16Jul 4, 2022Updated 3 years ago
- Learnable Gammatone Filterbank (LGTFB) and Equal-loudness Normalization (EN)☆12Apr 24, 2020Updated 5 years ago
- A model for Blind Source Separation using Dictionary Learning☆14Sep 30, 2019Updated 6 years ago
- In order to demonstrate any signal accurately it is important to know the noise containt in the signal. Thus, a fundamental measure is th…☆13May 10, 2021Updated 4 years ago
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering☆15Jul 28, 2017Updated 8 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)☆17Feb 25, 2017Updated 9 years ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 4 years ago
- Speech Analysis and Synthesis Toolkit for Python(2.X, 3.X).☆16Aug 27, 2019Updated 6 years ago
- parfda WMT'14 Datasets☆17Apr 25, 2019Updated 6 years ago
- Matlab toolbox for making audio denoising using several NMF techniques☆28Mar 28, 2014Updated 11 years ago
- Speaker Identification using GMM and Speech Recognition using HMMs☆38Apr 7, 2014Updated 11 years ago
- Software for psychoacoustics experiments☆25Oct 26, 2024Updated last year
- c++ Kaldi IO lib (static and dynamic).☆25Nov 26, 2018Updated 7 years ago
- tensorflow implementation of CondenseNet: An Efficient DenseNet using Learned Group Convolutions☆29Feb 1, 2018Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This project gives an example of dual microphone speech enhancement based on GSC beamformer and multiple channel postfilter.☆104Aug 22, 2018Updated 7 years ago
- 水下目标方位估计算法,包括:基于常规波束形成的时间窗方法以及基于卷积神经网络的时间窗方法☆21Jul 8, 2025Updated 8 months ago
- A Common Lisp framework for the creation of electronic art, visual design, game prototyping, game making, computer graphics, exploration …☆12Nov 25, 2019Updated 6 years ago
- A blind source separation package using non-negative matrix factorization and non-negative ICA☆17May 31, 2021Updated 4 years ago
- Dereverberation of Speech Signals Using Weighted Prediction Error☆23May 17, 2019Updated 6 years ago
- ⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's vo…☆39Jan 13, 2020Updated 6 years ago
- ☆45Dec 5, 2019Updated 6 years ago