bscharan/Automatic-speech-sequence-segmentation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bscharan/Automatic-speech-sequence-segmentation)

bscharan / Automatic-speech-sequence-segmentation

The Main Aim of this project is to segment and cluster an audio sample based on speaker when number of speakers are not known before hand. Main challenge in the process of speaker recognition is separting audio based on speaker.It can enhance the readability of an automatic speech transcription by structuring the audio stream into speaker turns …

☆25

Alternatives and similar repositories for Automatic-speech-sequence-segmentation

Users that are interested in Automatic-speech-sequence-segmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alikaratana / SpeakerRecognition
View on GitHub
Text-Dependent Speaker Recognition System with Machine Learning Techniques
☆10Dec 31, 2017Updated 8 years ago
adamsolomou / Speech-Enhancement
View on GitHub
Real-time speech enhancement based on spectral subtraction
☆16Feb 18, 2018Updated 8 years ago
yongxuUSTC / DNN-SpeechEnhancement
View on GitHub
DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)
☆17Aug 31, 2017Updated 8 years ago
homink / kaldi-asr.forced_decoding
View on GitHub
Perform the forced decoding with target transcription
☆11Sep 12, 2018Updated 7 years ago
BingYang-20 / TF-Wise-Spatial-Spectrum-Clustering
View on GitHub
A MATLAB implementation of “Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering” [TASLP 2019]
☆11Oct 23, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
AndreaCastiella / PsychoacousticParametersMeasurer
View on GitHub
This project is the translation to python of the most important parameters in the field of Psychoacoustics based on the book of Zwicker a…
☆14Jun 6, 2021Updated 5 years ago
HKBU-HPML / MG-WFBP
View on GitHub
MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning
☆12Apr 26, 2021Updated 5 years ago
ydcnanhe / codes-icassp-2022
View on GitHub
Fast & analytical blind source separation algorithm to separate any number of sources using two microphones
☆12Jun 8, 2024Updated 2 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
adamcsvarga / speaker-clustering
View on GitHub
Unsupervised Speaker Clustering & Speaker Recognition
☆13Jan 7, 2019Updated 7 years ago
875441459 / Design_DMA
View on GitHub
An implementation of frequency-invariant beamformer
☆14Sep 3, 2021Updated 4 years ago
SergMa / free-nross
View on GitHub
Free noise reduction of speech signals
☆12Jul 26, 2016Updated 10 years ago
kastur / speakerRecognition
View on GitHub
☆11May 18, 2013Updated 13 years ago
ronw / matlab_htk
View on GitHub
MATLAB functions that interface with the HTK Speech Recognition Toolkit (http://htk.eng.cam.ac.uk/) for training HMMs, GMMs and simple sp…
☆46Jan 4, 2017Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hkmogul / BeamformingSpeechEnhancer
View on GitHub
Beamforming based binaural speech enhancement as a real time JUCE plugin
☆28Apr 29, 2018Updated 8 years ago
hspark84 / lgtfb-en
View on GitHub
Learnable Gammatone Filterbank (LGTFB) and Equal-loudness Normalization (EN)
☆13Apr 24, 2020Updated 6 years ago
workmanjack / lyric-mood-classification
View on GitHub
UC Berkeley Masters of Information & Data Science | W266 Natural Language Processing with Deep Learning Group Project | Team: Cyprian Gas…
☆16Dec 8, 2022Updated 3 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
mcusi / gammatonegram
View on GitHub
Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/
☆15Oct 15, 2018Updated 7 years ago
taisedias / selenium-cucumber
View on GitHub
Examples of tests using selenium with cucumber jvm
☆16Sep 30, 2014Updated 11 years ago
Livefull / SphereDiar
View on GitHub
☆11May 4, 2020Updated 6 years ago
satyanamuduri / Speech-Enhancement-Using-GSC
View on GitHub
To Implement the Generalized Side Lobe Canceller with Fixed Beamformer,parallel blocking matrix and adaptive interference canceller achie…
☆29Oct 15, 2019Updated 6 years ago
FanmingL / independent-analysis
View on GitHub
Real-Time Independent Vector Analysis
☆16Jul 4, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
jeroenvansaane / Deep-Learning-Based-Intrusion-Detection-NSL-KDD
View on GitHub
Deep Learning based Intrusion Detection on NSL-KDD Dataset
☆14Aug 24, 2019Updated 6 years ago
yujiacheng333 / Conv_TasNet
View on GitHub
Conv TaSNet follow work of KaiTuo Xu in TF-keras
☆14Oct 19, 2020Updated 5 years ago
DavideNardone / Blind-Source-Separation-using-Dictionary-Learning
View on GitHub
A model for Blind Source Separation using Dictionary Learning
☆13Sep 30, 2019Updated 6 years ago
sujit-deokar / Signal-to-Noise-ratio-SNR-
View on GitHub
In order to demonstrate any signal accurately it is important to know the noise containt in the signal. Thus, a fundamental measure is th…
☆13May 10, 2021Updated 5 years ago
gdebayan / Diarization_BIC
View on GitHub
Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering
☆15Jul 28, 2017Updated 8 years ago
idiap / phonvoc
View on GitHub
Phonetic and phonological vocoding platform
☆17Nov 23, 2016Updated 9 years ago
adiyoss / DeepSegmentor
View on GitHub
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
☆17Feb 25, 2017Updated 9 years ago
elsheikh21 / malware-analysis
View on GitHub
using Drebin dataset to distinguish between malwares and not malwares
☆13Jan 5, 2019Updated 7 years ago
niklub / NMFdenoiser
View on GitHub
Matlab toolbox for making audio denoising using several NMF techniques
☆28Mar 28, 2014Updated 12 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shunsukeaihara / pysas
View on GitHub
Speech Analysis and Synthesis Toolkit for Python(2.X, 3.X).
☆16Aug 27, 2019Updated 6 years ago
srinivr / kaldi-long-audio-alignment
View on GitHub
Long audio alignment using Kaldi
☆23Apr 22, 2021Updated 5 years ago
sam81 / pychoacoustics
View on GitHub
Software for psychoacoustics experiments
☆24Oct 26, 2024Updated last year
ucbvislab / p2fa-vislab
View on GitHub
A script for audio/transcript alignment. Fork of p2fa.
☆69Mar 15, 2018Updated 8 years ago
markdtw / condensenet-tensorflow
View on GitHub
tensorflow implementation of CondenseNet: An Efficient DenseNet using Learned Group Convolutions
☆29Feb 1, 2018Updated 8 years ago
open-speech / kaldi-io
View on GitHub
c++ Kaldi IO lib (static and dynamic).
☆25Nov 26, 2018Updated 7 years ago
manthanthakker / speakerIdentificationNeuralNetworks
View on GitHub
⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's vo…
☆39Jan 13, 2020Updated 6 years ago