AKBoles/Deep-Learning-Speech-Recognition

Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.

☆50

Alternatives and similar repositories for Deep-Learning-Speech-Recognition

Users who are interested in Deep-Learning-Speech-Recognition are comparing it to the repositories listed below.

adamcsvarga / speaker-clustering
Unsupervised Speaker Clustering & Speaker Recognition
☆13 • Jan 7, 2019 • Updated 7 years ago
netankit / AudioMLProject3
Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…
☆16 • Jun 28, 2015 • Updated 11 years ago
AI-Guru / SincNet
Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)
☆12 • Aug 5, 2018 • Updated 7 years ago
aalto-speech / speaker-diarization
Speaker diarization scripts, based on AaltoASR
☆191 • Jan 3, 2019 • Updated 7 years ago
prmelehan / Speaker-Recognition
Recognizing a speaker using Deep Learning
☆11 • Dec 25, 2017 • Updated 8 years ago
i3thuan5 / hts_engine_python
python wrap for hts engine
☆14 • Jan 30, 2018 • Updated 8 years ago
kusha / voiceid
Speaker recognition/identification system in Python. Python3 port.
☆14 • May 2, 2015 • Updated 11 years ago
meyersbs / SPLAT
Speech Processing & Linguistic Analysis Tool
☆11 • Jun 30, 2019 • Updated 7 years ago
hjkwon0609 / speech_separation
☆12 • Jun 13, 2017 • Updated 9 years ago
MU94W / Tacotron
TACOTRON: TOWARDS END-TO-END SPEECH SYNTHESIS
☆16 • Sep 26, 2017 • Updated 8 years ago
joaoantoniocn / AM-MobileNet1D
The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆31 • Oct 3, 2023 • Updated 2 years ago
shikhar-srivastava / Recommender-System
SQL-based Recommendation System for multi-topic recommendations
☆12 • Jun 1, 2021 • Updated 5 years ago
danijel3 / SparrowhawkTest
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14 • Oct 16, 2017 • Updated 8 years ago
sorenchiron / Awesome-Speech-Enhancement
A collection of trending speech enhancement papers
☆11 • Dec 4, 2020 • Updated 5 years ago
orchidas / Speaker-Recognition
Automatic Speaker Recognition algorithms in Python
☆96 • Sep 25, 2021 • Updated 4 years ago
igormq / speech2text
☆12 • Feb 9, 2021 • Updated 5 years ago
scelesticsiva / speaker_recognition_GMM_UBM
A speaker recognition system which uses GMM-UBM for use in an Android application which helps in monitoring patients suffering from Schiz…
☆55 • Jun 13, 2018 • Updated 8 years ago
XapaJIaMnu / gLM
A GPU language model, based on btree backed tries.
☆30 • Mar 6, 2018 • Updated 8 years ago
hbredin / TristouNet
TristouNet: Triplet Loss for Speaker Turn Embedding
☆121 • Jul 6, 2017 • Updated 9 years ago
patyork / AutomaticSpeechChunker
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17 • May 15, 2015 • Updated 11 years ago
idiap / kaldi-ivector
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
☆88 • Feb 23, 2018 • Updated 8 years ago
alikaratana / SpeakerRecognition
Text-Dependent Speaker Recognition System with Machine Learning Techniques
☆10 • Dec 31, 2017 • Updated 8 years ago
johnkorn / speaker_recognition
Speaker recognition and verification with deep learning
☆13 • Mar 7, 2017 • Updated 9 years ago
homink / kaldi-asr.forced_decoding
Perform the forced decoding with target transcription
☆11 • Sep 12, 2018 • Updated 7 years ago
yuhaozhang / nnjm-global
A python implementation of the neural network joint language model and an extension of it using global source context.
☆11 • May 17, 2017 • Updated 9 years ago
qqueing / SR_with_kaldi
Speaker embedding (verification and recognition) using TensorFlow with Kaldi
☆41 • Sep 18, 2017 • Updated 8 years ago
gullabi / STT-align
Coqui STT (🐸STT) based forced alignment tool
☆13 • Feb 24, 2022 • Updated 4 years ago
FragJage / SpeakerVoiceIdentifier
SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.
☆35 • Feb 20, 2017 • Updated 9 years ago
vishalshar / SpeakerDiarization_RNN_CNN_LSTM
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…
☆64 • Jan 8, 2021 • Updated 5 years ago
domerin0 / rnn-speech
Character level speech recognizer using ctc loss with deep rnns in TensorFlow.
☆78 • Jun 9, 2018 • Updated 8 years ago
AKBoles / Voice-Identification
Project to explore Speaker and Voice Identification. To follow will be further Speech Recognition tasks.
☆52 • Apr 29, 2019 • Updated 7 years ago
barthez / speaker-recognition-nn
Speaker Recognition application using fast-forward NN
☆16 • Jun 14, 2012 • Updated 14 years ago
jacquelineCelia / lexicon_discovery
Source code for "Unsupervised Lexicon Discovery from Acoustic Input", Lee et al., 2015 TACL
☆10 • Aug 11, 2016 • Updated 9 years ago
deezer / interpretable_nn_attribution
Source code from our RecSys 2020 paper: "Making neural network interpretable with attribution: application to implicit signals prediction…
☆14 • Oct 2, 2020 • Updated 5 years ago
BackupGGCode / voiceid
Speaker recognition/identification system in Python
☆76 • Sep 20, 2018 • Updated 7 years ago
simonhughes22 / PythonNlpResearch
☆14 • Dec 7, 2022 • Updated 3 years ago
adelsalehali1982 / Best-MNIST-Classification-ever-seen-Without-any-difficult-tricks
Classification of MNIST digits by convolutional neural networks and then extracting features. After that I tune the to classes labels …
☆14 • Dec 31, 2016 • Updated 9 years ago
joaoantoniocn / AM-SincNet
The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…
☆46 • Oct 3, 2023 • Updated 2 years ago
genzen2103 / Speaker-Recognition-System-using-GMM
System for identifying speaker from given speech signal using MFCC, LPC features and Gaussian Mixture Models
☆21 • Nov 5, 2017 • Updated 8 years ago