JoergFranke/phoneme_recognition

JoergFranke / phoneme_recognition

Phoneme Recognition using RecNet

☆97

Alternatives and similar repositories for phoneme_recognition

Users that are interested in phoneme_recognition are comparing it to the libraries listed below.

tbornt / phoneme_ctc
Bidirectional dynamic RNN + CTC for phoneme recognition
☆47 · Jun 24, 2020 · Updated 6 years ago
AppleHolic / PytorchSR
PyTorch-based phoneme recognition (TIMIT phoneme classification)
☆35 · Apr 25, 2018 · Updated 8 years ago
fengxin-bupt / Application-of-Word2vec-in-Phoneme-Recognition
Build an attention-based model for speech recognition. Use the Word2vec model to help train the attention model.
☆30 · Dec 18, 2019 · Updated 6 years ago
JoergFranke / recnet
RecNet - Recurrent Neural Network Framework
☆73 · Apr 7, 2017 · Updated 9 years ago
gtiwari333 / speech-recognition-java-hidden-markov-model-vq-mfcc
Automatically exported from code.google.com/p/speech-recognition-java-hidden-markov-model-vq-mfcc
☆12 · Jun 7, 2024 · Updated 2 years ago
anjandeepsahni / speech_phoneme_prediction
Phoneme prediction from speech mel-spectrograms using RNN.
☆15 · Jun 4, 2019 · Updated 7 years ago
HanSeokhyeon / Deep_learning_for_Phoneme_recognition
Phoneme recognition using various features and deep learning.
☆13 · Nov 27, 2019 · Updated 6 years ago
OrcusCZ / NNAcousticModeling
☆24 · Sep 25, 2018 · Updated 7 years ago
lucasondel / multilingual-bottleneck-features
BUT Multilingual Bottleneck Features
☆15 · Mar 22, 2019 · Updated 7 years ago
twidddj / vqvae
Tensorflow implementation of VQVAE for voice conversion
☆12 · Apr 3, 2018 · Updated 8 years ago
swshon / dialectID_siam
Dialect identification using Siamese network
☆15 · Dec 12, 2017 · Updated 8 years ago
ASR-project / Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…
☆266 · May 9, 2022 · Updated 4 years ago
npuichigo / extract_features_using_world
Using the WORLD vocoder to extract features and prepare data for training neural networks
☆11 · Oct 9, 2017 · Updated 8 years ago
r9y9 / MelGeneralizedCepstrums.jl
Mel-Generalized Cepstrum analysis
☆19 · Jul 21, 2017 · Updated 9 years ago
Faur / TIMIT
Framewise phoneme classification on the TIMIT dataset using neural networks
☆19 · Jul 14, 2016 · Updated 10 years ago
hirofumi0810 / asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
☆70 · Feb 19, 2018 · Updated 8 years ago
LEEYOONHYUNG / GraphTTS
☆12 · Jul 6, 2023 · Updated 3 years ago
bajibabu / postfilt_gan
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16 · Jun 27, 2018 · Updated 8 years ago
genzen2103 / Emotion-Detection-in-speech-using-Acoustic-and-Neural-Features
System for Emotion Detection in given speech data using joint modelling of hand-crafted, prosody-rich features, MFCC features and LSTM ba…
☆10 · Nov 15, 2017 · Updated 8 years ago
tobiasfshr / gmm-ubm-speaker-identification-verification
Implementation of a speaker identification and a speaker verification system based on Gaussian Mixture Models (GMM) in combination with a…
☆21 · Mar 1, 2018 · Updated 8 years ago
persephone-tools / persephone
A tool for automatic phoneme transcription
☆159 · Apr 18, 2023 · Updated 3 years ago
wenet-e2e / WeTextProcessing.deprecated
☆61 · Jan 31, 2023 · Updated 3 years ago
sasanasadiabadi / speech_animation
☆24 · May 23, 2018 · Updated 8 years ago
chenllliang / CTDNN
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11 · Dec 4, 2021 · Updated 4 years ago
m-toman / SALB
C++ inference engine for HMM-based speech synthesis, deployed to early mobile devices
☆40 · Jun 9, 2026 · Updated last month
Diamondfan / CTC_pytorch
CTC end-to-end ASR for the TIMIT and 863 corpora.
☆219 · Dec 20, 2019 · Updated 6 years ago
yogihbti / ccfdHMM
Credit Card Fraud Detection using HMM (Hidden Markov Model)
☆12 · Nov 2, 2017 · Updated 8 years ago
wassname / phoneme2grapheme
Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")
☆19 · Jun 1, 2017 · Updated 9 years ago
felixkreuk / UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
☆146 · Aug 5, 2022 · Updated 3 years ago
TimovNiedek / timit_tf
Code for phonetically classifying TIMIT using TensorFlow
☆17 · Jul 1, 2016 · Updated 10 years ago
Suhee05 / Zerospeech2019
VQVAE for Unsupervised Voice Conversion
☆21 · Apr 25, 2019 · Updated 7 years ago
xinjli / allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
☆737 · Apr 26, 2024 · Updated 2 years ago
YtJin-git / awesome-CZSL-papers
Awesome Compositional Zero-shot Learning papers.
☆14 · Aug 26, 2025 · Updated 11 months ago
Tsung-Ping / Chord-Jazzification
A dataset for chord coloring and voicing
☆20 · Nov 2, 2020 · Updated 5 years ago
ajinkyakulkarni14 / ERISHA
ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44 · Dec 17, 2020 · Updated 5 years ago
ivanvovk / compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22 · Dec 26, 2019 · Updated 6 years ago
JasonSWFu / JD-NMF
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
☆22 · Oct 14, 2017 · Updated 8 years ago
fantasy-fish / Generalized-Speech-Animation
USC CS621 Course Project
☆26 · Apr 22, 2023 · Updated 3 years ago
mrdrozdov / pytorch-machines
Stochastic Machines for Unsupervised Learning implemented in Pytorch.
☆10 · Sep 3, 2017 · Updated 8 years ago