Phoneme Recognition using RecNet
☆97Nov 22, 2016Updated 9 years ago
Alternatives and similar repositories for phoneme_recognition
Users that are interested in phoneme_recognition are comparing it to the libraries listed below.
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 8 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆47Jun 24, 2020Updated 5 years ago
- Build an attention-based model for speech recognition. Use the Word2vec model to help train the attention model.☆30Dec 18, 2019Updated 6 years ago
- RecNet - Recurrent Neural Network Framework☆73Apr 7, 2017Updated 9 years ago
- Automatically exported from code.google.com/p/speech-recognition-java-hidden-markov-model-vq-mfcc☆12Jun 7, 2024Updated 2 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Jun 4, 2019Updated 7 years ago
- Phoneme recognition using various features and deep learning.☆13Nov 27, 2019Updated 6 years ago
- ☆24Sep 25, 2018Updated 7 years ago
- BUT Multilingual Bottleneck Features☆15Mar 22, 2019Updated 7 years ago
- Tensorflow implementation of VQVAE for voice conversion☆12Apr 3, 2018Updated 8 years ago
- Dialect identification using Siamese network☆15Dec 12, 2017Updated 8 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆263May 9, 2022Updated 4 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- Mel-Generalized Cepstrum analysis☆19Jul 21, 2017Updated 8 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆70Feb 19, 2018Updated 8 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- System for emotion detection in given speech data using joint modelling of hand-crafted prosody-rich features, MFCC features and LSTM ba…☆10Nov 15, 2017Updated 8 years ago
- Implementation of a speaker identification and a speaker verification system based on Gaussian Mixture Models (GMM) in combination with a…☆21Mar 1, 2018Updated 8 years ago
- A tool for automatic phoneme transcription☆159Apr 18, 2023Updated 3 years ago
- ☆24May 23, 2018Updated 8 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆146Aug 5, 2022Updated 3 years ago
- CTC end-to-end ASR for the TIMIT and 863 corpora.☆219Dec 20, 2019Updated 6 years ago
- Credit Card Fraud Detection using HMM (Hidden Markov Model)☆12Nov 2, 2017Updated 8 years ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- Code for phonetically classifying TIMIT using TensorFlow☆17Jul 1, 2016Updated 9 years ago
- VQVAE for Unsupervised Voice Conversion☆21Apr 25, 2019Updated 7 years ago
- Stochastic Machines for Unsupervised Learning implemented in Pytorch.☆10Sep 3, 2017Updated 8 years ago
- A dataset for chord coloring and voicing☆20Nov 2, 2020Updated 5 years ago
- Allosaurus is a pretrained universal phone recognizer for more than 2000 languages☆731Apr 26, 2024Updated 2 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆44Dec 17, 2020Updated 5 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- Awesome Compositional Zero-shot Learning papers.☆14Aug 26, 2025Updated 9 months ago
- C++ inference engine for HMM-based speech synthesis, deployed to early mobile devices☆40Jun 9, 2026Updated last week
- USC CS621 Course Project☆26Apr 22, 2023Updated 3 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Nov 19, 2022Updated 3 years ago