anjandeepsahni/speech_phoneme_prediction

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/anjandeepsahni/speech_phoneme_prediction)

anjandeepsahni / speech_phoneme_prediction

Phoneme prediction from speech mel-spectrograms using RNN.

☆15

Alternatives and similar repositories for speech_phoneme_prediction

Users that are interested in speech_phoneme_prediction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lichard49 / HTK-Android
View on GitHub
A port of an HMM / speech recognition C library to Android
☆12Sep 23, 2016Updated 9 years ago
AppleHolic / PytorchSR
View on GitHub
Pytorch based phoneme recognition (TIMIT phoneme classification)
☆35Apr 25, 2018Updated 8 years ago
Magicboomliu / Viseme-Classification
View on GitHub
A pipeline from Dataset Gathering,Data annotations, Model training,Model Evaluation for viseme (visual sound phoneme) classification
☆15Jan 19, 2021Updated 5 years ago
arda-num / SFSRNet
View on GitHub
Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…
☆12Jul 7, 2022Updated 4 years ago
bushki / ios-voice-changer-libre
View on GitHub
iOS app written in swift. Records audio, plays back recorded audio using various sound effects.
☆18Oct 21, 2018Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JoergFranke / phoneme_recognition
View on GitHub
Phoneme Recognition using RecNet
☆97Nov 22, 2016Updated 9 years ago
shitian-ni / speech-recognition-transfer-learning
View on GitHub
Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow
☆17Jan 19, 2018Updated 8 years ago
jemmec / metaface-utilities
View on GitHub
Work in progress Meta Quest Pro face and eye tracking utilities
☆17Sep 5, 2023Updated 2 years ago
chdh / pink-trombone-mod
View on GitHub
Modularized version of the Pink Trombone voice synthesizer
☆13May 5, 2019Updated 7 years ago
seyong92 / phoneme-informed-note-level-singing-transcription
View on GitHub
A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023
☆38Sep 9, 2023Updated 2 years ago
shakingWaves / LPCNet_torch
View on GitHub
torch version of LPCNet
☆22Jul 8, 2020Updated 6 years ago
sasanasadiabadi / speech_animation
View on GitHub
☆24May 23, 2018Updated 8 years ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
avi33 / StyleMelGan-Unofficial
View on GitHub
☆23Sep 14, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bekirbakar / replay-attack-detection
View on GitHub
Deep learning-based audio spoofing attack detection experiments for speaker verification.
☆14Apr 20, 2023Updated 3 years ago
LexicalStressDetection / lexical-stress-detection
View on GitHub
Deep Learning model for lexical stress detection in spoken English
☆28Mar 17, 2020Updated 6 years ago
microsoft / fl-simulation
View on GitHub
A flexible framework for running experiments with PyTorch models in a simulated Federated Learning (FL) environment.
☆15Aug 11, 2023Updated 2 years ago
zakaton / repsetter
View on GitHub
Repsetter - your new favorite workout diary
☆16Jun 25, 2023Updated 3 years ago
markusstrasser / pitchplease
View on GitHub
No buffers, no delay, no machine learning. Just instant polyphonic pitch detection
☆21Nov 30, 2025Updated 7 months ago
york135 / singing_transcription_ICASSP2021
View on GitHub
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"
☆68Mar 5, 2026Updated 4 months ago
s-soltys / LipSync
View on GitHub
Lip animation app for 3D face models.
☆26Jun 17, 2026Updated 3 weeks ago
bluefireteam / mini_sprite
View on GitHub
A simple sprite format for building 1bit styled graphics.
☆16Oct 3, 2025Updated 9 months ago
hcy71o / SC-CNN
View on GitHub
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39Nov 1, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
revsys / django-tracer
View on GitHub
Generate a UUID on all Django requests for traceability
☆14Jul 31, 2018Updated 7 years ago
zdmc23 / oneshot-audio
View on GitHub
Experiment with "one-shot learning" techniques to recognize a voice signature
☆24Mar 29, 2020Updated 6 years ago
rosinality / melgan-pytorch
View on GitHub
MelGAN and Tacotron 2 in PyTorch
☆11Oct 22, 2019Updated 6 years ago
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
Debasishray19 / fdtd-vocaltract-model
View on GitHub
A three-dimensional vocal tract acoustic model using the finite-difference time-domain (FDTD) numerical scheme.
☆18Sep 25, 2022Updated 3 years ago
aishoot / DTWSpeech
View on GitHub
A simple application of DTW Algorithm in isolate word speech recognition.
☆17Mar 9, 2020Updated 6 years ago
Shahabks / Machine-Learning-Algorithm-for-Voice-Analysis
View on GitHub
It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater
☆11Mar 8, 2019Updated 7 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
ZiangLong / LPCNet_pytorch
View on GitHub
A Pytorch version of LPCNet, including dump weight
☆36May 5, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
JeremyCCHsu / vc-vawgan
View on GitHub
Network specification and demo
☆35Jun 5, 2017Updated 9 years ago
jhuang448 / LyricsAlignment-MTL
View on GitHub
☆67Jun 26, 2025Updated last year
zedix / prose-editor-element
View on GitHub
Prose Editor is a web component wrapping TipTap 2.
☆10Apr 7, 2024Updated 2 years ago
joansj / pytorch-intro
View on GitHub
A couple of scripts to illustrate how to do CNNs and RNNs in PyTorch
☆37May 23, 2017Updated 9 years ago
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
morkertis / One-Shot-Face-Recognition
View on GitHub
One-Shot Face Recognition Using Siamese Neural Networks
☆14Mar 15, 2020Updated 6 years ago
inverse-ai / FINALLY-Speech-Enhancement
View on GitHub
FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.
☆28Apr 1, 2026Updated 3 months ago