Phoneme prediction from speech mel-spectrograms using RNN.
☆15Jun 4, 2019Updated 6 years ago
Alternatives and similar repositories for speech_phoneme_prediction
Users that are interested in speech_phoneme_prediction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python library to accentuate Russian text☆11Dec 19, 2024Updated last year
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 7 years ago
- A port of an HMM / speech recognition C library to Android☆12Sep 23, 2016Updated 9 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- A pipeline from Dataset Gathering,Data annotations, Model training,Model Evaluation for viseme (visual sound phoneme) classification☆14Jan 19, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- deadsimple immersive navigation: a single-player-verse component☆15Mar 11, 2026Updated 2 weeks ago
- This is the repository for my version of Kaldi for Dummies example.☆17Nov 18, 2018Updated 7 years ago
- A-Frame multi-user Croquet component☆12Aug 23, 2024Updated last year
- ☆20Feb 27, 2018Updated 8 years ago
- Phoneme Recognition using RecNet☆97Nov 22, 2016Updated 9 years ago
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- iOS app written in swift. Records audio, plays back recorded audio using various sound effects.☆18Oct 21, 2018Updated 7 years ago
- Pure JS fast phonemizer with rule-based G2P prediction☆24Mar 4, 2026Updated 3 weeks ago
- Modularized version of the Pink Trombone voice synthesizer☆12May 5, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆38Sep 9, 2023Updated 2 years ago
- No buffers, no delay, no machine learning. Just instant polyphonic pitch detection☆17Nov 30, 2025Updated 3 months ago
- ☆24May 23, 2018Updated 7 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- Work in progress Meta Quest Pro face and eye tracking utilities☆17Sep 5, 2023Updated 2 years ago
- A flexible framework for running experiments with PyTorch models in a simulated Federated Learning (FL) environment.☆15Aug 11, 2023Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Mar 17, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repsetter - your new favorite workout diary☆16Jun 25, 2023Updated 2 years ago
- USC CS621 Course Project☆26Apr 22, 2023Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆63Mar 5, 2026Updated 3 weeks ago
- Generate a UUID on all Django requests for traceability☆15Jul 31, 2018Updated 7 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Connect Unity Android builds to Frame glasses☆19Aug 25, 2024Updated last year
- A three-dimensional vocal tract acoustic model using the finite-difference time-domain (FDTD) numerical scheme.☆17Sep 25, 2022Updated 3 years ago
- A simple application of DTW Algorithm in isolate word speech recognition.☆17Mar 9, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.☆33Jan 10, 2022Updated 4 years ago
- A Pytorch version of LPCNet, including dump weight☆36May 5, 2022Updated 3 years ago
- Personalization with deep learning in 100 lines of code☆15Mar 31, 2023Updated 2 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- ☆65Jun 26, 2025Updated 9 months ago
- Network specification and demo☆35Jun 5, 2017Updated 8 years ago