Phoneme prediction from speech mel-spectrograms using RNN.
☆15Jun 4, 2019Updated 6 years ago
Alternatives and similar repositories for speech_phoneme_prediction
Users that are interested in speech_phoneme_prediction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python library to accentuate Russian text☆12Dec 19, 2024Updated last year
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 8 years ago
- A port of an HMM / speech recognition C library to Android☆12Sep 23, 2016Updated 9 years ago
- Automatically exported from code.google.com/p/speech-recognition-java-hidden-markov-model-vq-mfcc☆12Jun 7, 2024Updated last year
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This is the repository for my version of Kaldi for Dummies example.☆17Nov 18, 2018Updated 7 years ago
- deadsimple immersive navigation: a single-player-verse component☆15Mar 11, 2026Updated last month
- A-Frame multi-user Croquet component☆12Aug 23, 2024Updated last year
- ☆20Feb 27, 2018Updated 8 years ago
- sms-tools workspace☆14Dec 7, 2014Updated 11 years ago
- Phoneme Recognition using RecNet☆97Nov 22, 2016Updated 9 years ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- Pure JS fast phonemizer with rule-based G2P prediction☆27Apr 10, 2026Updated 3 weeks ago
- Modularized version of the Pink Trombone voice synthesizer☆12May 5, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow☆17Jan 19, 2018Updated 8 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆38Sep 9, 2023Updated 2 years ago
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- Lazy extractors based build of youtube-dl☆24Feb 2, 2022Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- ☆23Sep 14, 2021Updated 4 years ago
- Work in progress Meta Quest Pro face and eye tracking utilities☆17Sep 5, 2023Updated 2 years ago
- A flexible framework for running experiments with PyTorch models in a simulated Federated Learning (FL) environment.☆15Aug 11, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- Repsetter - your new favorite workout diary☆16Jun 25, 2023Updated 2 years ago
- No buffers, no delay, no machine learning. Just instant polyphonic pitch detection☆20Nov 30, 2025Updated 5 months ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆67Mar 5, 2026Updated 2 months ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Vanilla single-precision radix-2 FFT for the ESP32.☆18Apr 16, 2025Updated last year
- Generate a UUID on all Django requests for traceability☆14Jul 31, 2018Updated 7 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature☆24Mar 29, 2020Updated 6 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Connect Unity Android builds to Frame glasses☆19Aug 25, 2024Updated last year
- A simple application of DTW Algorithm in isolate word speech recognition.☆17Mar 9, 2020Updated 6 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.☆34Jan 10, 2022Updated 4 years ago
- A Pytorch version of LPCNet, including dump weight☆36May 5, 2022Updated 4 years ago
- Personalization with deep learning in 100 lines of code☆15Mar 31, 2023Updated 3 years ago
- ☆65Jun 26, 2025Updated 10 months ago