Phoneme prediction from speech mel-spectrograms using RNN.
☆15Jun 4, 2019Updated 6 years ago
Alternatives and similar repositories for speech_phoneme_prediction
Users that are interested in speech_phoneme_prediction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python library to accentuate Russian text☆12Dec 19, 2024Updated last year
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 8 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- deadsimple immersive navigation: a single-player-verse component☆15Mar 11, 2026Updated 2 months ago
- A-Frame multi-user Croquet component☆12Aug 23, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆20Feb 27, 2018Updated 8 years ago
- Phoneme Recognition using RecNet☆97Nov 22, 2016Updated 9 years ago
- Normalize Text in Russian☆29Nov 7, 2023Updated 2 years ago
- Pure JS fast phonemizer with rule-based G2P prediction☆27Updated this week
- Modularized version of the Pink Trombone voice synthesizer☆12May 5, 2019Updated 7 years ago
- Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow☆17Jan 19, 2018Updated 8 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆38Sep 9, 2023Updated 2 years ago
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- ☆24May 23, 2018Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- ☆23Sep 14, 2021Updated 4 years ago
- Work in progress Meta Quest Pro face and eye tracking utilities☆17Sep 5, 2023Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- Repsetter - your new favorite workout diary☆16Jun 25, 2023Updated 2 years ago
- USC CS621 Course Project☆26Apr 22, 2023Updated 3 years ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆67Mar 5, 2026Updated 2 months ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Vanilla single-precision radix-2 FFT for the ESP32.☆18Apr 16, 2025Updated last year
- Generate a UUID on all Django requests for traceability☆14Jul 31, 2018Updated 7 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature☆24Mar 29, 2020Updated 6 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- Connect Unity Android builds to Frame glasses☆20Aug 25, 2024Updated last year
- A simple application of DTW Algorithm in isolate word speech recognition.☆17Mar 9, 2020Updated 6 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.☆34Jan 10, 2022Updated 4 years ago
- A Pytorch version of LPCNet, including dump weight☆36May 5, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆65Jun 26, 2025Updated 11 months ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Acoustic-prosodic entrainment measurement in spoken dialogue and approximation of the evolution of a speaker’s a/p features.☆13Feb 26, 2024Updated 2 years ago
- Network specification and demo☆35Jun 5, 2017Updated 8 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46May 30, 2017Updated 9 years ago
- A Svelte template slightly modified for use alongside django-svelte☆12Feb 8, 2024Updated 2 years ago
- Includes sample datasets for machine learning☆10Apr 1, 2017Updated 9 years ago