Phoneme prediction from speech mel-spectrograms using RNN.
☆15Jun 4, 2019Updated 7 years ago
Alternatives and similar repositories for speech_phoneme_prediction
Users that are interested in speech_phoneme_prediction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python library to accentuate Russian text☆12Dec 19, 2024Updated last year
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 8 years ago
- A port of an HMM / speech recognition C library to Android☆12Sep 23, 2016Updated 9 years ago
- Automatically exported from code.google.com/p/speech-recognition-java-hidden-markov-model-vq-mfcc☆12Jun 7, 2024Updated 2 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆12Jul 7, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A pipeline from Dataset Gathering,Data annotations, Model training,Model Evaluation for viseme (visual sound phoneme) classification☆15Jan 19, 2021Updated 5 years ago
- This is the repository for my version of Kaldi for Dummies example.☆17Nov 18, 2018Updated 7 years ago
- deadsimple immersive navigation: a single-player-verse component☆15Mar 11, 2026Updated 3 months ago
- ☆20Feb 27, 2018Updated 8 years ago
- sms-tools workspace☆14Dec 7, 2014Updated 11 years ago
- Phoneme Recognition using RecNet☆97Nov 22, 2016Updated 9 years ago
- Fast Russian Text normalization for TTS using only RegEx.☆30Updated this week
- iOS app written in swift. Records audio, plays back recorded audio using various sound effects.☆18Oct 21, 2018Updated 7 years ago
- Pure JS fast phonemizer with rule-based G2P prediction☆27Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow☆17Jan 19, 2018Updated 8 years ago
- torch version of LPCNet☆22Jul 8, 2020Updated 5 years ago
- Lazy extractors based build of youtube-dl☆24Feb 2, 2022Updated 4 years ago
- Face Recognition using Faster R-CNN☆21May 11, 2019Updated 7 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 3 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- ☆23Sep 14, 2021Updated 4 years ago
- A flexible framework for running experiments with PyTorch models in a simulated Federated Learning (FL) environment.☆15Aug 11, 2023Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Repsetter - your new favorite workout diary☆16Jun 25, 2023Updated 2 years ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆68Mar 5, 2026Updated 3 months ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Vanilla single-precision radix-2 FFT for the ESP32.☆18Apr 16, 2025Updated last year
- Generate a UUID on all Django requests for traceability☆14Jul 31, 2018Updated 7 years ago
- Experiment with "one-shot learning" techniques to recognize a voice signature☆24Mar 29, 2020Updated 6 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- A simple application of DTW Algorithm in isolate word speech recognition.☆17Mar 9, 2020Updated 6 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained.☆34Jan 10, 2022Updated 4 years ago
- A Pytorch version of LPCNet, including dump weight☆36May 5, 2022Updated 4 years ago
- ☆65Jun 26, 2025Updated 11 months ago
- Prose Editor is a web component wrapping TipTap 2.☆10Apr 7, 2024Updated 2 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Acoustic-prosodic entrainment measurement in spoken dialogue and approximation of the evolution of a speaker’s a/p features.☆13Feb 26, 2024Updated 2 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46May 30, 2017Updated 9 years ago