Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language difference, this is an effect of 'Kaldi for dummies' tutorial published in kaldi-help discussion group. No audio data - this is just an example.
☆11May 29, 2016Updated 9 years ago
Alternatives and similar repositories for kaldifordummies
Users that are interested in kaldifordummies are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep understanding and modelling of the hierarchical structure of prosody☆24May 12, 2019Updated 6 years ago
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 6 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch☆15Aug 12, 2017Updated 8 years ago
- Deep Learning For Ultrasound Tongue Imaging☆12Dec 17, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ctc_beamsearch☆18Oct 26, 2016Updated 9 years ago
- MATLAB model of the auditory periphery☆17Nov 28, 2011Updated 14 years ago
- Code for TALLIP2019 paper "µ-Forcing: Training Variational Recurrent Autoencoders for Text Generation"☆12May 27, 2019Updated 6 years ago
- Java Speech Toolkit☆12Apr 25, 2021Updated 4 years ago
- A series of Jupyter notebooks on signal processing☆53Dec 16, 2018Updated 7 years ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2.☆50Dec 4, 2018Updated 7 years ago
- Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State☆17Mar 4, 2019Updated 7 years ago
- PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEOS USING CONVOLUTIONAL LSTM NEURAL NETWORKS☆19Oct 29, 2018Updated 7 years ago
- Java API for the online speech recognition services provided by phon.ioc.ee☆18Jun 4, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- MTracker is a tool for automatic splining tongue shapes in ultrasound images by harnessing the power of deep convolutional neural network…☆20Feb 12, 2021Updated 5 years ago
- Code accompanying the paper "Effective Estimation of Deep Generative Language Models".☆25May 1, 2020Updated 5 years ago
- An Ellipsis-aware Chinese Dependency Treebank for Web Text☆26May 14, 2018Updated 7 years ago
- MATLAB functions for training and evaluating HMMs and GMMs.☆22Jan 6, 2010Updated 16 years ago
- ☆51Feb 15, 2019Updated 7 years ago
- Assessing syntactic abilities of BERT☆40Jul 18, 2019Updated 6 years ago
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- A public dataset containing chord/beat annotation from a music game named 'osu!'.☆11Oct 17, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Oct 27, 2017Updated 8 years ago
- Text-Independent Speaker Recognition Using Gaussian Mixture Models☆12Jul 1, 2015Updated 10 years ago
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- Reproducing code for Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Varia…☆29May 20, 2020Updated 5 years ago
- Representations of language in a model of visually grounded speech signal.☆23Apr 19, 2018Updated 7 years ago
- COVID-19 FAQ chatbot in python along with user interfce☆10Feb 2, 2024Updated 2 years ago
- using opencv play Lyto Different Color☆10Apr 28, 2020Updated 5 years ago
- Unsupervised Learning for Optical Flow Estimation Using Pyramid Convolution LSTM.☆37Jul 29, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Feb 18, 2017Updated 9 years ago
- A neural language model that estimates incremental processing complexity☆39Oct 27, 2021Updated 4 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- ☆14Dec 7, 2018Updated 7 years ago
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.☆43Apr 25, 2020Updated 5 years ago
- Mastodon server running for the Doubanius Tertius project☆10Apr 4, 2022Updated 3 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney☆175Dec 16, 2025Updated 3 months ago