Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language difference, this is an effect of 'Kaldi for dummies' tutorial published in kaldi-help discussion group. No audio data - this is just an example.
☆11May 29, 2016Updated 9 years ago
Alternatives and similar repositories for kaldifordummies
Users that are interested in kaldifordummies are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep understanding and modelling of the hierarchical structure of prosody☆24May 12, 2019Updated 6 years ago
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 7 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch☆15Aug 12, 2017Updated 8 years ago
- Deep Learning For Ultrasound Tongue Imaging☆13Dec 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ctc_beamsearch☆18Oct 26, 2016Updated 9 years ago
- MATLAB model of the auditory periphery☆17Nov 28, 2011Updated 14 years ago
- Code for TALLIP2019 paper "µ-Forcing: Training Variational Recurrent Autoencoders for Text Generation"☆12May 27, 2019Updated 6 years ago
- A series of Jupyter notebooks on signal processing☆53Dec 16, 2018Updated 7 years ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2.☆50Dec 4, 2018Updated 7 years ago
- Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State☆17Mar 4, 2019Updated 7 years ago
- PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEOS USING CONVOLUTIONAL LSTM NEURAL NETWORKS☆19Oct 29, 2018Updated 7 years ago
- Java API for the online speech recognition services provided by phon.ioc.ee☆18Jun 4, 2021Updated 4 years ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MTracker is a tool for automatic splining tongue shapes in ultrasound images by harnessing the power of deep convolutional neural network…☆20Feb 12, 2021Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- ABX discrimination task in python☆45Oct 7, 2024Updated last year
- An Ellipsis-aware Chinese Dependency Treebank for Web Text☆26May 14, 2018Updated 7 years ago
- MATLAB functions for training and evaluating HMMs and GMMs.☆22Jan 6, 2010Updated 16 years ago
- Assessing syntactic abilities of BERT☆40Jul 18, 2019Updated 6 years ago
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- Courses Project I have done in Syracuse University☆10Jul 9, 2014Updated 11 years ago
- A public dataset containing chord/beat annotation from a music game named 'osu!'.☆11Oct 17, 2017Updated 8 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ASR course at Chula 2018☆65Jun 15, 2018Updated 7 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Mar 26, 2026Updated 3 weeks ago
- Text-Independent Speaker Recognition Using Gaussian Mixture Models☆12Jul 1, 2015Updated 10 years ago
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- Reproducing code for Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Varia…☆29May 20, 2020Updated 5 years ago
- Representations of language in a model of visually grounded speech signal.☆23Apr 19, 2018Updated 8 years ago
- using opencv play Lyto Different Color☆10Apr 28, 2020Updated 5 years ago
- COVID-19 FAQ chatbot in python along with user interfce☆10Feb 2, 2024Updated 2 years ago
- Use it to convert a whole directory from Python 2 to Python 3, including IPython Notebooks☆10Nov 23, 2015Updated 10 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Feb 18, 2017Updated 9 years ago
- A neural language model that estimates incremental processing complexity☆40Oct 27, 2021Updated 4 years ago
- Official Tensorflow implementation of the paper "Y-Autoencoders: disentangling latent representations via sequential-encoding", Pattern R…☆50Oct 1, 2020Updated 5 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- ☆14Dec 7, 2018Updated 7 years ago
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.☆43Apr 25, 2020Updated 5 years ago
- Mastodon server running for the Doubanius Tertius project☆10Apr 4, 2022Updated 4 years ago