Simple automatic speech recognition system based on digits corpora (Polish language), created with the Kaldi toolkit. Despite the language difference, this is the result of the 'Kaldi for dummies' tutorial published in the kaldi-help discussion group. No audio data is included; this is just an example.
☆11 · May 29, 2016 · Updated 10 years ago
Alternatives and similar repositories for kaldifordummies
Users that are interested in kaldifordummies are comparing it to the libraries listed below.
- Deep understanding and modelling of the hierarchical structure of prosody ☆25 · May 12, 2019 · Updated 7 years ago
- Neural Network Semantic Parser for Almond ☆15 · Apr 11, 2019 · Updated 7 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo… ☆24 · Dec 8, 2019 · Updated 6 years ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch ☆15 · Aug 12, 2017 · Updated 8 years ago
- Deep Learning For Ultrasound Tongue Imaging ☆13 · Dec 17, 2024 · Updated last year
- ctc_beamsearch ☆18 · Oct 26, 2016 · Updated 9 years ago
- MATLAB model of the auditory periphery ☆17 · Nov 28, 2011 · Updated 14 years ago
- Java Speech Toolkit ☆12 · Apr 25, 2021 · Updated 5 years ago
- A series of Jupyter notebooks on signal processing ☆53 · Dec 16, 2018 · Updated 7 years ago
- PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEOS USING CONVOLUTIONAL LSTM NEURAL NETWORKS ☆19 · Oct 29, 2018 · Updated 7 years ago
- Java API for the online speech recognition services provided by phon.ioc.ee ☆18 · Jun 4, 2021 · Updated 4 years ago
- MTracker is a tool for automatic splining tongue shapes in ultrasound images by harnessing the power of deep convolutional neural network… ☆20 · Feb 12, 2021 · Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices ☆29 · Aug 13, 2020 · Updated 5 years ago
- ABX discrimination task in python ☆45 · Oct 7, 2024 · Updated last year
- An Ellipsis-aware Chinese Dependency Treebank for Web Text ☆26 · May 14, 2018 · Updated 8 years ago
- MATLAB functions for training and evaluating HMMs and GMMs. ☆22 · Jan 6, 2010 · Updated 16 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin… ☆10 · Jun 17, 2024 · Updated last year
- ☆51 · Feb 15, 2019 · Updated 7 years ago
- Assessing syntactic abilities of BERT ☆40 · Jul 18, 2019 · Updated 6 years ago
- Recognizing a speaker using Deep Learning ☆11 · Dec 25, 2017 · Updated 8 years ago
- Waffer-thin FlaskGPT on Vercel. ☆12 · Jun 1, 2023 · Updated 2 years ago
- 2D U-Net using deformable convolution ☆28 · Dec 12, 2020 · Updated 5 years ago
- ASR course at Chula 2018 ☆65 · Jun 15, 2018 · Updated 7 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender ☆13 · Mar 26, 2026 · Updated 2 months ago
- Text-Independent Speaker Recognition Using Gaussian Mixture Models ☆12 · Jul 1, 2015 · Updated 10 years ago
- TTS model based on Transformer. ☆58 · Aug 2, 2019 · Updated 6 years ago
- Reproducing code for Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Varia… ☆29 · May 20, 2020 · Updated 6 years ago
- Representations of language in a model of visually grounded speech signal. ☆23 · Apr 19, 2018 · Updated 8 years ago
- using opencv play Lyto Different Color ☆10 · Apr 28, 2020 · Updated 6 years ago
- COVID-19 FAQ chatbot in python along with user interfce ☆10 · Feb 2, 2024 · Updated 2 years ago
- Use it to convert a whole directory from Python 2 to Python 3, including IPython Notebooks ☆10 · Nov 23, 2015 · Updated 10 years ago
- Official Tensorflow implementation of the paper "Y-Autoencoders: disentangling latent representations via sequential-encoding", Pattern R… ☆50 · Oct 1, 2020 · Updated 5 years ago
- Util code, issues, discussions ☆29 · Aug 31, 2018 · Updated 7 years ago
- Mastodon server running for the Doubanius Tertius project ☆10 · Apr 4, 2022 · Updated 4 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney ☆174 · Dec 16, 2025 · Updated 5 months ago
- TranscriberAG open development ☆39 · Nov 16, 2014 · Updated 11 years ago
- Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018. ☆44 · Apr 25, 2020 · Updated 6 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models. ☆15 · Jun 27, 2020 · Updated 5 years ago
- Emergent Communication of Generalizations, NeurIPS 2021 ☆13 · Sep 29, 2021 · Updated 4 years ago