witko0/kaldifordummies

Simple automatic speech recognition system based on digits corpora (Polish language), created with the Kaldi toolkit. Despite the language difference, this is the result of the 'Kaldi for dummies' tutorial published in the kaldi-help discussion group. No audio data is included; this is just an example.

☆11

Alternatives and similar repositories for kaldifordummies

Users who are interested in kaldifordummies are comparing it to the libraries listed below.

stanford-oval / genie-parser
Neural Network Semantic Parser for Almond
☆15 · Apr 11, 2019 · Updated 7 years ago
colaudiolab / DeepLearning4UTI
Deep Learning For Ultrasound Tongue Imaging
☆13 · Dec 17, 2024 · Updated last year
rguthrie3 / DeepDependencyParsingProblemSet
A step-by-step problem set for implementing a high-quality deep dependency parser in PyTorch
☆15 · Aug 12, 2017 · Updated 8 years ago
Saizheng / ctc_beamsearch
ctc_beamsearch
☆18 · Oct 26, 2016 · Updated 9 years ago
shuiliwanwu / ConvLstm-ultrasound-videos
Predicting Tongue Motion in Unlabeled Ultrasound Videos Using Convolutional LSTM Neural Networks
☆19 · Oct 29, 2018 · Updated 7 years ago
gerazov / prosodeep
Deep understanding and modelling of the hierarchical structure of prosody
☆25 · May 12, 2019 · Updated 7 years ago
dayihengliu / Mu-Forcing-VRAE
Code for TALLIP2019 paper "µ-Forcing: Training Variational Recurrent Autoencoders for Text Generation"
☆12 · May 27, 2019 · Updated 7 years ago
langprocgroup / nn_syntactic_state
Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State
☆17 · Mar 4, 2019 · Updated 7 years ago
jfbercher / LecturesSignalProcessing
A series of Jupyter notebooks on signal processing
☆53 · Dec 16, 2018 · Updated 7 years ago
MyrtleSoftware / deepspeech
A PyTorch implementation of DeepSpeech and DeepSpeech2.
☆50 · Dec 4, 2018 · Updated 7 years ago
ZackHodari / average_prosody
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…
☆24 · Dec 8, 2019 · Updated 6 years ago
ZackHodari / tts_data_tools
Data processing tools for preparing speech and labels for training TTS voices
☆29 · Aug 13, 2020 · Updated 5 years ago
tom-pelsmaeker / deep-generative-lm
Code accompanying the paper "Effective Estimation of Deep Generative Language Models".
☆24 · May 1, 2020 · Updated 6 years ago
dmg-illc / uid-dialogue
A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…
☆10 · Jun 17, 2024 · Updated 2 years ago
tddpirate / pyguile
Invoke Python libraries from Guile
☆13 · Sep 5, 2016 · Updated 9 years ago
huggingface / bert-syntax
Assessing syntactic abilities of BERT
☆40 · Jul 18, 2019 · Updated 7 years ago
embatbr / graduation-project
Text-Independent Speaker Recognition Using Gaussian Mixture Models
☆12 · Jul 1, 2015 · Updated 11 years ago
prmelehan / Speaker-Recognition
Recognizing a speaker using Deep Learning
☆11 · Dec 25, 2017 · Updated 8 years ago
Ider / SU-Courses
Course projects I completed at Syracuse University
☆10 · Jul 9, 2014 · Updated 12 years ago
bootphon / ABXpy
ABX discrimination task in Python
☆45 · Oct 7, 2024 · Updated last year
moliqingcha / Deformable-U-Net
2D U-Net using deformable convolution
☆28 · Dec 12, 2020 · Updated 5 years ago
JinScientist / voice-gender-recognition
Train an LSTM neural network on the VoxForge public audio dataset to recognize a speaker's gender
☆13 · Mar 26, 2026 · Updated 4 months ago
OuYangMinOa / Lyto-Different-Color
Using OpenCV to play Lyto Different Color
☆10 · Apr 28, 2020 · Updated 6 years ago
gchrupala / visually-grounded-speech
Representations of language in a model of visually grounded speech signal.
☆23 · Apr 19, 2018 · Updated 8 years ago
sarang0909 / faq_chatbot
COVID-19 FAQ chatbot in Python along with a user interface
☆10 · Feb 2, 2024 · Updated 2 years ago
Kaljurand / net-speech-api
Java API for the online speech recognition services provided by phon.ioc.ee
☆18 · Jun 4, 2021 · Updated 5 years ago
mpsilfve / phonembedding
☆14 · Dec 7, 2018 · Updated 7 years ago
mpatacchiola / Y-AE
Official TensorFlow implementation of the paper "Y-Autoencoders: disentangling latent representations via sequential-encoding", Pattern R…
☆50 · Oct 1, 2020 · Updated 5 years ago
vansky / neural-complexity
A neural language model that estimates incremental processing complexity
☆40 · Oct 27, 2021 · Updated 4 years ago
alumae / voxlingua107_sb
VoxLingua107 recipe for SpeechBrain
☆13 · Jul 3, 2021 · Updated 5 years ago
doubanius / mastodon
Mastodon server running for the Doubanius Tertius project
☆10 · Apr 4, 2022 · Updated 4 years ago
yanggeng1995 / vae_tacotron
☆51 · Feb 15, 2019 · Updated 7 years ago
BeckyMarvin / LM_syneval
Code for replicating the work in "Targeted Syntactic Evaluation of Language Models." EMNLP 2018.
☆44 · Apr 25, 2020 · Updated 6 years ago
archiki / ASR-Accent-Analysis
Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆15 · Jun 27, 2020 · Updated 6 years ago
MathieuRita / Lazimpa
Code for the paper LazImpa: Lazy and Impatient neural agents learn to communicate efficiently. Mathieu Rita, Rahma Chaabouni and Emmanuel…
☆17 · Nov 21, 2020 · Updated 5 years ago
jayelm / emergent-generalization
Emergent Communication of Generalizations, NeurIPS 2021
☆13 · Sep 29, 2021 · Updated 4 years ago
heshenghuan / python-KNN
Python implementation of the K nearest neighbors algorithm and kd-tree
☆13 · Oct 24, 2016 · Updated 9 years ago
espnet / icassp2020-tts
ESPnet-TTS Audio Sample HP
☆21 · Oct 25, 2019 · Updated 6 years ago
30stomercury / hmm-backprop
Fast and differentiable hidden Markov model in C++
☆19 · Jan 20, 2023 · Updated 3 years ago