kaldi-asr / kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,292 Updated 3 months ago
Alternatives and similar repositories for kaldi
Users who are interested in kaldi compare it to the libraries listed below.
- Facebook AI Research's Automatic Speech Recognition Toolkit ☆6,444 Updated last month
- A small speech recognizer ☆4,250 Updated 2 weeks ago
- End-to-End Speech Processing Toolkit ☆9,667 Updated 2 weeks ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,… ☆2,392 Updated 3 years ago
- Open-Source Large Vocabulary Continuous Speech Recognition Engine ☆1,922 Updated 6 months ago
- Speech-to-Text-WaveNet: End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow ☆4,000 Updated 4 years ago
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth… ☆3,104 Updated 2 years ago
- DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras… ☆26,695 Updated 6 months ago
- Speech Recognition using DeepSpeech2. ☆2,136 Updated 3 years ago
- 🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks ☆2,176 Updated last year
- End-to-end Automatic Speech Recognition for Mandarin and English in Tensorflow ☆2,842 Updated 2 years ago
- A PyTorch-based Speech Toolkit ☆10,996 Updated this week
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa… ☆3,990 Updated last year
- A Python wrapper for Kaldi ☆1,032 Updated last month
- Python interface to the WebRTC Voice Activity Detector ☆2,417 Updated last year
- On-device wake word detection powered by deep learning ☆4,572 Updated this week
- Production First and Production Ready End-to-End Speech Recognition Toolkit ☆4,972 Updated 2 weeks ago
- Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework. ☆1,090 Updated last year
- Attempt at tracking the state of the art and recent results (bibliography) on speech recognition. ☆1,870 Updated 3 years ago
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten… ☆12,469 Updated 2 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) ☆10,090 Updated 2 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su… ☆1,588 Updated last year
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ☆8,895 Updated 3 weeks ago
- Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node ☆13,971 Updated 3 weeks ago
- This library provides common speech features for ASR including MFCCs and filterbank energies. ☆2,423 Updated 4 years ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial) ☆2,990 Updated 2 years ago
- A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统 ☆8,330 Updated 3 months ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ☆2,546 Updated last year
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ☆16,395 Updated this week
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models ☆1,981 Updated 2 years ago