theblackcat102 / edgedictLinks
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
☆293Updated 4 years ago
Alternatives and similar repositories for edgedict
Users that are interested in edgedict are comparing it to the libraries listed below
Sorting:
- End-to-end speech recognition using RNN Transducers in Tensorflow 2.0☆248Updated 5 months ago
- Towards hot directions in industrial end to end speech recognition☆330Updated 4 years ago
- ☆262Updated 3 years ago
- DeepSpeech based forced alignment tool☆239Updated 5 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆180Updated last year
- Segment an audio file and obtain utterance alignments. (Python package)☆343Updated last year
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆232Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated 2 years ago
- INTERSPEECH 2019 Tutorial Materials☆194Updated 4 years ago
- A CRF-based ASR Toolkit☆359Updated last month
- Large, modern dataset for speech recognition☆709Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆427Updated 2 years ago
- ☆276Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning.☆201Updated 6 years ago
- Open tools and data for cloudless automatic speech recognition☆447Updated 4 years ago
- Moved to https://github.com/k2-fsa/icefall☆146Updated 3 years ago
- CUDA-Warp RNN-Transducer☆216Updated 2 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Updated 4 years ago
- A pure python module for reading and writing kaldi ark files☆267Updated 9 months ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆344Updated 4 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Updated 2 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆373Updated 6 months ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆478Updated 5 years ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆473Updated last year
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆246Updated 6 years ago
- This repository is a collection of TTS Models in TFLite☆201Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Updated 4 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Updated 5 years ago