pandeydivesh15 / AVSR-Deep-SpeechView external linksLinks
Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
☆45Aug 29, 2017Updated 8 years ago
Alternatives and similar repositories for AVSR-Deep-Speech
Users that are interested in AVSR-Deep-Speech are comparing it to the libraries listed below
Sorting:
- Audio Visual Speech Recognition☆23Aug 9, 2017Updated 8 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Jan 22, 2021Updated 5 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17May 15, 2015Updated 10 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Dec 8, 2019Updated 6 years ago
- ☆17Apr 8, 2016Updated 9 years ago
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆16Aug 31, 2017Updated 8 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- Audio-Visual Speech Recognition using Deep Learning☆61Nov 14, 2018Updated 7 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- A C++ library for parsing and manipulating JSGF grammar files.☆14Feb 13, 2024Updated 2 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Nov 19, 2022Updated 3 years ago
- A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer library, written in Flutter.☆11Nov 18, 2025Updated 3 months ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Speaker Recognition application using fast-forward NN☆16Jun 14, 2012Updated 13 years ago
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Dec 25, 2019Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- 24-hour Automatic Speech Recognition☆27Jun 4, 2021Updated 4 years ago
- Free noise reduction of speech signals☆12Jul 26, 2016Updated 9 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- Speaker recognition/identification system in Python. Python3 port.☆14May 2, 2015Updated 10 years ago
- Code of paper "Combining range and direction for improved localization" presented at ICASSP2018☆10Apr 20, 2018Updated 7 years ago
- ☆14May 7, 2019Updated 6 years ago
- ☆11Sep 5, 2025Updated 5 months ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- An automatic speech recognition environment for Icelandic based on Kaldi☆14Oct 12, 2017Updated 8 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Barista is an open-source framework for concurrent speech processing.☆36Mar 19, 2014Updated 11 years ago