apoorvpatne10 / fantastic-memoryLinks
Lip Reading using STCNNs and Bi-GRUs
☆11Updated 2 years ago
Alternatives and similar repositories for fantastic-memory
Users that are interested in fantastic-memory are comparing it to the libraries listed below
Sorting:
- Unsupervised speech activity detection system.☆11Updated 7 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 5 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 3 years ago
- ☆17Updated 5 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Updated 7 years ago
- Speaker Diarization is the first step in many early audio processing and aims to solve the problem ”who spoke when”. It therefore relies …☆12Updated 6 years ago
- wake word spotting with kaldi☆19Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 5 years ago
- Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…☆15Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Updated 5 years ago
- python wrap for hts engine☆14Updated 7 years ago
- A handy dataset of noises for ASR☆22Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- SpeechYOLO Interspeech 2019☆44Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 5 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Updated 8 years ago
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering☆15Updated 8 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Updated 8 years ago
- This is now the official location of the Kaldi project.☆10Updated 6 years ago
- Pronounce Arabic words☆19Updated 6 years ago
- Curriculum Vitae of Quan Wang☆15Updated last week
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆28Updated 3 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 6 years ago
- Real-time melgan based on cpu !!!☆13Updated 5 years ago
- A library of speech gadgets.☆13Updated 2 years ago